notbugAs an Amazon Associate I earn from qualifying purchases.
Want a good read? Try FreeBSD Mastery: Jails (IT Mastery Book 15)
Port details
py-ocrmypdf Adds an OCR text layer to scanned PDF files
12.7.2 textproc on this many watch lists=0 search for ports that depend on this port Find issues related to this port Report an issue related to this port View this port on Repology. pkg-fallout 12.4.0Version of this port present on the latest quarterly branch.
Maintainer: kai@FreeBSD.org search for ports maintained by this maintainer
Port Added: 2019-07-12 15:08:19
Last Update: 2021-11-07 10:03:57
Commit Hash: 4ae9102
Also Listed In: python
License: MPL20
Description:
SVNWeb : git : Homepage
pkg-plist: as obtained via: make generate-plist
There is no configure plist information for this port.
Dependency lines:
  • ${PYTHON_PKGNAMEPREFIX}ocrmypdf>0:textproc/py-ocrmypdf@${PY_FLAVOR}
To install the port:
cd /usr/ports/textproc/py-ocrmypdf/ && make install clean
To add the package, run one of these commands:
  • pkg install textproc/py-ocrmypdf
  • pkg install py38-ocrmypdf
NOTE: This is a Python port. Instead of py38-ocrmypdf listed in the above command, you can pick from the names under the Packages section.
PKGNAME: py38-ocrmypdf
Package flavors (<flavor>: <package>)
  • py38: py38-ocrmypdf
distinfo:
Packages (timestamps in pop-ups are UTC):
py37-ocrmypdf
ABIlatestquarterly
FreeBSD:11:aarch64-9.8.2
FreeBSD:11:amd64--
FreeBSD:11:armv6-9.8.2
FreeBSD:11:i386--
FreeBSD:11:mips--
FreeBSD:11:mips64--
FreeBSD:12:aarch64--
FreeBSD:12:amd64--
FreeBSD:12:armv6--
FreeBSD:12:armv7--
FreeBSD:12:i386--
FreeBSD:12:mips--
FreeBSD:12:mips64--
FreeBSD:12:powerpc64--
FreeBSD:13:aarch64--
FreeBSD:13:amd64--
FreeBSD:13:armv6--
FreeBSD:13:armv7--
FreeBSD:13:i386--
FreeBSD:13:mips--
FreeBSD:13:mips64--
FreeBSD:13:powerpc64--
FreeBSD:14:aarch64--
FreeBSD:14:amd64--
FreeBSD:14:armv6--
FreeBSD:14:armv7--
FreeBSD:14:i386--
FreeBSD:14:mips--
FreeBSD:14:mips64--
FreeBSD:14:powerpc6411.7.3-
 
py38-ocrmypdf
ABIlatestquarterly
FreeBSD:11:aarch64--
FreeBSD:11:amd6412.4.011.7.3
FreeBSD:11:armv6--
FreeBSD:11:i38612.4.011.7.3
FreeBSD:11:mips--
FreeBSD:11:mips64--
FreeBSD:12:aarch64-12.4.0
FreeBSD:12:amd6412.7.212.4.0
FreeBSD:12:armv6--
FreeBSD:12:armv7--
FreeBSD:12:i38612.7.212.4.0
FreeBSD:12:mips--
FreeBSD:12:mips64--
FreeBSD:12:powerpc64--
FreeBSD:13:aarch64-12.4.0
FreeBSD:13:amd6412.7.212.4.0
FreeBSD:13:armv6--
FreeBSD:13:armv7--
FreeBSD:13:i38612.7.212.4.0
FreeBSD:13:mips--
FreeBSD:13:mips64--
FreeBSD:13:powerpc64-11.7.3
FreeBSD:14:aarch6412.7.2-
FreeBSD:14:amd6412.7.2-
FreeBSD:14:armv6--
FreeBSD:14:armv7--
FreeBSD:14:i38612.7.2-
FreeBSD:14:mips--
FreeBSD:14:mips64--
FreeBSD:14:powerpc64--
 

Dependencies
NOTE: FreshPorts displays only information on required and default dependencies. Optional dependencies are not covered.
Build dependencies:
  1. py38-cffi>=1.9.1 : devel/py-cffi@py38
  2. py38-setuptools_scm>=0 : devel/py-setuptools_scm@py38
  3. py38-setuptools_scm_git_archive>=0 : devel/py-setuptools_scm_git_archive@py38
  4. py38-setuptools>0 : devel/py-setuptools@py38
  5. python3.8 : lang/python38
Runtime dependencies:
  1. py38-cffi>=1.9.1 : devel/py-cffi@py38
  2. py38-coloredlogs>=14.0 : devel/py-coloredlogs@py38
  3. py38-img2pdf>=0.3.0<0.5 : graphics/py-img2pdf@py38
  4. py38-pillow>=8.2.0 : graphics/py-pillow@py38
  5. py38-tqdm>=4 : misc/py-tqdm@py38
  6. py38-reportlab>=3.5.66 : print/py-reportlab@py38
  7. py38-pdfminer.six>=20191110<20211013 : textproc/py-pdfminer.six@py38
  8. py38-pikepdf>=2.10.0 : textproc/py-pikepdf@py38
  9. py38-pluggy>=0.13.0<2.0 : devel/py-pluggy@py38
  10. tesseract : graphics/tesseract
  11. pngquant : graphics/pngquant
  12. unpaper : graphics/unpaper
  13. ghostscript9-agpl-base>=9.16_2 : print/ghostscript9-agpl-base
  14. py38-setuptools>0 : devel/py-setuptools@py38
  15. python3.8 : lang/python38
  16. py38-importlib-resources>=5 : devel/py-importlib-resources@py38
Library dependencies:
  1. liblept.so : graphics/leptonica
There are no ports dependent upon this port

Configuration Options:
Options name:

USES:

FreshPorts was unable to extract/find any pkg message
Master Sites:
Expand this list (2 items)
Collapse this list.
  1. https://files.pythonhosted.org/packages/source/o/ocrmypdf/
  2. https://pypi.org/packages/source/o/ocrmypdf/
Collapse this list.

Number of commits found: 29

Commit History - (may be incomplete: see SVNWeb link above for full details)
DateByDescription
07 Nov 2021 10:03:57
 files touched by this commit commit hash:4ae910239acd614b4407f9bdd313998dd2fa30b3  12.7.2
Kai Knoblich (kai) search for other commits by this committer
textproc/py-ocrmypdf: Update to 12.7.2

Changelog since 12.4.0:

https://github.com/jbarlow83/OCRmyPDF/blob/v12.7.2/docs/release_notes.rst
13 Sep 2021 07:54:21
 files touched by this commit commit hash:c4c673f8184e6ef26d236453e1eacce4b3c2c43e  12.4.0
Kai Knoblich (kai) search for other commits by this committer
textproc/py-ocrmypdf: Update to 12.4.0

Changelog since 12.3.2:

https://github.com/jbarlow83/OCRmyPDF/blob/v12.4.0/docs/release_notes.rst
10 Aug 2021 15:42:26
 files touched by this commit commit hash:0161da44a9ea736c8d49a9ecd7d794261f34898a  12.3.2
Kai Knoblich (kai) search for other commits by this committer
textproc/py-ocrmypdf: Update to 12.3.2

Changelog since 12.2.0:

https://github.com/jbarlow83/OCRmyPDF/blob/v12.3.2/docs/release_notes.rst
05 Jul 2021 10:30:38
 files touched by this commit commit hash:705073d51245490cce8084a080106bbe2fb2ba97  12.2.0
Kai Knoblich (kai) search for other commits by this committer
textproc/py-ocrmypdf: Update to 12.2.0

Changelogs since 11.7.3:

https://github.com/jbarlow83/OCRmyPDF/blob/v12.2.0/docs/release_notes.rst

MFH:		No (feature release)
07 Apr 2021 08:09:01
 files touched by this commit commit hash:cf118ccf875508b9a1c570044c93cfcc82bd455c  11.7.3
Mathieu Arnold (mat) search for other commits by this committer
One more small cleanup, forgotten yesterday.
Reported by:	lwhsu
06 Apr 2021 14:31:07
 files touched by this commit commit hash:305f148f482daf30dcf728039d03d019f88344eb  11.7.3
Mathieu Arnold (mat) search for other commits by this committer
Remove # $FreeBSD$ from Makefiles.
24 Mar 2021 14:10:42
Original commit files touched by this commit Revision:569124  11.7.3
kai search for other commits by this committer
textproc/py-ocrmypdf: Update to 11.7.3

Changelog since 11.6.2:

https://github.com/jbarlow83/OCRmyPDF/blob/v11.7.3/docs/release_notes.rst
27 Feb 2021 17:53:36
Original commit files touched by this commit Revision:566721  11.6.2
kai search for other commits by this committer
textproc/py-ocrmypdf: Update to 11.6.2

Changelog since 11.6.0:

https://github.com/jbarlow83/OCRmyPDF/blob/v11.6.2/docs/release_notes.rst
01 Feb 2021 12:53:18
Original commit files touched by this commit Revision:563575  11.6.0
kai search for other commits by this committer
textproc/py-ocrmypdf: Update to 11.6.0

Changelog since 11.3.1:

https://github.com/jbarlow83/OCRmyPDF/blob/v11.6.0/docs/release_notes.rst
30 Oct 2020 07:28:15
Original commit files touched by this commit Revision:553669  11.3.1
kai search for other commits by this committer
textproc/py-ocrmypdf: Update to 11.3.1

* The license was changed by upstream with the 11.0.0 release.
  Adapt LICENSE accordingly.

Changelog since 10.3.1:

https://github.com/jbarlow83/OCRmyPDF/blob/v11.3.1/docs/release_notes.rst
29 Jul 2020 06:21:41
Original commit files touched by this commit Revision:543648  10.3.1
kai search for other commits by this committer
textproc/py-ocrmypdf: Update to 10.3.1

Changelog since 10.2.0:

https://github.com/jbarlow83/OCRmyPDF/blob/v10.3.1/docs/release_notes.rst
06 Jul 2020 06:39:04
Original commit files touched by this commit Revision:541315  10.2.0
kai search for other commits by this committer
textproc/py-ocrmypdf: Update to 10.2.0

Changelog since 9.8.2:

https://github.com/jbarlow83/OCRmyPDF/blob/v10.2.0/docs/release_notes.rst
22 Jun 2020 10:08:59
Original commit files touched by this commit Revision:539800  9.8.2
kai search for other commits by this committer
textproc/py-ocrmypdf: Fix typo in COMMENT

Reported by:	Frank Ebert (via private e-mail)
09 Jun 2020 11:16:29
Original commit files touched by this commit Revision:538311  9.8.2
kai search for other commits by this committer
textproc/py-ocrmypdf: Update to 9.8.2

* Remove uneccessary pytest-cov dependency from TEST_DEPENDS while I'm here.

Changelog since 9.8.0:

https://github.com/jbarlow83/OCRmyPDF/blob/v9.8.2/docs/release_notes.rst
30 Apr 2020 08:45:34
Original commit files touched by this commit Revision:533419  9.8.0
kai search for other commits by this committer
textproc/py-ocrmypdf: Update to 9.8.0

* Pet portclippy while I'm here by placing related variables to the USES
  block.

Changelog:

https://github.com/jbarlow83/OCRmyPDF/blob/v9.8.0/docs/release_notes.rst
14 Apr 2020 11:39:58
Original commit files touched by this commit Revision:531672  9.7.1
kai search for other commits by this committer
textproc/py-ocrmypdf: Update to 9.7.1

Changelog:

https://github.com/jbarlow83/OCRmyPDF/blob/v9.7.1/docs/release_notes.rst
10 Apr 2020 10:03:01
Original commit files touched by this commit Revision:531307  9.7.0
kai search for other commits by this committer
textproc/py-ocrmypdf: Update to 9.7.0

Changelog:

https://github.com/jbarlow83/OCRmyPDF/blob/v9.7.0/docs/release_notes.rst

MFH:		2020Q2 (fix regressions with Ghostscript >= 9.52)
24 Mar 2020 19:54:57
Original commit files touched by this commit Revision:529063  9.6.0
antoine search for other commits by this committer
Convert the tree to use PY_PILLOW

With hat:	portmgr
Originally submitted by:	kai
17 Feb 2020 08:42:11
Original commit files touched by this commit Revision:526351  9.6.0
kai search for other commits by this committer
textproc/py-ocrmypdf: Update to 9.6.0

Changelog since 9.3.0:

https://github.com/jbarlow83/OCRmyPDF/blob/v9.6.0/docs/release_notes.rst

MFH:		2020Q1 (blanket: regression fix)
02 Jan 2020 19:14:49
Original commit files touched by this commit Revision:521868  9.3.0
kai search for other commits by this committer
textproc/py-ocrmypdf: Update to 9.3.0

Changelog since 9.1.1:

https://github.com/jbarlow83/OCRmyPDF/blob/v9.3.0/docs/release_notes.rst

MFH:		2020Q1
22 Nov 2019 10:20:47
Original commit files touched by this commit Revision:518137  9.1.1
kai search for other commits by this committer
textproc/py-ocrmypdf: Update to 9.1.1

Changelog since 9.0.5:

https://github.com/jbarlow83/OCRmyPDF/blob/v9.1.1/docs/release_notes.rst
13 Nov 2019 16:26:14
Original commit files touched by this commit Revision:517380  9.0.5
kai search for other commits by this committer
textproc/py-ocrmypdf: Update to 9.0.5

* Remove the workaround that coped with the files that were generated in the
  CFFI out-of-line ABI mode (through devel/py-cffi) as they are now captured
  correctly during the installation process.

Changelog since 9.0.3:

https://github.com/jbarlow83/OCRmyPDF/blob/v9.0.5/docs/release_notes.rst

MFH:		2019Q4
10 Sep 2019 20:10:53
Original commit files touched by this commit Revision:511774  9.0.3
kai search for other commits by this committer
textproc/py-ocrmypdf: Update to 9.0.3

* Introduce new default option UNPAPER after graphics/unpaper was updated to
  a more recent version in r511568.

Changelog:

https://github.com/jbarlow83/OCRmyPDF/blob/v9.0.3/docs/release_notes.rst
05 Sep 2019 16:15:10
Original commit files touched by this commit Revision:511194  9.0.2
kai search for other commits by this committer
textproc/py-ocrmypdf: Update to 9.0.2

* Set the PNGQUANT option as default and remove the related comments since
  now the issue with the incorrect inverted PNG images has been fixed.

Changelog:

https://github.com/jbarlow83/OCRmyPDF/blob/v9.0.2/docs/release_notes.rst
13 Aug 2019 19:32:17
Original commit files touched by this commit Revision:508849  9.0.1
kai search for other commits by this committer
textproc/py-ocrmypdf: Update to 9.0.1

* Make the dependency for graphics/pngquant optional as pngquant is not
  strictly required but should be provided where possible.
* Set the new option as non-default for a while because there's a bug that
  leads to incorrect inverted PNG images in conjunction with pngquant.
* Install shell auto-completions to improve user experience.
* Remove the "slow" tests from the "do-test" target. There's very little
  point to run those at downstream.

Changelog since 8.3.2:

https://github.com/jbarlow83/OCRmyPDF/blob/v9.0.1/docs/release_notes.rst
03 Aug 2019 22:21:57
Original commit files touched by this commit Revision:508034  8.3.2
kai search for other commits by this committer
textproc/py-ocrmypdf: Update to 8.3.2

Changelog:

https://github.com/jbarlow83/OCRmyPDF/blob/v8.3.2/docs/release_notes.rst
15 Jul 2019 10:14:38
Original commit files touched by this commit Revision:506655  8.3.1
kai search for other commits by this committer
textproc/py-ocrmypdf: Use defined locale in whole build process

* Set the locale globally for {CONFIGURE,MAKE}_ENV so that the locale
  doesn't need to be set explicitly for the do-test target.

* Use ECHO_CMD instead of ECHO as the latter one is already defined in
  /usr/share/mk/sys.mk, which can either be "echo" or "true" if the make
  flag -s is given.

Suggested by:	sunpoet
12 Jul 2019 17:20:33
Original commit files touched by this commit Revision:506483  8.3.1
kai search for other commits by this committer
textproc/py-ocrmypdf: Apply workaround for missing files a bit earlier

* The post-stage target runs very late in the install process and should be
   only used if the post-install target ran too early.

   Thus place the workaround that copes with the missing files to the
   post-install target where it fits better.

Suggested by:	mat
12 Jul 2019 15:08:04
Original commit files touched by this commit Revision:506461  8.3.1
kai search for other commits by this committer
New port: textproc/py-ocrmypdf

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be
searched or copy-pasted.

Main features:

* Generates a searchable PDF/A file from a regular PDF
* Places OCR text accurately below the image to ease copy / paste
* Keeps the exact resolution of the original embedded images
* When possible, inserts OCR information as a "lossless" operation without
  disrupting any other content
* Optimizes PDF images, often producing files smaller than the input file
* If requested deskews and/or cleans the image before performing OCR
* Validates input and output files
* Distributes work across all available CPU cores
* Uses Tesseract OCR engine to recognize more than 100 languages
* Scales properly to handle files with thousands of pages
* Battle-tested on millions of PDFs

WWW: https://github.com/jbarlow83/OCRmyPDF

Reviewed by:	0mp, koobs
Differential Revision:	https://reviews.freebsd.org/D20927

Number of commits found: 29