FreshPorts -- textproc/py-ocrmypdf: Adds an OCR text layer to scanned PDF files

Port details on branch 2022Q4

py-ocrmypdf Adds an OCR text layer to scanned PDF files

Version: 13.7.0
Category: textproc
Version of this port present on the latest quarterly branch: 16.10.4

Maintainer: kai@FreeBSD.org

Port Added: 2022-10-22 17:39:20

Last Update: 2022-10-22 17:35:14

Commit Hash: 2260bf2

Also Listed In: python

License: MPL20

WWW:

https://github.com/ocrmypdf/OCRmyPDF

Description:

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched or copy-pasted.

Main features:

* Generates a searchable PDF/A file from a regular PDF
* Places OCR text accurately below the image to ease copy / paste
* Keeps the exact resolution of the original embedded images
* When possible, inserts OCR information as a "lossless" operation without disrupting any other content
* Optimizes PDF images, often producing files smaller than the input file
* If requested deskews and/or cleans the image before performing OCR
* Validates input and output files
* Distributes work across all available CPU cores
* Uses Tesseract OCR engine to recognize more than 100 languages
* Scales properly to handle files with thousands of pages
* Battle-tested on millions of PDFs

Manual pages:

FreshPorts has no man page information for this port.

pkg-plist: as obtained via: make generate-plist

There is no configure plist information for this port.

Dependency lines:

${PYTHON_PKGNAMEPREFIX}ocrmypdf>0:textproc/py-ocrmypdf@${PY_FLAVOR}
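The dependency line above follows the pattern package-spec:origin@flavor, with make variables left unexpanded. As a rough illustration only, the Python sketch below expands the two variables and splits the result into its parts. The values chosen for PYTHON_PKGNAMEPREFIX ("py39-") and PY_FLAVOR ("py39") are assumptions taken from the package name listed on this page, and the names Dependency and parse_dependency are hypothetical helpers, not part of the ports framework. It handles plain ${NAME} references only, not make variable modifiers.

```python
import re
from string import Template
from typing import Mapping, NamedTuple, Optional


class Dependency(NamedTuple):
    package: str                 # e.g. "py39-ocrmypdf"
    version_spec: Optional[str]  # e.g. ">0", or None when absent
    origin: str                  # e.g. "textproc/py-ocrmypdf"
    flavor: Optional[str]        # e.g. "py39", or None when absent


# package[version-spec]:category/port[@flavor]
_DEP_RE = re.compile(
    r"^(?P<package>[^<>=:]+)"
    r"(?P<spec>[<>=]+[^:]+)?"
    r":(?P<origin>[^@:]+)"
    r"(?:@(?P<flavor>.+))?$"
)


def parse_dependency(line: str, variables: Mapping[str, str]) -> Dependency:
    """Expand ${NAME} references, then split the dependency line."""
    # string.Template understands the ${NAME} form used in the line;
    # substitute() raises KeyError if a variable is missing.
    expanded = Template(line).substitute(variables)
    match = _DEP_RE.match(expanded)
    if match is None:
        raise ValueError(f"not a dependency line: {expanded!r}")
    return Dependency(
        match["package"], match["spec"], match["origin"], match["flavor"]
    )


if __name__ == "__main__":
    # Assumed variable values for the py39 flavor shown on this page.
    dep = parse_dependency(
        "${PYTHON_PKGNAMEPREFIX}ocrmypdf>0:textproc/py-ocrmypdf@${PY_FLAVOR}",
        {"PYTHON_PKGNAMEPREFIX": "py39-", "PY_FLAVOR": "py39"},
    )
    print(dep)
```

With those assumed values, the package part expands to the same py39-ocrmypdf name that appears under PKGNAME on this page.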

To install the port:

cd /usr/ports/textproc/py-ocrmypdf/ && make install clean

To add the package, run one of these commands:

pkg install textproc/py-ocrmypdf
pkg install py39-ocrmypdf

NOTE: If this package has multiple flavors (see below), then use one of them instead of the name specified above.
NOTE: This is a Python port. Instead of py39-ocrmypdf listed in the above command, you can pick from the names under the Packages section.

PKGNAME: py39-ocrmypdf

Package flavors (<flavor>: <package>)

py39: py39-ocrmypdf

distinfo:

TIMESTAMP = 1665938750
SHA256 (ocrmypdf-13.7.0.tar.gz) = 45fa226f6753f6e0be1e6304d3363a6d8047bb4cb0cf0d25728c3b9c9a0bff40
SIZE (ocrmypdf-13.7.0.tar.gz) = 7911229
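The distinfo entry above records a Unix timestamp plus the SHA256 checksum and byte size of the distfile. A minimal Python sketch of checking a downloaded file against those values follows. The names DistInfo, parse_distinfo, and verify_distfile are hypothetical helpers introduced for illustration, and the sketch is not how the ports framework itself performs the check.

```python
import hashlib
import re
from datetime import datetime, timezone
from pathlib import Path
from typing import Dict, NamedTuple


class DistInfo(NamedTuple):
    timestamp: datetime     # TIMESTAMP, interpreted as UTC
    sha256: Dict[str, str]  # distfile name -> hex digest
    size: Dict[str, int]    # distfile name -> size in bytes


def parse_distinfo(text: str) -> DistInfo:
    """Parse distinfo text, whether on one line or split across lines."""
    ts = re.search(r"TIMESTAMP\s*=\s*(\d+)", text)
    if ts is None:
        raise ValueError("no TIMESTAMP entry found")
    sha256 = dict(
        re.findall(r"SHA256\s*\(([^)]+)\)\s*=\s*([0-9a-f]{64})", text)
    )
    size = {
        name: int(value)
        for name, value in re.findall(r"SIZE\s*\(([^)]+)\)\s*=\s*(\d+)", text)
    }
    when = datetime.fromtimestamp(int(ts.group(1)), tz=timezone.utc)
    return DistInfo(when, sha256, size)


def verify_distfile(path: Path, info: DistInfo) -> bool:
    """Return True if the file's size and SHA256 match its distinfo entry."""
    name = path.name
    if name not in info.sha256 or name not in info.size:
        return False
    # Cheap check first: a size mismatch avoids hashing the whole file.
    if path.stat().st_size != info.size[name]:
        return False
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Hash in 1 MiB chunks so large distfiles need not fit in memory.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == info.sha256[name]


if __name__ == "__main__":
    info = parse_distinfo(
        "TIMESTAMP = 1665938750 "
        "SHA256 (ocrmypdf-13.7.0.tar.gz) = "
        "45fa226f6753f6e0be1e6304d3363a6d8047bb4cb0cf0d25728c3b9c9a0bff40 "
        "SIZE (ocrmypdf-13.7.0.tar.gz) = 7911229"
    )
    print(info.timestamp.isoformat())
    print(info.size["ocrmypdf-13.7.0.tar.gz"])
```

Calling verify_distfile(Path("ocrmypdf-13.7.0.tar.gz"), info) on a local copy of the tarball would then report whether it matches the recorded size and checksum.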

Packages (timestamps in pop-ups are UTC):

py39-ocrmypdf
ABI	aarch64	amd64	armv6	armv7	i386	powerpc	powerpc64	powerpc64le
FreeBSD:13:latest	-	-	-	-	-	-	-	-
FreeBSD:13:quarterly	-	-	-	-	-	-	15.4.4	-
FreeBSD:14:latest	-	-	-	-	-	-	-	14.2.1
FreeBSD:14:quarterly	-	-	-	-	-	-	15.4.4	-
FreeBSD:15:latest	-	-	n/a	-	n/a	-	-	-

py311-ocrmypdf
ABI	aarch64	amd64	armv6	armv7	i386	powerpc	powerpc64	powerpc64le
FreeBSD:13:latest	16.10.1	16.10.4	-	16.10.1	16.10.1	-	-	-
FreeBSD:13:quarterly	16.10.0	16.10.1	-	16.10.0	16.10.4	-	-	-
FreeBSD:14:latest	16.10.1	16.10.4	-	16.10.1	16.10.4	-	-	-
FreeBSD:14:quarterly	16.10.1	16.10.4	-	16.10.1	16.10.4	-	-	-
FreeBSD:15:latest	16.10.1	16.10.1	n/a	16.10.1	n/a	-	-	-

Dependencies

NOTE: FreshPorts displays only information on required and default dependencies. Optional dependencies are not covered.

Build dependencies:

py39-setuptools_scm>=0 : devel/py-setuptools_scm@py39
py39-setuptools_scm_git_archive>=0 : devel/py-setuptools_scm_git_archive@py39
py39-setuptools>=63.1.0 : devel/py-setuptools@py39
python3.9 : lang/python39

Runtime dependencies:

py39-coloredlogs>=14.0 : devel/py-coloredlogs@py39
py39-img2pdf>=0.3.0 : graphics/py-img2pdf@py39
py39-packaging>=20 : devel/py-packaging@py39
py39-pillow>=8.2.0 : graphics/py-pillow@py39
py39-tqdm>=4 : misc/py-tqdm@py39
py39-reportlab>=3.5.66 : print/py-reportlab@py39
py39-pdfminer.six>=20191110 : textproc/py-pdfminer.six@py39
py39-pikepdf>=4.0.0 : textproc/py-pikepdf@py39
py39-pluggy>=0.13.0 : devel/py-pluggy@py39
tesseract : graphics/tesseract
pngquant : graphics/pngquant
unpaper : graphics/unpaper
ghostscript9-agpl-base>=9.16_2 : print/ghostscript9-agpl-base
py39-setuptools>=63.1.0 : devel/py-setuptools@py39
python3.9 : lang/python39
py39-typing-extensions>=4 : devel/py-typing-extensions@py39

There are no ports dependent upon this port

Configuration Options:

===> The following configuration options are available for py39-ocrmypdf-13.7.0:
     PNGQUANT=on: Optimizes the encoding of PNG-style images in PDFs
     UNPAPER=on: Deskew and clean up pages before OCR processing
===> Use 'make config' to modify these settings

Options name:

textproc_py-ocrmypdf

USES:

ghostscript:run python:3.7+ shebangfix

FreshPorts was unable to extract/find any pkg message

Number of commits found: 1

Commit History - (may be incomplete: for full details, see links to repositories near top of page)

Commit: 13.7.0, 22 Oct 2022 17:35:14

Credits: Kai Knoblich (kai)

Log message:

textproc/py-ocrmypdf: Update to 13.7.0

There is already a new major release (= 14.x) out, but the build system
has been changed here, which therefore takes a little more time to
implement.  So use the last release of the 13.x series for a while.

Changelog since 13.6.0:

https://github.com/ocrmypdf/OCRmyPDF/blob/v13.7.0/docs/release_notes.rst

MFH:		2022Q4
(cherry picked from commit 96b0bf14075f83df424fb862141aff00837a79a8)
