notbugAs an Amazon Associate I earn from qualifying purchases.
Want a good read? Try FreeBSD Mastery: Jails (IT Mastery Book 15)
Want a good monitor light? See my photosAll times are UTC
Ukraine

Bot filter coming soon

To deter bots pegging the database CPU to 100%, a bot testing filter to be added to the website. This should not affect newsfeeds etc. Anubis seems light-weight - it is already in use within the FreeBSD Project. This notice is just a heads up in case you see something odd. This notice will be updated after Anubis is installed.

Port details on branch 2022Q4
py-ocrmypdf Adds an OCR text layer to scanned PDF files
13.7.0 textproc on this many watch lists=0 search for ports that depend on this port Find issues related to this port Report an issue related to this port View this port on Repology. pkg-fallout 16.10.4Version of this port present on the latest quarterly branch.
Maintainer: kai@FreeBSD.org search for ports maintained by this maintainer
Port Added: 2022-10-22 17:39:20
Last Update: 2022-10-22 17:35:14
Commit Hash: 2260bf2
Also Listed In: python
License: MPL20
WWW:
https://github.com/ocrmypdf/OCRmyPDF
Description:
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched or copy-pasted. Main features: * Generates a searchable PDF/A file from a regular PDF * Places OCR text accurately below the image to ease copy / paste * Keeps the exact resolution of the original embedded images * When possible, inserts OCR information as a "lossless" operation without disrupting any other content * Optimizes PDF images, often producing files smaller than the input file * If requested deskews and/or cleans the image before performing OCR * Validates input and output files * Distributes work across all available CPU cores * Uses Tesseract OCR engine to recognize more than 100 languages * Scales properly to handle files with thousands of pages * Battle-tested on millions of PDFs
Homepage    cgit ¦ Codeberg ¦ GitHub ¦ GitLab ¦ SVNWeb - no subversion history for this port

Manual pages:
FreshPorts has no man page information for this port.
pkg-plist: as obtained via: make generate-plist
There is no configure plist information for this port.
Dependency lines:
  • ${PYTHON_PKGNAMEPREFIX}ocrmypdf>0:textproc/py-ocrmypdf@${PY_FLAVOR}
To install the port:
cd /usr/ports/textproc/py-ocrmypdf/ && make install clean
To add the package, run one of these commands:
  • pkg install textproc/py-ocrmypdf
  • pkg install py39-ocrmypdf
NOTE: If this package has multiple flavors (see below), then use one of them instead of the name specified above.
NOTE: This is a Python port. Instead of py39-ocrmypdf listed in the above command, you can pick from the names under the Packages section.
PKGNAME: py39-ocrmypdf
Package flavors (<flavor>: <package>)
  • py39: py39-ocrmypdf
distinfo:
TIMESTAMP = 1665938750 SHA256 (ocrmypdf-13.7.0.tar.gz) = 45fa226f6753f6e0be1e6304d3363a6d8047bb4cb0cf0d25728c3b9c9a0bff40 SIZE (ocrmypdf-13.7.0.tar.gz) = 7911229

Packages (timestamps in pop-ups are UTC):
py39-ocrmypdf
ABIaarch64amd64armv6armv7i386powerpcpowerpc64powerpc64le
FreeBSD:13:latest--------
FreeBSD:13:quarterly------15.4.4-
FreeBSD:14:latest-------14.2.1
FreeBSD:14:quarterly------15.4.4-
FreeBSD:15:latest--n/a-n/a---
 

py311-ocrmypdf
ABIaarch64amd64armv6armv7i386powerpcpowerpc64powerpc64le
FreeBSD:13:latest16.10.116.10.4-16.10.116.10.1---
FreeBSD:13:quarterly16.10.016.10.1-16.10.016.10.4---
FreeBSD:14:latest16.10.116.10.4-16.10.116.10.4---
FreeBSD:14:quarterly16.10.116.10.4-16.10.116.10.4---
FreeBSD:15:latest16.10.116.10.1n/a16.10.1n/a---
 

Dependencies
NOTE: FreshPorts displays only information on required and default dependencies. Optional dependencies are not covered.
Build dependencies:
  1. py39-setuptools_scm>=0 : devel/py-setuptools_scm@py39
  2. py39-setuptools_scm_git_archive>=0 : devel/py-setuptools_scm_git_archive@py39
  3. py39-setuptools>=63.1.0 : devel/py-setuptools@py39
  4. python3.9 : lang/python39
Runtime dependencies:
  1. py39-coloredlogs>=14.0 : devel/py-coloredlogs@py39
  2. py39-img2pdf>=0.3.0 : graphics/py-img2pdf@py39
  3. py39-packaging>=20 : devel/py-packaging@py39
  4. py39-pillow>=8.2.0 : graphics/py-pillow@py39
  5. py39-tqdm>=4 : misc/py-tqdm@py39
  6. py39-reportlab>=3.5.66 : print/py-reportlab@py39
  7. py39-pdfminer.six>=20191110 : textproc/py-pdfminer.six@py39
  8. py39-pikepdf>=4.0.0 : textproc/py-pikepdf@py39
  9. py39-pluggy>=0.13.0 : devel/py-pluggy@py39
  10. tesseract : graphics/tesseract
  11. pngquant : graphics/pngquant
  12. unpaper : graphics/unpaper
  13. ghostscript9-agpl-base>=9.16_2 : print/ghostscript9-agpl-base
  14. py39-setuptools>=63.1.0 : devel/py-setuptools@py39
  15. python3.9 : lang/python39
  16. py39-typing-extensions>=4 : devel/py-typing-extensions@py39
There are no ports dependent upon this port

Configuration Options:
===> The following configuration options are available for py39-ocrmypdf-13.7.0: PNGQUANT=on: Optimizes the encoding of PNG-style images in PDFs UNPAPER=on: Deskew and clean up pages before OCR processing ===> Use 'make config' to modify these settings
Options name:
textproc_py-ocrmypdf
USES:
ghostscript:run python:3.7+ shebangfix
FreshPorts was unable to extract/find any pkg message
Master Sites:
Expand this list (2 items)
Collapse this list.
  1. https://files.pythonhosted.org/packages/source/o/ocrmypdf/
  2. https://pypi.org/packages/source/o/ocrmypdf/
Collapse this list.

Number of commits found: 1

Commit History - (may be incomplete: for full details, see links to repositories near top of page)
CommitCreditsLog message
13.7.0
22 Oct 2022 17:35:14
commit hash: 2260bf29fb3265f19eaf235b3ee8cf732c51b3ebcommit hash: 2260bf29fb3265f19eaf235b3ee8cf732c51b3ebcommit hash: 2260bf29fb3265f19eaf235b3ee8cf732c51b3ebcommit hash: 2260bf29fb3265f19eaf235b3ee8cf732c51b3eb files touched by this commit
Kai Knoblich (kai) search for other commits by this committer
textproc/py-ocrmypdf: Update to 13.7.0

There is already a new major release (= 14.x) out, but the build system
has been changed here, which therefore takes a little more time to
implement.  So use the last release of the 13.x series for a while.

Changelog since 13.6.0:

https://github.com/ocrmypdf/OCRmyPDF/blob/v13.7.0/docs/release_notes.rst

MFH:		2022Q4
(cherry picked from commit 96b0bf14075f83df424fb862141aff00837a79a8)

Number of commits found: 1