All times are UTC

Port details
py-tokenizer Tokenizer for Icelandic text
Version: 3.5.0_1
Category: textproc
Package not present on quarterly. This port was created during this quarter. It will be in the next quarterly branch but not the current one.
Maintainer: otis@FreeBSD.org
Port Added: 2025-08-27 11:15:08
Last Update: 2025-08-27 14:40:21
Commit Hash: 2d8be35
Also Listed In: python
License: MIT
WWW:
https://github.com/mideind/Tokenizer
Description:
Tokenizer: A tokenizer for Icelandic text

Tokenization is a necessary first step in many natural language processing tasks, such as word counting, parsing, spell checking, corpus generation, and statistical analysis of text.

Manual pages:
FreshPorts has no man page information for this port.
pkg-plist: as obtained via: make generate-plist
There is no configure plist information for this port.
Dependency lines:
  • ${PYTHON_PKGNAMEPREFIX}tokenizer>0:textproc/py-tokenizer@${PY_FLAVOR}
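As a rough illustration of how the dependency line above resolves for one flavor, the sketch below substitutes the two make variables and splits the result into its parts. The helper name `expand_dependency` is hypothetical, and the assumption that `PYTHON_PKGNAMEPREFIX` equals the flavor plus a hyphen (for example `py311-`, matching the package names listed on this page) is only an approximation of what the ports framework does.

```python
import re

DEP_LINE = "${PYTHON_PKGNAMEPREFIX}tokenizer>0:textproc/py-tokenizer@${PY_FLAVOR}"


def expand_dependency(line: str, flavor: str) -> dict:
    """Expand a Python-port dependency line for one flavor (hypothetical helper)."""
    # Assumption: the package name prefix is the flavor plus a hyphen, e.g. "py311-".
    variables = {"PYTHON_PKGNAMEPREFIX": f"{flavor}-", "PY_FLAVOR": flavor}
    # Replace each ${NAME} with its value from the table above.
    expanded = re.sub(r"\$\{(\w+)\}", lambda m: variables[m.group(1)], line)
    # Layout of the expanded line: <package spec>:<origin>@<flavor>
    spec, target = expanded.split(":", 1)
    origin, _, dep_flavor = target.partition("@")
    return {"spec": spec, "origin": origin, "flavor": dep_flavor}


print(expand_dependency(DEP_LINE, "py311"))
```

For the `py311` flavor this yields the package spec `py311-tokenizer>0`, which lines up with the PKGNAME shown further down the page.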
To install the port:
cd /usr/ports/textproc/py-tokenizer/ && make install clean
To add the package, run one of these commands:
  • pkg install textproc/py-tokenizer
  • pkg install py311-tokenizer
NOTE: If this package has multiple flavors (see below), then use one of them instead of the name specified above.
NOTE: This is a Python port. Instead of py311-tokenizer listed in the above command, you can pick from the names under the Packages section.
PKGNAME: py311-tokenizer
Package flavors (<flavor>: <package>)
  • py311: py311-tokenizer
distinfo:
TIMESTAMP = 1756293103
SHA256 (tokenizer-3.5.0.tar.gz) = f26694d6be85815d23db167ee9b1c9e2ed7b672cfc8b9baa607ba3aba6070727
SIZE (tokenizer-3.5.0.tar.gz) = 127323
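The distinfo entry above records a timestamp, a SHA256 digest, and a byte size for the distfile. A minimal sketch of how those three values could be parsed and checked against a downloaded file follows. The helper names `parse_distinfo` and `verify_distfile` are hypothetical, and the ports tree performs its own check (for example via `make checksum`), so this only illustrates the idea.

```python
import hashlib
import re
from datetime import datetime, timezone
from pathlib import Path

DISTINFO = """\
TIMESTAMP = 1756293103
SHA256 (tokenizer-3.5.0.tar.gz) = f26694d6be85815d23db167ee9b1c9e2ed7b672cfc8b9baa607ba3aba6070727
SIZE (tokenizer-3.5.0.tar.gz) = 127323
"""


def parse_distinfo(text: str) -> dict:
    """Parse distinfo text into a timestamp plus per-file sha256 and size."""
    info = {"timestamp": None, "files": {}}
    for raw in text.splitlines():
        line = raw.strip()
        if m := re.fullmatch(r"TIMESTAMP = (\d+)", line):
            info["timestamp"] = int(m.group(1))
        elif m := re.fullmatch(r"(SHA256|SIZE) \((.+)\) = (\w+)", line):
            key, name, value = m.groups()
            entry = info["files"].setdefault(name, {})
            entry[key.lower()] = int(value) if key == "SIZE" else value
    return info


def verify_distfile(path: Path, expected: dict) -> bool:
    """Return True if the file's size and SHA256 digest match the entry."""
    data = path.read_bytes()
    return (len(data) == expected["size"]
            and hashlib.sha256(data).hexdigest() == expected["sha256"])


info = parse_distinfo(DISTINFO)
# TIMESTAMP is seconds since the Unix epoch; print it as a UTC date.
print(datetime.fromtimestamp(info["timestamp"], tz=timezone.utc).isoformat())
```

With a local copy of the tarball, a call such as `verify_distfile(Path("tokenizer-3.5.0.tar.gz"), info["files"]["tokenizer-3.5.0.tar.gz"])` would report whether it matches the recorded digest and size. The path here is a placeholder for wherever the file was saved.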

No package information for this port in our database
Sometimes this happens. Not all ports have packages. This is doubly so for new ports, like this one.
Dependencies
NOTE: FreshPorts displays only information on required and default dependencies. Optional dependencies are not covered.
Build dependencies:
  1. py311-setuptools>0 : devel/py-setuptools@py311
  2. py311-wheel>0 : devel/py-wheel@py311
  3. python3.11 : lang/python311
  4. py311-build>=0 : devel/py-build@py311
  5. py311-installer>=0 : devel/py-installer@py311
Test dependencies:
  1. python3.11 : lang/python311
Runtime dependencies:
  1. python3.11 : lang/python311
There are no ports dependent upon this port

Configuration Options:
No options to configure
Options name:
textproc_py-tokenizer
USES:
python
FreshPorts was unable to extract/find any pkg message
Master Sites:
  1. https://files.pythonhosted.org/packages/source/t/tokenizer/
  2. https://pypi.org/packages/source/t/tokenizer/

Number of commits found: 2

Commit History - (may be incomplete: for full details, see links to repositories near top of page)
Commit | Credits | Log message
3.5.0_1
27 Aug 2025 14:40:21
commit hash: 2d8be352aaceb6f5195f45d2d94f92026573300e
Juraj Lutter (otis)
textproc/py-tokenizer: Correct pkg-descr

A text for different package has slipped through and made it
to pkg-descr of this package.

Put the correct description instead.
3.5.0
27 Aug 2025 11:13:40
commit hash: 6259f7882a8dc19deaf79bbbb7d28c2fce54aa34
Juraj Lutter (otis)
textproc/py-tokenizer: Add new port

Add textproc/py-tokenizer, a tokenizer for Icelandic text.