notbugAs an Amazon Associate I earn from qualifying purchases.
Want a good read? Try FreeBSD Mastery: Jails (IT Mastery Book 15)
Want a good monitor light? See my photosAll times are UTC
Ukraine

Bot filter coming soon

To deter bots pegging the database CPU to 100%, a bot testing filter to be added to the website. This should not affect newsfeeds etc. Anubis seems light-weight - it is already in use within the FreeBSD Project. This notice is just a heads up in case you see something odd. This notice will be updated after Anubis is installed.

Port details on branch 2022Q2
ucto Advanced rule-based (regular-expression) and unicode-aware tokenizer
0.24.1_2 textproc on this many watch lists=0 search for ports that depend on this port Find issues related to this port Report an issue related to this port View this port on Repology. pkg-fallout 0.24.1_2Version of this port present on the latest quarterly branch.
Maintainer: yuri@FreeBSD.org search for ports maintained by this maintainer
Port Added: 2022-04-24 04:16:20
Last Update: 2022-04-10 19:47:23
Commit Hash: 035e778
License: APACHE20
WWW:
https://languagemachines.github.io/ucto/
Description:
Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. WWW: https://languagemachines.github.io/ucto/
Homepage    cgit ¦ Codeberg ¦ GitHub ¦ GitLab ¦ SVNWeb - no subversion history for this port

Manual pages:
FreshPorts has no man page information for this port.
pkg-plist: as obtained via: make generate-plist
Expand this list (19 items)
Collapse this list.
  1. @ldconfig
  2. /usr/local/share/licenses/ucto-0.24.1_2/catalog.mk
  3. /usr/local/share/licenses/ucto-0.24.1_2/LICENSE
  4. /usr/local/share/licenses/ucto-0.24.1_2/APACHE20
  5. bin/ucto
  6. include/ucto/my_textcat.h
  7. include/ucto/setting.h
  8. include/ucto/tokenize.h
  9. lib/libucto.so
  10. lib/libucto.so.6
  11. lib/libucto.so.6.0.0
  12. libdata/pkgconfig/ucto.pc
  13. man/man1/ucto.1.gz
  14. share/ucto/textcat.cfg
  15. share/ucto/textcat_alt.cfg
  16. share/ucto/textcat_normal.cfg
  17. @owner
  18. @group
  19. @mode
Collapse this list.
Dependency lines:
  • ucto>0:textproc/ucto
To install the port:
cd /usr/ports/textproc/ucto/ && make install clean
To add the package, run one of these commands:
  • pkg install textproc/ucto
  • pkg install ucto
NOTE: If this package has multiple flavors (see below), then use one of them instead of the name specified above.
PKGNAME: ucto
Flavors: there is no flavor information for this port.
distinfo:
TIMESTAMP = 1640970217 SHA256 (LanguageMachines-ucto-v0.24.1_GH0.tar.gz) = f386c3a1f000255153c52044e64257789b301428f525711aeaccfc020ff38827 SIZE (LanguageMachines-ucto-v0.24.1_GH0.tar.gz) = 399511

No package information for this port in our database
Sometimes this happens. Not all ports have packages. Perhaps there is a build error. Check the fallout link: pkg-fallout
Dependencies
NOTE: FreshPorts displays only information on required and default dependencies. Optional dependencies are not covered.
Build dependencies:
  1. autoconf-archive>0 : devel/autoconf-archive
  2. uctodata>0 : textproc/uctodata
  3. gmake>=4.3 : devel/gmake
  4. pkgconf>=1.3.0_1 : devel/pkgconf
  5. autoconf>=2.69 : devel/autoconf
  6. automake>=1.16.1 : devel/automake
  7. libtoolize : devel/libtool
Runtime dependencies:
  1. uctodata>0 : textproc/uctodata
Library dependencies:
  1. libexttextcat-2.0.so : textproc/libexttextcat
  2. libfolia.so : textproc/libfolia
  3. libicuio.so : devel/icu
  4. libticcutils.so : devel/ticcutils
  5. libxml2.so : textproc/libxml2
  6. libedit.so.0 : devel/libedit
  7. libreadline.so.8 : devel/readline
There are no ports dependent upon this port

Configuration Options:
No options to configure
Options name:
textproc_ucto
USES:
autoreconf compiler:c++11-lang gmake gnome libedit libtool pkgconfig readline
FreshPorts was unable to extract/find any pkg message
Master Sites:
Expand this list (1 items)
Collapse this list.
  1. https://codeload.github.com/LanguageMachines/ucto/tar.gz/v0.24.1?dummy=/
Collapse this list.

Number of commits found: 1

Commit History - (may be incomplete: for full details, see links to repositories near top of page)
CommitCreditsLog message
0.24.1_2
10 Apr 2022 19:47:23
commit hash: 035e778ee53345464c2d7466a3895e17073ad1f6commit hash: 035e778ee53345464c2d7466a3895e17073ad1f6commit hash: 035e778ee53345464c2d7466a3895e17073ad1f6commit hash: 035e778ee53345464c2d7466a3895e17073ad1f6 files touched by this commit
Charlie Li (vishwin) search for other commits by this committer
textproc/libxml2: bump all LIB_DEPENDS consumers

This is a separate and direct commit to quarterly as PORTREVISIONs
may not match from main.

PR: 262853, 262940, 262877
Approved by: fluffy (mentor)

Number of commits found: 1