Port details |
- ucto Advanced rule-based (regular-expression) and unicode-aware tokenizer
- 0.32.1 textproc =2 0.32.1Version of this port present on the latest quarterly branch.
- Maintainer: yuri@FreeBSD.org
- Port Added: 2018-12-08 10:15:13
- Last Update: 2024-04-03 08:16:01
- Commit Hash: 28c2d29
- People watching this port, also watch:: jdictionary, py311-Automat, py311-python-gdsii, py39-PyOpenGL, p5-Sane
- License: APACHE20
- WWW:
- https://languagemachines.github.io/ucto/
- Description:
- Ucto tokenizes text files: it separates words from punctuation, and splits
sentences. It offers several other basic preprocessing steps such as changing
case that you can all use to make your text suited for further processing such
as indexing, part-of-speech tagging, or machine translation.
Ucto comes with tokenisation rules for several languages and can be easily
extended to suit other languages. It has been incorporated for tokenizing Dutch
text in Frog, our Dutch morpho-syntactic processor.
- ¦ ¦ ¦ ¦
- Manual pages:
- FreshPorts has no man page information for this port.
- pkg-plist: as obtained via:
make generate-plist - Dependency lines:
-
- To install the port:
- cd /usr/ports/textproc/ucto/ && make install clean
- To add the package, run one of these commands:
- pkg install textproc/ucto
- pkg install ucto
NOTE: If this package has multiple flavors (see below), then use one of them instead of the name specified above.- PKGNAME: ucto
- Flavors: there is no flavor information for this port.
- distinfo:
- TIMESTAMP = 1712131816
SHA256 (ucto-0.32.1.tar.gz) = 04323faddabd13d1fbdf188b77e180f6db3fca3395118943e0af13964fa8b6e5
SIZE (ucto-0.32.1.tar.gz) = 503512
Packages (timestamps in pop-ups are UTC):
- Dependencies
- NOTE: FreshPorts displays only information on required and default dependencies. Optional dependencies are not covered.
- Build dependencies:
-
- uctodata>0 : textproc/uctodata
- gmake>=4.4.1 : devel/gmake
- pkgconf>=1.3.0_1 : devel/pkgconf
- Runtime dependencies:
-
- uctodata>0 : textproc/uctodata
- Library dependencies:
-
- libexttextcat-2.0.so : textproc/libexttextcat
- libfolia.so : textproc/libfolia
- libicuio.so : devel/icu
- libticcutils.so : devel/ticcutils
- libxml2.so : textproc/libxml2
- libedit.so.0 : devel/libedit
- libreadline.so.8 : devel/readline
- There are no ports dependent upon this port
Configuration Options:
- No options to configure
- Options name:
- textproc_ucto
- USES:
- compiler:c++11-lang gmake gnome libedit libtool pathfix pkgconfig readline
- FreshPorts was unable to extract/find any pkg message
- Master Sites:
|
Commit History - (may be incomplete: for full details, see links to repositories near top of page) |
Commit | Credits | Log message |
0.32.1 03 Apr 2024 08:16:01 |
Yuri Victorovich (yuri) |
textproc/ucto: update 0.29 → 0.32.1 |
0.29_2 22 Mar 2024 22:05:28 |
Nuno Teixeira (eduardo) |
textproc/ucto: Move man pages to share/man
Approved by: portmgr (blanket) |
0.29_1 17 Nov 2023 11:46:00 |
Jan Beich (jbeich) |
devel/icu: update to 74.1
Changes: https://github.com/unicode-org/icu/releases/tag/release-74-1
Reported by: GitHub (watch releases)
PR: 274317
Exp-run by: antoine (incomplete)
Approved by: fluffy |
0.29 25 Jun 2023 04:01:33 |
Yuri Victorovich (yuri) Author: Daniel Engberg |
textproc/ucto: Use upstream release archive and minor fixes
PR: 272177 |
0.29 29 May 2023 18:33:22 |
Yuri Victorovich (yuri) |
textproc/ucto: Update 0.27 → 0.29 |
0.27_1 13 Apr 2023 00:56:37 |
Jan Beich (jbeich) |
devel/icu: update to 73.1
- Temporarily switch to GitHub auto archive (release artifacts are N/A atm)
Changes: https://github.com/unicode-org/icu/releases/tag/release-73-1
Reported by: GitHub (watch releases)
PR: 270422
Exp-run by: antoine |
0.27 24 Jan 2023 20:31:53 |
Yuri Victorovich (yuri) |
textproc/ucto: Update 0.25 → 0.27
Reported by: portscout |
0.25_2 19 Oct 2022 08:10:29 |
Jan Beich (jbeich) |
devel/icu: update to 72.1
Changes: https://github.com/unicode-org/icu/releases/tag/release-72-1
Reported by: GitHub (watch releases)
PR: 266582
Exp-run by: antoine |
07 Sep 2022 21:58:51 |
Stefan Eßer (se) |
Remove WWW entries moved into port Makefiles
Commit b7f05445c00f has added WWW entries to port Makefiles based on
WWW: lines in pkg-descr files.
This commit removes the WWW: lines of moved-over URLs from these
pkg-descr files.
Approved by: portmgr (tcberner) |
0.25_1 07 Sep 2022 21:10:59 |
Stefan Eßer (se) |
Add WWW entries to port Makefiles
It has been common practice to have one or more URLs at the end of the
ports' pkg-descr files, one per line and prefixed with "WWW:". These
URLs should point at a project website or other relevant resources.
Access to these URLs required processing of the pkg-descr files, and
they have often become stale over time. If more than one such URL was
present in a pkg-descr file, only the first one was tarnsfered into
the port INDEX, but for many ports only the last line did contain the
port specific URL to further information.
There have been several proposals to make a project URL available as
a macro in the ports' Makefiles, over time.
(Only the first 15 lines of the commit message are shown above ) |
0.25_1 12 Aug 2022 14:46:53 |
Dima Panov (fluffy) |
*/*: bump all consumers after recent boost upgrade |
0.25 23 Jul 2022 00:21:27 |
Yuri Victorovich (yuri) |
textproc/ucto: Update 0.24.1 -> 0.25
Reported by: portscout |
0.24.1_3 10 Apr 2022 19:11:41 |
Charlie Li (vishwin) |
textproc/libxml2: bump all LIB_DEPENDS consumers
This is a separate commit to facilitate easier cherry-picking for
quarterly.
PR: 262853, 262940, 262877, 263126
Approved by: fluffy (mentor) |
0.24.1_2 07 Apr 2022 00:39:10 |
Jan Beich (jbeich) |
devel/icu: update to 71.1
Changes: https://github.com/unicode-org/icu/releases/tag/release-71-1
Reported by: GitHub (watch releases)
PR: 262654
Exp-run by: antoine
Approved by: fluffy |
0.24.1_1 26 Mar 2022 08:27:27 |
Matthias Fechner (mfechner) |
textproc/libxml2: bump all dependencies
This should make sure that all dependent ports will pick
up the new version commited with a13ec21cd733f67a9fc0dc00ab45268bdc236246 |
0.24.1 31 Dec 2021 17:25:34 |
Yuri Victorovich (yuri) |
textproc/ucto: Update 0.23 -> 0.24.1
Reported by: portscout |
0.23_1 28 Oct 2021 16:37:30 |
Jan Beich (jbeich) |
devel/icu: update to 70.1
Changes: https://github.com/unicode-org/icu/releases/tag/release-70-1
Reported by: GitHub (watch releases)
PR: 258794
Exp-run by: antoine |
0.23 14 Jul 2021 03:55:50 |
Yuri Victorovich (yuri) |
textproc/ucto: Update 0.22 -> 0.23
Reported by: portscout |
0.22 21 Apr 2021 21:33:53 |
Yuri Victorovich (yuri) |
textproc/ucto: Update 0.14.1 -> 0.22 |
0.14.1_12 21 Apr 2021 21:33:51 |
Yuri Victorovich (yuri) |
devel/ticcutils: Update 0.20 -> 0.26
Reported by: portscout |
0.14.1_11 08 Apr 2021 23:33:31 |
Jan Beich (jbeich) |
devel/icu: update to 69.1
Changes: https://github.com/unicode-org/icu/releases/tag/release-69-1
Reported by: GitHub (watch releases) |
0.14.1_10 07 Apr 2021 08:09:01 |
Mathieu Arnold (mat) |
One more small cleanup, forgotten yesterday.
Reported by: lwhsu |
0.14.1_10 06 Apr 2021 14:31:07 |
Mathieu Arnold (mat) |
Remove # $FreeBSD$ from Makefiles. |
0.14.1_10 03 Nov 2020 00:56:27 |
jbeich |
devel/icu: update to 68.1
Changes: http://site.icu-project.org/download/68
ABI: https://abi-laboratory.pro/tracker/timeline/icu4c/
Reported by: GitHub (watch releases) |
0.14.1_9 23 Apr 2020 20:14:49 |
jbeich |
devel/icu: update to 67.1
Changes: http://site.icu-project.org/download/67
ABI: https://abi-laboratory.pro/tracker/timeline/icu4c/
Reported by: GitHub (watch releases) |
0.14.1_8 13 Mar 2020 09:47:17 |
jbeich |
devel/icu: update to 66.1
Changes: http://site.icu-project.org/download/66
ABI: https://abi-laboratory.pro/tracker/timeline/icu4c/ |
0.14.1_7 08 Mar 2020 21:56:37 |
jbeich |
Drop dependency on devel/openmp
- Drop if devel/llvm* was used as a substitute
Approved by: yuri, rene (earlier version)
Differential Revision: https://reviews.freebsd.org/D23020 |
0.14.1_6 03 Oct 2019 20:35:30 |
jbeich |
devel/icu: update to 65.1
Changes: http://site.icu-project.org/download/65
ABI: https://abi-laboratory.pro/tracker/timeline/icu4c/ |
0.14.1_5 21 Sep 2019 18:59:57 |
jbeich |
devel/openmp: update to 9.0.0
- Connect tests to the framework to help QA remaining patches
- Force rebuild all consumers after https://reviews.llvm.org/D59783
Changes: http://llvm.org/viewvc/llvm-project/openmp/tags/RELEASE_900/final/?view=log
ABI: https://abi-laboratory.pro/tracker/timeline/llvm_openmp/ |
0.14.1_4 26 Jul 2019 20:46:57 |
gerald |
Bump PORTREVISION for ports depending on the canonical version of GCC
as defined in Mk/bsd.default-versions.mk which has moved from GCC 8.3
to GCC 9.1 under most circumstances now after revision 507371.
This includes ports
- with USE_GCC=yes or USE_GCC=any,
- with USES=fortran,
- using Mk/bsd.octave.mk which in turn features USES=fortran, and
- with USES=compiler specifying openmp, nestedfct, c11, c++0x, c++11-lang,
c++11-lib, c++14-lang, c++17-lang, or gcc-c++11-lib
plus, everything INDEX-11 shows with a dependency on lang/gcc9 now.
PR: 238330 |
0.14.1_3 16 Jun 2019 16:31:08 |
linimon |
Now that devel/openmp has been enabled on powerpc64, fix these newly
unblocked ports on GCC-based systems:
cc1plus: error: unrecognized command line option "-std=c++11"
Approved by: portmgr (tier-2 blanket) |
0.14.1_3 08 May 2019 16:05:53 |
jbeich |
devel/openmp: phase out
- Expire after the last version without /usr/lib/libomp.so
- Drop SOVERSION for seamless transition (i.e., avoid conditionals)
PR: 236907
Approved by: bapt (maintainer)
Differential Revision: https://reviews.freebsd.org/D19767 |
0.14.1_2 09 Apr 2019 14:04:50 |
sunpoet |
Update devel/readline to 8.0
- Bump PORTREVISION of dependent ports for shlib change
Changes: https://tiswww.case.edu/php/chet/readline/CHANGES
PR: 236156
Exp-run by: antoine |
0.14.1_1 27 Mar 2019 21:11:09 |
jbeich |
devel/icu: update to 64.1
Changes: http://site.icu-project.org/download/64
ABI: https://abi-laboratory.pro/tracker/timeline/icu4c/
PR: 236325
Exp-run by: antoine
Differential Revision: https://reviews.freebsd.org/D19479 |
0.14.1 10 Dec 2018 09:03:02 |
yuri |
textproc/ucto: Update 0.14 -> 0.14.1
Reported by: GH issue notification |
0.14 08 Dec 2018 04:26:45 |
yuri |
New ports: textproc/ucto, textproc/uctodata: Advanced rule-based
(regular-expression) and unicode-aware tokenizer and its data port |