All times are UTC
Port details
py-tokenizers Fast state-of-the-art tokenizers optimized for research and production
Version: 0.21.0
Category: textproc
Watch lists: 2
Version on the latest quarterly branch: 0.20.0_2
Maintainer: tagattie@FreeBSD.org
Port Added: 2024-02-12 08:36:07
Last Update: 2024-12-06 02:24:32
Commit Hash: 2b6d4bc
People watching this port also watch: jdictionary, py311-Automat, py311-python-gdsii, py39-PyOpenGL, p5-Sane
Also Listed In: python
License: APACHE20
WWW:
https://github.com/huggingface/tokenizers
Description:
Provides an implementation of today's most used tokenizers, with a focus on performance and versatility.

Main features:
- Train new vocabularies and tokenize, using today's most used tokenizers.
- Extremely fast (both training and tokenization), thanks to the Rust implementation. Takes less than 20 seconds to tokenize a GB of text on a server's CPU.
- Easy to use, but also extremely versatile.
- Designed for research and production.
- Normalization comes with alignments tracking. It's always possible to get the part of the original sentence that corresponds to a given token.
- Does all the pre-processing: Truncate, Pad, add the special tokens your model needs.
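The alignment tracking mentioned in the description can be illustrated with a small pure-Python sketch. This is not the tokenizers library API: the function name `tokenize_with_offsets` and the whitespace splitting rule are assumptions made only for illustration. It shows the idea that a token may be normalized (here, lowercased) while its offsets still point at the matching span of the original text.

```python
import re

def tokenize_with_offsets(text):
    """Split on whitespace, lowercase each token, keep (start, end) offsets.

    Hypothetical helper for illustration; the real library does far more.
    """
    return [(m.group().lower(), m.span()) for m in re.finditer(r"\S+", text)]

original = "Hello  World, TOKENIZERS!"
encoded = tokenize_with_offsets(original)

for token, (start, end) in encoded:
    # The normalized token differs from the source text, but the offsets
    # recover the exact original substring.
    print(token, (start, end), repr(original[start:end]))
```

Because the offsets refer to the unnormalized input, slicing `original[start:end]` always returns the source text for a token, regardless of what normalization did to it.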

Manual pages:
FreshPorts has no man page information for this port.
pkg-plist: as obtained via: make generate-plist
There is no configure plist information for this port.
Dependency lines:
  • ${PYTHON_PKGNAMEPREFIX}tokenizers>0:textproc/py-tokenizers@${PY_FLAVOR}
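As a rough illustration of how the dependency line above expands for a given flavor, the following sketch substitutes the two variables with Python's `string.Template`. It assumes that `PYTHON_PKGNAMEPREFIX` is the flavor name followed by a hyphen (consistent with the `py311-tokenizers` package name listed on this page); the helper name `expand_dependency` is hypothetical, and the ports framework performs this expansion itself in make.

```python
from string import Template

# Dependency line exactly as shown on the page.
DEP_LINE = Template(
    "${PYTHON_PKGNAMEPREFIX}tokenizers>0:textproc/py-tokenizers@${PY_FLAVOR}"
)

def expand_dependency(flavor):
    """Expand the dependency line for one flavor.

    Assumption: the package-name prefix is the flavor plus a hyphen.
    """
    return DEP_LINE.substitute(
        PYTHON_PKGNAMEPREFIX=f"{flavor}-",
        PY_FLAVOR=flavor,
    )

expanded = expand_dependency("py311")
print(expanded)
```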
To install the port:
cd /usr/ports/textproc/py-tokenizers/ && make install clean
To add the package, run one of these commands:
  • pkg install textproc/py-tokenizers
  • pkg install py311-tokenizers
NOTE: If this package has multiple flavors (see below), then use one of them instead of the name specified above.
NOTE: This is a Python port. Instead of py311-tokenizers listed in the above command, you can pick from the names under the Packages section.
PKGNAME: py311-tokenizers
Package flavors (<flavor>: <package>)
  • py311: py311-tokenizers
distinfo:
TIMESTAMP = 1733450061
SHA256 (tokenizers-0.21.0.tar.gz) = ee0894bf311b75b0c03079f33859ae4b2334d675d4e93f5a4132e1eae2834fe4
SIZE (tokenizers-0.21.0.tar.gz) = 343021

SHA256 (rust/crates/aho-corasick-1.1.3.crate) = 8e60d3430d3a69478ad0993f19238d2df97c507009a52b3c10addcd7f6bcb916 SIZE (rust/crates/aho-corasick-1.1.3.crate) = 183311 SHA256 (rust/crates/anstream-0.6.18.crate) = 8acc5369981196006228e28809f761875c0327210a891e941f4c683b3a99529b SIZE (rust/crates/anstream-0.6.18.crate) = 29681 SHA256 (rust/crates/anstyle-1.0.10.crate) = 55cc3b69f167a1ef2e161439aa98aed94e6028e5f9a59be9a6ffb47aef1651f9 SIZE (rust/crates/anstyle-1.0.10.crate) = 15725 SHA256 (rust/crates/anstyle-parse-0.2.6.crate) = 3b2d16507662817a6a20a9ea92df6652ee4f94f914589377d69f3b21bc5798a9 SIZE (rust/crates/anstyle-parse-0.2.6.crate) = 22343 SHA256 (rust/crates/anstyle-query-1.1.2.crate) = 79947af37f4177cfead1110013d678905c37501914fba0efea834c3fe9a8d60c SIZE (rust/crates/anstyle-query-1.1.2.crate) = 9969 SHA256 (rust/crates/anstyle-wincon-3.0.6.crate) = 2109dbce0e72be3ec00bed26e6a7479ca384ad226efdd66db8fa2e3a38c83125 SIZE (rust/crates/anstyle-wincon-3.0.6.crate) = 12271 SHA256 (rust/crates/autocfg-1.4.0.crate) = ace50bade8e6234aa140d9a2f552bbee1db4d353f69b8217bc503490fc1a9f26 SIZE (rust/crates/autocfg-1.4.0.crate) = 17712 SHA256 (rust/crates/base64-0.13.1.crate) = 9e1b586273c5702936fe7b7d6896644d8be71e6314cfe09d3167c95f712589e8 SIZE (rust/crates/base64-0.13.1.crate) = 61002 SHA256 (rust/crates/bitflags-1.3.2.crate) = bef38d45163c2f1dde094a7dfd33ccf595c92905c8f8f4fdc18d06fb1037718a SIZE (rust/crates/bitflags-1.3.2.crate) = 23021 SHA256 (rust/crates/bitflags-2.6.0.crate) = b048fb63fd8b5923fc5aa7b340d8e156aec7ec02f0c78fa8a6ddc2613f6f71de SIZE (rust/crates/bitflags-2.6.0.crate) = 45357 SHA256 (rust/crates/bumpalo-3.16.0.crate) = 79296716171880943b8470b5f8d03aa55eb2e645a4874bdbb28adb49162e012c SIZE (rust/crates/bumpalo-3.16.0.crate) = 85677 SHA256 (rust/crates/byteorder-1.5.0.crate) = 1fd0f2584146f6f2ef48085050886acf353beff7305ebd1ae69500e27c67f64b SIZE (rust/crates/byteorder-1.5.0.crate) = 23288 SHA256 (rust/crates/cc-1.2.1.crate) = 
fd9de9f2205d5ef3fd67e685b0df337994ddd4495e2a28d185500d0e1edfea47 SIZE (rust/crates/cc-1.2.1.crate) = 92731 SHA256 (rust/crates/cfg-if-1.0.0.crate) = baf1de4339761588bc0619e3cbc0120ee582ebb74b53b4efbf79117bd2da40fd SIZE (rust/crates/cfg-if-1.0.0.crate) = 7934 SHA256 (rust/crates/colorchoice-1.0.3.crate) = 5b63caa9aa9397e2d9480a9b13673856c78d8ac123288526c37d7839f2a86990 SIZE (rust/crates/colorchoice-1.0.3.crate) = 7923 SHA256 (rust/crates/console-0.15.8.crate) = 0e1f83fc076bd6dd27517eacdf25fef6c4dfe5f1d7448bafaaf3a26f13b5e4eb SIZE (rust/crates/console-0.15.8.crate) = 36364 SHA256 (rust/crates/crossbeam-deque-0.8.5.crate) = 613f8cc01fe9cf1a3eb3d7f488fd2fa8388403e97039e2f73692932e291a770d SIZE (rust/crates/crossbeam-deque-0.8.5.crate) = 21726 SHA256 (rust/crates/crossbeam-epoch-0.9.18.crate) = 5b82ac4a3c2ca9c3460964f020e1402edd5753411d7737aa39c3714ad1b5420e SIZE (rust/crates/crossbeam-epoch-0.9.18.crate) = 46875 SHA256 (rust/crates/crossbeam-utils-0.8.20.crate) = 22ec99545bb0ed0ea7bb9b8e1e9122ea386ff8a48c0922e43f36d45ab09e0e80 SIZE (rust/crates/crossbeam-utils-0.8.20.crate) = 42487 SHA256 (rust/crates/darling-0.20.10.crate) = 6f63b86c8a8826a49b8c21f08a2d07338eec8d900540f8630dc76284be802989 SIZE (rust/crates/darling-0.20.10.crate) = 32031 SHA256 (rust/crates/darling_core-0.20.10.crate) = 95133861a8032aaea082871032f5815eb9e98cef03fa916ab4500513994df9e5 SIZE (rust/crates/darling_core-0.20.10.crate) = 65015 SHA256 (rust/crates/darling_macro-0.20.10.crate) = d336a2a514f6ccccaa3e09b02d41d35330c07ddf03a62165fcec10bb561c7806 SIZE (rust/crates/darling_macro-0.20.10.crate) = 1874 SHA256 (rust/crates/derive_builder-0.20.2.crate) = 507dfb09ea8b7fa618fcf76e953f4f5e192547945816d5358edffe39f6f94947 SIZE (rust/crates/derive_builder-0.20.2.crate) = 36661 SHA256 (rust/crates/derive_builder_core-0.20.2.crate) = 2d5bcf7b024d6835cfb3d473887cd966994907effbe9227e8c8219824d06c4e8 SIZE (rust/crates/derive_builder_core-0.20.2.crate) = 31397 SHA256 (rust/crates/derive_builder_macro-0.20.2.crate) 
= ab63b0e2bf4d5928aff72e83a7dace85d7bba5fe12dcc3c5a572d78caffd3f3c SIZE (rust/crates/derive_builder_macro-0.20.2.crate) = 6360 SHA256 (rust/crates/either-1.13.0.crate) = 60b1af1c220855b6ceac025d3f6ecdd2b7c4894bfe9cd9bda4fbb4bc7c0d4cf0 SIZE (rust/crates/either-1.13.0.crate) = 19169 SHA256 (rust/crates/encode_unicode-0.3.6.crate) = a357d28ed41a50f9c765dbfe56cbc04a64e53e5fc58ba79fbc34c10ef3df831f SIZE (rust/crates/encode_unicode-0.3.6.crate) = 45741 SHA256 (rust/crates/env_filter-0.1.2.crate) = 4f2c92ceda6ceec50f43169f9ee8424fe2db276791afde7b2cd8bc084cb376ab SIZE (rust/crates/env_filter-0.1.2.crate) = 13327 SHA256 (rust/crates/env_logger-0.11.5.crate) = e13fa619b91fb2381732789fc5de83b45675e882f66623b7d8cb4f643017018d SIZE (rust/crates/env_logger-0.11.5.crate) = 30683 SHA256 (rust/crates/errno-0.3.9.crate) = 534c5cf6194dfab3db3242765c03bbe257cf92f22b38f6bc0c58d59108a820ba SIZE (rust/crates/errno-0.3.9.crate) = 10690 SHA256 (rust/crates/esaxx-rs-0.1.10.crate) = d817e038c30374a4bcb22f94d0a8a0e216958d4c3dcde369b1439fec4bdda6e6 SIZE (rust/crates/esaxx-rs-0.1.10.crate) = 175210 SHA256 (rust/crates/fastrand-2.2.0.crate) = 486f806e73c5707928240ddc295403b1b93c96a02038563881c4a2fd84b81ac4 SIZE (rust/crates/fastrand-2.2.0.crate) = 15031 SHA256 (rust/crates/fnv-1.0.7.crate) = 3f9eec918d3f24069decb9af1554cad7c880e2da24a9afd88aca000531ab82c1 SIZE (rust/crates/fnv-1.0.7.crate) = 11266 SHA256 (rust/crates/getrandom-0.2.15.crate) = c4567c8db10ae91089c99af84c68c38da3ec2f087c3f82960bcdbf3656b6f4d7 SIZE (rust/crates/getrandom-0.2.15.crate) = 37163 SHA256 (rust/crates/heck-0.5.0.crate) = 2304e00983f87ffb38b55b444b5e3b60a884b5d30c0fca7d82fe33449bbe55ea SIZE (rust/crates/heck-0.5.0.crate) = 11517 SHA256 (rust/crates/humantime-2.1.0.crate) = 9a3a5bfb195931eeb336b2a7b4d761daec841b97f947d34394601737a7bba5e4 SIZE (rust/crates/humantime-2.1.0.crate) = 16749 SHA256 (rust/crates/ident_case-1.0.1.crate) = b9e0384b61958566e926dc50660321d12159025e767c18e043daf26b70104c39 SIZE 
(rust/crates/ident_case-1.0.1.crate) = 3492 SHA256 (rust/crates/indicatif-0.17.9.crate) = cbf675b85ed934d3c67b5c5469701eec7db22689d0a2139d856e0925fa28b281 SIZE (rust/crates/indicatif-0.17.9.crate) = 65246 SHA256 (rust/crates/indoc-2.0.5.crate) = b248f5224d1d606005e02c97f5aa4e88eeb230488bcc03bc9ca4d7991399f2b5 SIZE (rust/crates/indoc-2.0.5.crate) = 14396 SHA256 (rust/crates/is_terminal_polyfill-1.70.1.crate) = 7943c866cc5cd64cbc25b2e01621d07fa8eb2a1a23160ee81ce38704e97b8ecf SIZE (rust/crates/is_terminal_polyfill-1.70.1.crate) = 7492 SHA256 (rust/crates/itertools-0.11.0.crate) = b1c173a5686ce8bfa551b3563d0c2170bf24ca44da99c7ca4bfdab5418c3fe57 SIZE (rust/crates/itertools-0.11.0.crate) = 125074 SHA256 (rust/crates/itertools-0.12.1.crate) = ba291022dbbd398a455acf126c1e341954079855bc60dfdda641363bd6922569 SIZE (rust/crates/itertools-0.12.1.crate) = 137761 SHA256 (rust/crates/itoa-1.0.14.crate) = d75a2a4b1b190afb6f5425f10f6a8f959d2ea0b9c2b1d79553551850539e4674 SIZE (rust/crates/itoa-1.0.14.crate) = 11210 SHA256 (rust/crates/js-sys-0.3.72.crate) = 6a88f1bda2bd75b0452a14784937d796722fdebfe50df998aeb3f0b7603019a9 SIZE (rust/crates/js-sys-0.3.72.crate) = 54096 SHA256 (rust/crates/lazy_static-1.5.0.crate) = bbd2bcb4c963f2ddae06a2efc7e9f3591312473c50c6685e1f298068316e66fe SIZE (rust/crates/lazy_static-1.5.0.crate) = 14025 SHA256 (rust/crates/libc-0.2.166.crate) = c2ccc108bbc0b1331bd061864e7cd823c0cab660bbe6970e66e2c0614decde36 SIZE (rust/crates/libc-0.2.166.crate) = 760418 SHA256 (rust/crates/linux-raw-sys-0.4.14.crate) = 78b3ae25bc7c8c38cec158d1f2757ee79e9b3740fbc7ccf0e59e4b08d793fa89 SIZE (rust/crates/linux-raw-sys-0.4.14.crate) = 1826665 SHA256 (rust/crates/log-0.4.22.crate) = a7a70ba024b9dc04c27ea2f0c0548feb474ec5c54bba33a7f72f873a39d07b24 SIZE (rust/crates/log-0.4.22.crate) = 44027 SHA256 (rust/crates/macro_rules_attribute-0.2.0.crate) = 8a82271f7bc033d84bbca59a3ce3e4159938cb08a9c3aebbe54d215131518a13 SIZE (rust/crates/macro_rules_attribute-0.2.0.crate) = 15408 SHA256 
(rust/crates/macro_rules_attribute-proc_macro-0.2.0.crate) = b8dd856d451cc0da70e2ef2ce95a18e39a93b7558bedf10201ad28503f918568 SIZE (rust/crates/macro_rules_attribute-proc_macro-0.2.0.crate) = 8264 SHA256 (rust/crates/matrixmultiply-0.3.9.crate) = 9380b911e3e96d10c1f415da0876389aaf1b56759054eeb0de7df940c456ba1a SIZE (rust/crates/matrixmultiply-0.3.9.crate) = 57819 SHA256 (rust/crates/memchr-2.7.4.crate) = 78ca9ab1a0babb1e7d5695e3530886289c18cf2f87ec19a575a0abdce112e3a3 SIZE (rust/crates/memchr-2.7.4.crate) = 96670 SHA256 (rust/crates/memoffset-0.9.1.crate) = 488016bfae457b036d996092f6cb448677611ce4449e970ceaf42695203f218a SIZE (rust/crates/memoffset-0.9.1.crate) = 9032 SHA256 (rust/crates/minimal-lexical-0.2.1.crate) = 68354c5c6bd36d73ff3feceb05efa59b6acb7626617f4962be322a825e61f79a SIZE (rust/crates/minimal-lexical-0.2.1.crate) = 94841 SHA256 (rust/crates/monostate-0.1.13.crate) = 0d208407d7552cd041d8cdb69a1bc3303e029c598738177a3d87082004dc0e1e SIZE (rust/crates/monostate-0.1.13.crate) = 14073 SHA256 (rust/crates/monostate-impl-0.1.13.crate) = a7ce64b975ed4f123575d11afd9491f2e37bbd5813fbfbc0f09ae1fbddea74e0 SIZE (rust/crates/monostate-impl-0.1.13.crate) = 7302 SHA256 (rust/crates/ndarray-0.15.6.crate) = adb12d4e967ec485a5f71c6311fe28158e9d6f4bc4a447b474184d0f91a8fa32 SIZE (rust/crates/ndarray-0.15.6.crate) = 275225 SHA256 (rust/crates/ndarray-0.16.1.crate) = 882ed72dce9365842bf196bdeedf5055305f11fc8c03dee7bb0194a6cad34841 SIZE (rust/crates/ndarray-0.16.1.crate) = 291026 SHA256 (rust/crates/nom-7.1.3.crate) = d273983c5a657a70a3e8f2a01329822f3b8c8172b73826411a55751e404a0a4a SIZE (rust/crates/nom-7.1.3.crate) = 117570 SHA256 (rust/crates/num-complex-0.4.6.crate) = 73f88a1307638156682bada9d7604135552957b7818057dcef22705b4d509495 SIZE (rust/crates/num-complex-0.4.6.crate) = 30352 SHA256 (rust/crates/num-integer-0.1.46.crate) = 7969661fd2958a5cb096e56c8e1ad0444ac2bbcd0061bd28660485a44879858f SIZE (rust/crates/num-integer-0.1.46.crate) = 22331 SHA256 
(rust/crates/num-traits-0.2.19.crate) = 071dfc062690e90b734c0b2273ce72ad0ffa95f0c74596bc250dcfd960262841 SIZE (rust/crates/num-traits-0.2.19.crate) = 51631 SHA256 (rust/crates/number_prefix-0.4.0.crate) = 830b246a0e5f20af87141b25c173cd1b609bd7779a4617d6ec582abaf90870f3 SIZE (rust/crates/number_prefix-0.4.0.crate) = 6922 SHA256 (rust/crates/numpy-0.22.1.crate) = edb929bc0da91a4d85ed6c0a84deaa53d411abfb387fc271124f91bf6b89f14e SIZE (rust/crates/numpy-0.22.1.crate) = 82267 SHA256 (rust/crates/once_cell-1.20.2.crate) = 1261fe7e33c73b354eab43b1273a57c8f967d0391e80353e51f764ac02cf6775 SIZE (rust/crates/once_cell-1.20.2.crate) = 33394 SHA256 (rust/crates/onig-6.4.0.crate) = 8c4b31c8722ad9171c6d77d3557db078cab2bd50afcc9d09c8b315c59df8ca4f SIZE (rust/crates/onig-6.4.0.crate) = 32616 SHA256 (rust/crates/onig_sys-69.8.1.crate) = 7b829e3d7e9cc74c7e315ee8edb185bf4190da5acde74afd7fc59c35b1f086e7 SIZE (rust/crates/onig_sys-69.8.1.crate) = 638216 SHA256 (rust/crates/paste-1.0.15.crate) = 57c0d7b74b563b49d38dae00a0c37d4d6de9b432382b2892f0574ddcae73fd0a SIZE (rust/crates/paste-1.0.15.crate) = 18374 SHA256 (rust/crates/pkg-config-0.3.31.crate) = 953ec861398dccce10c670dfeaf3ec4911ca479e9c02154b3a215178c5f566f2 SIZE (rust/crates/pkg-config-0.3.31.crate) = 20880 SHA256 (rust/crates/portable-atomic-1.10.0.crate) = 280dc24453071f1b63954171985a0b0d30058d287960968b9b2aca264c8d4ee6 SIZE (rust/crates/portable-atomic-1.10.0.crate) = 174760 SHA256 (rust/crates/portable-atomic-util-0.2.4.crate) = d8a2f0d8d040d7848a709caf78912debcc3f33ee4b3cac47d73d1e1069e83507 SIZE (rust/crates/portable-atomic-util-0.2.4.crate) = 47043 SHA256 (rust/crates/ppv-lite86-0.2.20.crate) = 77957b295656769bb8ad2b6a6b09d897d94f05c41b069aede1fcdaa675eaea04 SIZE (rust/crates/ppv-lite86-0.2.20.crate) = 22478 SHA256 (rust/crates/proc-macro2-1.0.92.crate) = 37d3544b3f2748c54e147655edb5025752e2303145b5aefb3c3ea2c78b973bb0 SIZE (rust/crates/proc-macro2-1.0.92.crate) = 52353 SHA256 (rust/crates/pyo3-0.22.6.crate) = 
f402062616ab18202ae8319da13fa4279883a2b8a9d9f83f20dbade813ce1884 SIZE (rust/crates/pyo3-0.22.6.crate) = 546746 SHA256 (rust/crates/pyo3-build-config-0.22.6.crate) = b14b5775b5ff446dd1056212d778012cbe8a0fbffd368029fd9e25b514479c38 SIZE (rust/crates/pyo3-build-config-0.22.6.crate) = 31369 SHA256 (rust/crates/pyo3-ffi-0.22.6.crate) = 9ab5bcf04a2cdcbb50c7d6105de943f543f9ed92af55818fd17b660390fc8636 SIZE (rust/crates/pyo3-ffi-0.22.6.crate) = 69352 SHA256 (rust/crates/pyo3-macros-0.22.6.crate) = 0fd24d897903a9e6d80b968368a34e1525aeb719d568dba8b3d4bfa5dc67d453 SIZE (rust/crates/pyo3-macros-0.22.6.crate) = 8182 SHA256 (rust/crates/pyo3-macros-backend-0.22.6.crate) = 36c011a03ba1e50152b4b394b479826cad97e7a21eb52df179cd91ac411cbfbe SIZE (rust/crates/pyo3-macros-backend-0.22.6.crate) = 66312 SHA256 (rust/crates/quote-1.0.37.crate) = b5b9d34b8991d19d98081b46eacdd8eb58c6f2b201139f7c5f643cc155a633af SIZE (rust/crates/quote-1.0.37.crate) = 28558 SHA256 (rust/crates/rand-0.8.5.crate) = 34af8d1a0e25924bc5b7c43c079c942339d8f0a8b57c39049bef581b46327404 SIZE (rust/crates/rand-0.8.5.crate) = 87113 SHA256 (rust/crates/rand_chacha-0.3.1.crate) = e6c10a63a0fa32252be49d21e7709d4d4baf8d231c2dbce1eaa8141b9b127d88 SIZE (rust/crates/rand_chacha-0.3.1.crate) = 15251 SHA256 (rust/crates/rand_core-0.6.4.crate) = ec0be4795e2f6a28069bec0b5ff3e2ac9bafc99e6a9a7dc3547996c5c816922c SIZE (rust/crates/rand_core-0.6.4.crate) = 22666 SHA256 (rust/crates/rawpointer-0.2.1.crate) = 60a357793950651c4ed0f3f52338f53b2f809f32d83a07f72909fa13e4c6c1e3 SIZE (rust/crates/rawpointer-0.2.1.crate) = 7490 SHA256 (rust/crates/rayon-1.10.0.crate) = b418a60154510ca1a002a752ca9714984e21e4241e804d32555251faf8b78ffa SIZE (rust/crates/rayon-1.10.0.crate) = 180155 SHA256 (rust/crates/rayon-cond-0.3.0.crate) = 059f538b55efd2309c9794130bc149c6a553db90e9d99c2030785c82f0bd7df9 SIZE (rust/crates/rayon-cond-0.3.0.crate) = 9913 SHA256 (rust/crates/rayon-core-1.12.1.crate) = 
1465873a3dfdaa8ae7cb14b4383657caab0b3e8a0aa9ae8e04b044854c8dfce2 SIZE (rust/crates/rayon-core-1.12.1.crate) = 70701 SHA256 (rust/crates/regex-1.11.1.crate) = b544ef1b4eac5dc2db33ea63606ae9ffcfac26c1416a2806ae0bf5f56b201191 SIZE (rust/crates/regex-1.11.1.crate) = 254170 SHA256 (rust/crates/regex-automata-0.4.9.crate) = 809e8dc61f6de73b46c85f4c96486310fe304c434cfa43669d7b40f711150908 SIZE (rust/crates/regex-automata-0.4.9.crate) = 618525 SHA256 (rust/crates/regex-syntax-0.8.5.crate) = 2b15c43186be67a4fd63bee50d0303afffcef381492ebe2c5d87f324e1b8815c SIZE (rust/crates/regex-syntax-0.8.5.crate) = 357541 SHA256 (rust/crates/rustc-hash-1.1.0.crate) = 08d43f7aa6b08d49f382cde6a7982047c3426db949b1424bc4b7ec9ae12c6ce2 SIZE (rust/crates/rustc-hash-1.1.0.crate) = 9331 SHA256 (rust/crates/rustix-0.38.41.crate) = d7f649912bc1495e167a6edee79151c84b1bad49748cb4f1f1167f459f6224f6 SIZE (rust/crates/rustix-0.38.41.crate) = 378463 SHA256 (rust/crates/ryu-1.0.18.crate) = f3cb5ba0dc43242ce17de99c180e96db90b235b8a9fdc9543c96d2209116bd9f SIZE (rust/crates/ryu-1.0.18.crate) = 47713 SHA256 (rust/crates/serde-1.0.215.crate) = 6513c1ad0b11a9376da888e3e0baa0077f1aed55c17f50e7b2397136129fb88f SIZE (rust/crates/serde-1.0.215.crate) = 78527 SHA256 (rust/crates/serde_derive-1.0.215.crate) = ad1e866f866923f252f05c889987993144fb74e722403468a4ebd70c3cd756c0 SIZE (rust/crates/serde_derive-1.0.215.crate) = 57092 SHA256 (rust/crates/serde_json-1.0.133.crate) = c7fceb2473b9166b2294ef05efcb65a3db80803f0b03ef86a5fc88a2b85ee377 SIZE (rust/crates/serde_json-1.0.133.crate) = 150739 SHA256 (rust/crates/shlex-1.3.0.crate) = 0fda2ff0d084019ba4d7c6f371c95d8fd75ce3524c3cb8fb653a3023f6323e64 SIZE (rust/crates/shlex-1.3.0.crate) = 18713 SHA256 (rust/crates/smallvec-1.13.2.crate) = 3c5e1a9a646d36c3599cd173a41282daf47c44583ad367b8e6837255952e5c67 SIZE (rust/crates/smallvec-1.13.2.crate) = 35216 SHA256 (rust/crates/spm_precompiled-0.1.4.crate) = 5851699c4033c63636f7ea4cf7b7c1f1bf06d0cc03cfb42e711de5a5c46cf326 SIZE 
(rust/crates/spm_precompiled-0.1.4.crate) = 557527 SHA256 (rust/crates/strsim-0.11.1.crate) = 7da8b5736845d9f2fcb837ea5d9e2628564b3b043a70948a3f0b778838c5fb4f SIZE (rust/crates/strsim-0.11.1.crate) = 14266 SHA256 (rust/crates/syn-2.0.89.crate) = 44d46482f1c1c87acd84dea20c1bf5ebff4c757009ed6bf19cfd36fb10e92c4e SIZE (rust/crates/syn-2.0.89.crate) = 289852 SHA256 (rust/crates/target-lexicon-0.12.16.crate) = 61c41af27dd6d1e27b1b16b489db798443478cef1f06a660c96db617ba5de3b1 SIZE (rust/crates/target-lexicon-0.12.16.crate) = 26488 SHA256 (rust/crates/tempfile-3.14.0.crate) = 28cce251fcbc87fac86a866eeb0d6c2d536fc16d06f184bb61aeae11aa4cee0c SIZE (rust/crates/tempfile-3.14.0.crate) = 35065 SHA256 (rust/crates/thiserror-1.0.69.crate) = b6aaf5339b578ea85b50e080feb250a3e8ae8cfcdff9a461c9ec2904bc923f52 SIZE (rust/crates/thiserror-1.0.69.crate) = 22198 SHA256 (rust/crates/thiserror-impl-1.0.69.crate) = 4fee6c4efc90059e10f81e6d42c60a18f76588c3d74cb83a0b242a2b6c7504c1 SIZE (rust/crates/thiserror-impl-1.0.69.crate) = 18365 SHA256 (rust/crates/unicode-ident-1.0.14.crate) = adb9e6ca4f869e1180728b7950e35922a7fc6397f7b641499e8f3ef06e50dc83 SIZE (rust/crates/unicode-ident-1.0.14.crate) = 47547 SHA256 (rust/crates/unicode-normalization-alignments-0.1.12.crate) = 43f613e4fa046e69818dd287fdc4bc78175ff20331479dab6e1b0f98d57062de SIZE (rust/crates/unicode-normalization-alignments-0.1.12.crate) = 91546 SHA256 (rust/crates/unicode-segmentation-1.12.0.crate) = f6ccf251212114b54433ec949fd6a7841275f9ada20dddd2f29e9ceea4501493 SIZE (rust/crates/unicode-segmentation-1.12.0.crate) = 106323 SHA256 (rust/crates/unicode-width-0.1.14.crate) = 7dd6e30e90baa6f72411720665d41d89b9a3d039dc45b8faea1ddd07f617f6af SIZE (rust/crates/unicode-width-0.1.14.crate) = 271615 SHA256 (rust/crates/unicode-width-0.2.0.crate) = 1fc81956842c57dac11422a97c3b8195a1ff727f06e85c84ed2e8aa277c9a0fd SIZE (rust/crates/unicode-width-0.2.0.crate) = 271509 SHA256 (rust/crates/unicode_categories-0.1.1.crate) = 
39ec24b3121d976906ece63c9daad25b85969647682eee313cb5779fdd69e14e SIZE (rust/crates/unicode_categories-0.1.1.crate) = 87298 SHA256 (rust/crates/unindent-0.2.3.crate) = c7de7d73e1754487cb58364ee906a499937a0dfabd86bcb980fa99ec8c8fa2ce SIZE (rust/crates/unindent-0.2.3.crate) = 7306 SHA256 (rust/crates/utf8parse-0.2.2.crate) = 06abde3611657adf66d383f00b093d7faecc7fa57071cce2578660c9f1010821 SIZE (rust/crates/utf8parse-0.2.2.crate) = 13499 SHA256 (rust/crates/wasi-0.11.0+wasi-snapshot-preview1.crate) = 9c8d87e72b64a3b4db28d11ce29237c246188f4f51057d65a7eab63b7987e423 SIZE (rust/crates/wasi-0.11.0+wasi-snapshot-preview1.crate) = 28131 SHA256 (rust/crates/wasm-bindgen-0.2.95.crate) = 128d1e363af62632b8eb57219c8fd7877144af57558fb2ef0368d0087bddeb2e SIZE (rust/crates/wasm-bindgen-0.2.95.crate) = 43894 SHA256 (rust/crates/wasm-bindgen-backend-0.2.95.crate) = cb6dd4d3ca0ddffd1dd1c9c04f94b868c37ff5fac97c30b97cff2d74fce3a358 SIZE (rust/crates/wasm-bindgen-backend-0.2.95.crate) = 30487 SHA256 (rust/crates/wasm-bindgen-macro-0.2.95.crate) = e79384be7f8f5a9dd5d7167216f022090cf1f9ec128e6e6a482a2cb5c5422c56 SIZE (rust/crates/wasm-bindgen-macro-0.2.95.crate) = 6938 SHA256 (rust/crates/wasm-bindgen-macro-support-0.2.95.crate) = 26c6ab57572f7a24a4985830b120de1594465e5d500f24afe89e16b4e833ef68 SIZE (rust/crates/wasm-bindgen-macro-support-0.2.95.crate) = 21331 SHA256 (rust/crates/wasm-bindgen-shared-0.2.95.crate) = 65fc09f10666a9f147042251e0dda9c18f166ff7de300607007e96bdebc1068d SIZE (rust/crates/wasm-bindgen-shared-0.2.95.crate) = 7649 SHA256 (rust/crates/web-time-1.1.0.crate) = 5a6580f308b1fad9207618087a65c04e7a10bc77e02c8e84e9b00dd4b12fa0bb SIZE (rust/crates/web-time-1.1.0.crate) = 18026 SHA256 (rust/crates/windows-sys-0.52.0.crate) = 282be5f36a8ce781fad8c8ae18fa3f9beff57ec1b52cb3de0789201425d9a33d SIZE (rust/crates/windows-sys-0.52.0.crate) = 2576877 SHA256 (rust/crates/windows-sys-0.59.0.crate) = 1e38bc4d79ed67fd075bcc251a1c39b32a1776bbe92e5bef1f0bf1f8c531853b SIZE 
(rust/crates/windows-sys-0.59.0.crate) = 2387323 SHA256 (rust/crates/windows-targets-0.52.6.crate) = 9b724f72796e036ab90c1021d4780d4d3d648aca59e491e6b98e725b84e99973 SIZE (rust/crates/windows-targets-0.52.6.crate) = 6403 SHA256 (rust/crates/windows_aarch64_gnullvm-0.52.6.crate) = 32a4622180e7a0ec044bb555404c800bc9fd9ec262ec147edd5989ccd0c02cd3 SIZE (rust/crates/windows_aarch64_gnullvm-0.52.6.crate) = 435718 SHA256 (rust/crates/windows_aarch64_msvc-0.52.6.crate) = 09ec2a7bb152e2252b53fa7803150007879548bc709c039df7627cabbd05d469 SIZE (rust/crates/windows_aarch64_msvc-0.52.6.crate) = 832615 SHA256 (rust/crates/windows_i686_gnu-0.52.6.crate) = 8e9b5ad5ab802e97eb8e295ac6720e509ee4c243f69d781394014ebfe8bbfa0b SIZE (rust/crates/windows_i686_gnu-0.52.6.crate) = 880402 SHA256 (rust/crates/windows_i686_gnullvm-0.52.6.crate) = 0eee52d38c090b3caa76c563b86c3a4bd71ef1a819287c19d586d7334ae8ed66 SIZE (rust/crates/windows_i686_gnullvm-0.52.6.crate) = 475940 SHA256 (rust/crates/windows_i686_msvc-0.52.6.crate) = 240948bc05c5e7c6dabba28bf89d89ffce3e303022809e73deaefe4f6ec56c66 SIZE (rust/crates/windows_i686_msvc-0.52.6.crate) = 901163 SHA256 (rust/crates/windows_x86_64_gnu-0.52.6.crate) = 147a5c80aabfbf0c7d901cb5895d1de30ef2907eb21fbbab29ca94c5b08b1a78 SIZE (rust/crates/windows_x86_64_gnu-0.52.6.crate) = 836363 SHA256 (rust/crates/windows_x86_64_gnullvm-0.52.6.crate) = 24d5b23dc417412679681396f2b49f3de8c1473deb516bd34410872eff51ed0d SIZE (rust/crates/windows_x86_64_gnullvm-0.52.6.crate) = 435707 SHA256 (rust/crates/windows_x86_64_msvc-0.52.6.crate) = 589f6da84c646204747d1270a2a5661ea66ed1cced2631d546fdfb155959f9ec SIZE (rust/crates/windows_x86_64_msvc-0.52.6.crate) = 832564 SHA256 (rust/crates/zerocopy-0.7.35.crate) = 1b9b4fd18abc82b8136838da5d50bae7bdea537c574d8dc1a34ed098d6c166f0 SIZE (rust/crates/zerocopy-0.7.35.crate) = 152645 SHA256 (rust/crates/zerocopy-derive-0.7.35.crate) = fa4f8080344d4671fb4e831a13ad1e68092748387dfc4f55e356242fae12ce3e SIZE 
(rust/crates/zerocopy-derive-0.7.35.crate) = 37829
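The SHA256 and SIZE entries in distinfo let a downloaded distfile be checked before it is built. The sketch below shows one way such a check could be written with the standard library, assuming the entries are laid out one per line as in a port's distinfo file. The names `parse_distinfo` and `verify` are hypothetical, and the demo uses a stand-in payload rather than a real distfile; the ports framework does this verification itself during `make checksum`.

```python
import hashlib
import re

# Matches lines such as: SHA256 (name.tar.gz) = <hex>   or   SIZE (name.tar.gz) = <n>
DISTINFO_RE = re.compile(r"^(SHA256|SIZE) \((?P<file>[^)]+)\) = (?P<value>\S+)$")

def parse_distinfo(text):
    """Return {filename: {"SHA256": ..., "SIZE": ...}} from distinfo text."""
    entries = {}
    for line in text.splitlines():
        m = DISTINFO_RE.match(line.strip())
        if m:  # TIMESTAMP and blank lines are skipped
            entries.setdefault(m.group("file"), {})[m.group(1)] = m.group("value")
    return entries

def verify(data, entry):
    """Check both the byte length and the SHA-256 digest of a distfile."""
    return (len(data) == int(entry["SIZE"])
            and hashlib.sha256(data).hexdigest() == entry["SHA256"])

# Stand-in payload and a distinfo generated for it (not a real distfile).
payload = b"example distfile contents"
demo = (
    "TIMESTAMP = 0\n"
    f"SHA256 (demo.tar.gz) = {hashlib.sha256(payload).hexdigest()}\n"
    f"SIZE (demo.tar.gz) = {len(payload)}\n"
)
entries = parse_distinfo(demo)
print(verify(payload, entries["demo.tar.gz"]))         # intact file
print(verify(payload + b"x", entries["demo.tar.gz"]))  # tampered file
```

Checking the size first is cheap and rejects truncated downloads before any hashing is done.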



Packages (timestamps in pop-ups are UTC):
py311-tokenizers
ABI                   aarch64   amd64     armv6  armv7     i386      powerpc  powerpc64  powerpc64le
FreeBSD:13:latest     0.20.3_1  0.21.0    -      0.20.3_1  0.21.0    -        -          -
FreeBSD:13:quarterly  0.20.0_2  0.20.0_2  -      -         0.20.0_2  -        -          -
FreeBSD:14:latest     0.20.3_1  0.21.0    -      0.20.3_1  0.21.0    -        -          -
FreeBSD:14:quarterly  0.20.0_2  0.20.0_2  -      -         0.20.0_2  -        -          -
FreeBSD:15:latest     0.20.3_1  0.21.0    n/a    0.20.3_1  n/a       -        -          -
Dependencies
NOTE: FreshPorts displays only information on required and default dependencies. Optional dependencies are not covered.
Build dependencies:
  1. py311-maturin>=1.0<2.0 : devel/py-maturin@py311
  2. rust>=1.83.0 : lang/rust
  3. pkgconf>=1.3.0_1 : devel/pkgconf
  4. python3.11 : lang/python311
  5. py311-build>=0 : devel/py-build@py311
  6. py311-installer>=0 : devel/py-installer@py311
Test dependencies:
  1. oniguruma.pc : devel/oniguruma
  2. python3.11 : lang/python311
Runtime dependencies:
  1. py311-huggingface-hub>=0.16.4<1.0 : misc/py-huggingface-hub@py311
  2. python3.11 : lang/python311
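Dependency entries such as `py311-huggingface-hub>=0.16.4<1.0` combine a package name with lower and upper version bounds. The following sketch shows how such a range could be parsed and checked. It is deliberately simplified: it assumes plain dotted numeric versions and ignores port revisions (such as `_1`) and epochs, which the real comparison in `pkg version -t` handles. The helper names `parse_spec`, `version_key`, and `satisfies` are hypothetical.

```python
import operator
import re

OPS = {">=": operator.ge, "<=": operator.le, ">": operator.gt, "<": operator.lt}

def parse_spec(spec):
    """Split 'name>=1.0<2.0' into ('name', [('>=', '1.0'), ('<', '2.0')])."""
    m = re.search(r"[<>]", spec)
    if m is None:
        return spec, []  # no version constraint, e.g. 'python3.11'
    name, rest = spec[:m.start()], spec[m.start():]
    return name, re.findall(r"(>=|<=|>|<)([0-9.]+)", rest)

def version_key(version, width=4):
    """Turn '0.16.4' into (0, 16, 4, 0) so that '1.0' and '1.0.0' compare equal.

    Simplification: numeric dotted versions only, no revisions or epochs.
    """
    parts = [int(p) for p in version.split(".")]
    return tuple(parts + [0] * (width - len(parts)))

def satisfies(version, spec):
    """True if the version meets every bound in the spec."""
    _, constraints = parse_spec(spec)
    return all(OPS[op](version_key(version), version_key(bound))
               for op, bound in constraints)

spec = "py311-huggingface-hub>=0.16.4<1.0"
print(parse_spec(spec))
print(satisfies("0.26.2", spec))
print(satisfies("1.0", spec))
```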
This port is required by:
for Run
  1. misc/py-aider-chat
  2. misc/py-anthropic
  3. misc/py-litellm

Configuration Options:
No options to configure
Options name:
textproc_py-tokenizers
USES:
cargo python
FreshPorts was unable to extract/find any pkg message
Master Sites:
  1. https://files.pythonhosted.org/packages/source/t/tokenizers/
  2. https://pypi.org/packages/source/t/tokenizers/

Number of commits found: 16

Commit History - (may be incomplete: for full details, see links to repositories near top of page)
Commit | Credits | Log message
0.21.0
06 Dec 2024 02:24:32
commit hash: 2b6d4bc08c9f4a5634b9dca9ecb3b6fb02682033
Hiroki Tagato (tagattie)
textproc/py-tokenizers: update to 0.21.0

Changelog: https://github.com/huggingface/tokenizers/releases/tag/v0.21.0

Reported by:	Repology
0.20.3_2
01 Dec 2024 09:24:18
commit hash: 1d1fd15a3e499eed9e604dac94bab73c2fea4b2f
Mikael Urankar (mikael)
lang/rust: Bump revisions after 1.83.0

PR:		283000
0.20.3_1
08 Nov 2024 08:24:20
commit hash: ca048c2884dc9822310a87e2dfebb8b1647095e2
Mikael Urankar (mikael)
lang/rust: Bump revisions after 1.82.0

PR:		282516
0.20.3
07 Nov 2024 12:30:23
commit hash: c857714fe0171b077b877238ca80112148241977
Hiroki Tagato (tagattie)
textproc/py-tokenizers: update to 0.20.3

Changelog:
- https://github.com/huggingface/tokenizers/releases/tag/v0.20.2
- https://github.com/huggingface/tokenizers/releases/tag/v0.20.3

Reported by:	portscout
0.20.1
18 Oct 2024 05:12:30
commit hash: 1f65a995d572591f6d84d7a207e0c7cc5a7aefed
Hiroki Tagato (tagattie)
textproc/py-tokenizers: update to 0.20.1

While here, add LICENSE_FILE.

Changelog: https://github.com/huggingface/tokenizers/releases/tag/v0.20.1

Reported by:	Repology
0.20.0_2
10 Sep 2024 11:00:34
commit hash: 89ab81d0e2eca6b298b0efd3f4577d468abf18d5
Mikael Urankar (mikael)
lang/rust: Bump revisions after 1.81.0

PR:		281300
0.20.0_1
10 Sep 2024 10:58:07
commit hash: c0d07f1127793ffc97a6c1db78423d0d16bdc5ef
Mikael Urankar (mikael)
Author: Siva Mahadevan
*/*: remove STRIP_CMD calls in rust based ports

This is not needed after bc4fedc1fec0d359365c04d43be9e32bf101a50e

PR:		246993
Differential Revision:	https://reviews.freebsd.org/D46503
0.20.0_1
26 Aug 2024 08:08:01
commit hash: 7a55f9542531c541f36ae5cf3a58d6ce92ec7bff
Mikael Urankar (mikael)
lang/rust: Bump revisions after 1.80.1

PR:		280490
0.20.0
10 Aug 2024 22:35:02
commit hash: ece290008e97142c0099b1dce0194d4d999af53e
Hiroki Tagato (tagattie)
textproc/py-tokenizers: update to 0.20.0

Changelog: https://github.com/huggingface/tokenizers/releases/tag/v0.20.0

Reported by:	portscout
0.19.1_2
18 Jun 2024 10:59:14
commit hash: 21476ca8a9b5feb3485a3fa29299c0b447c90350
Mikael Urankar (mikael)
lang/rust: Bump revisions after 1.79.0

PR:		279707
0.19.1_1
13 May 2024 11:03:24
commit hash: 2322b661000b69fcc75fe68fec25e5b4e48c2a8e
Mikael Urankar (mikael)
lang/rust: Bump revisions after 1.78.0

PR:		278834
0.19.1
21 Apr 2024 08:18:00
commit hash: fbf53c1600b33ffffebd7ad72f96acd3220baed4
Hiroki Tagato (tagattie)
textproc/py-tokenizers: update to 0.19.1

Changelog:
- https://github.com/huggingface/tokenizers/releases/tag/v0.19.0
- https://github.com/huggingface/tokenizers/releases/tag/v0.19.1

Reported by:	Repology
0.15.2_2
23 Mar 2024 09:41:46
commit hash: d632fbe55e45559d03a0bef7211913964aeba45a
Mikael Urankar (mikael)
lang/rust: Bump revisions after 1.77.0

PR:		277786
0.15.2_1
19 Feb 2024 11:59:23
commit hash: 7f5a7b0e524c8d32727681a4324905e350dc433c
Mikael Urankar (mikael)
lang/rust: Bump revisions after 1.76.0

PR:		276920
0.15.2
14 Feb 2024 09:17:15
commit hash: f6fa55bd8e81236b4dfc09c6ea2e8931792369b9
Hiroki Tagato (tagattie)
textproc/py-tokenizers: update to 0.15.2

While here, enable tests.

Changelog: https://github.com/huggingface/tokenizers/releases/tag/v0.15.2

Reported by:	portscout
0.15.1
12 Feb 2024 08:34:14
commit hash: e3dfc2fad4565873d807ba820a24094b97890f98
Hiroki Tagato (tagattie)
textproc/py-tokenizers: add port: Fast state-of-the-art tokenizers optimized for
research and production

Provides an implementation of today's most used tokenizers, with a
focus on performance and versatility.

Main features:
- Train new vocabularies and tokenize, using today's most used
  tokenizers.
- Extremely fast (both training and tokenization), thanks to the Rust
  implementation. Takes less than 20 seconds to tokenize a GB of text
  on a server's CPU.
- Easy to use, but also extremely versatile.
- Designed for research and production.
- Normalization comes with alignments tracking. It's always possible
  to get the part of the original sentence that corresponds to a given
  token.
- Does all the pre-processing: Truncate, Pad, add the special tokens
  your model needs.

WWW: https://github.com/huggingface/tokenizers
