FreshPorts -- The Place For Ports

Port details
libtextcat: Language guessing by N-Gram-Based Text Categorization
Version: 2.2_6
Category: textproc
Maintainer: thierry@FreeBSD.org
Port Added: 04 Dec 2006 22:04:18
License: BSD3CLAUSE
Libtextcat is a library with functions that implement the classification
technique described in Cavnar & Trenkle, "N-Gram-Based Text Categorization" [1].
It was primarily developed for language guessing, a task on which it is known to
perform with near-perfect accuracy.

The central idea of the Cavnar & Trenkle technique is to calculate a
"fingerprint" of a document with an unknown category, and compare this with the
fingerprints of a number of documents of which the categories are known. The
categories of the closest matches are output as the classification. A
fingerprint is a list of the most frequent n-grams occurring in a document,
ordered by frequency. Fingerprints are compared with a simple out-of-place
metric.
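
The fingerprint and out-of-place comparison described above can be sketched in a few lines of Python. This is only an illustration of the Cavnar & Trenkle idea, not libtextcat's actual C implementation or API. The function names, the underscore padding, the maximum n-gram length of 5 and the profile size of 300 are all assumptions chosen for the example.

```python
from collections import Counter


def ngrams(text, n_max=5):
    """Yield character n-grams of length 1..n_max from each token.

    Tokens are padded with '_' on both sides (an assumption made for
    this sketch) so that word boundaries contribute n-grams too.
    """
    for token in text.lower().split():
        padded = f"_{token}_"
        for n in range(1, n_max + 1):
            for i in range(len(padded) - n + 1):
                yield padded[i:i + n]


def fingerprint(text, top=300):
    """Return the `top` most frequent n-grams, most frequent first."""
    counts = Counter(ngrams(text))
    # Ties are broken alphabetically so the ranking is deterministic.
    ranked = sorted(counts, key=lambda g: (-counts[g], g))
    return ranked[:top]


def out_of_place(doc_fp, cat_fp):
    """Sum, over the document's n-grams, how far each one's rank is
    from its rank in the category fingerprint. N-grams missing from
    the category fingerprint get a fixed maximum penalty."""
    max_penalty = len(cat_fp)
    rank = {gram: i for i, gram in enumerate(cat_fp)}
    return sum(
        abs(i - rank[gram]) if gram in rank else max_penalty
        for i, gram in enumerate(doc_fp)
    )


def classify(text, category_fps):
    """Return the category whose fingerprint is closest to the text's."""
    doc_fp = fingerprint(text)
    return min(category_fps, key=lambda c: out_of_place(doc_fp, category_fps[c]))


# Hypothetical usage: real profiles would be built from much larger samples.
profiles = {
    "english": fingerprint("the quick brown fox jumps over the lazy dog"),
    "dutch": fingerprint("de snelle bruine vos springt over de luie hond"),
}
guess = classify("the dog jumps over the fox", profiles)
```

A lower out-of-place score means a closer match, so `classify` simply picks the category with the minimum score.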

[1] The document that started it all: William B. Cavnar & John M. Trenkle (1994)
N-Gram-Based Text Categorization, <http://citeseer.ist.psu.edu/68861.html>.

WWW: http://software.wise-guys.nl/libtextcat/

To install the port: cd /usr/ports/textproc/libtextcat/ && make install clean
To add the package: pkg install libtextcat

PKGNAME: libtextcat

distinfo:

SHA256 (libtextcat-2.2.tar.gz) = 5677badffc48a8d332e345ea4fe225e3577f53fc95deeec8306000b256829655
SIZE (libtextcat-2.2.tar.gz) = 540999
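
The ports framework checks downloaded distfiles against these values itself (for example via `make checksum`). To do the same check by hand, something like the following Python sketch would work. The helper names `parse_distinfo` and `verify` are hypothetical and are not part of any FreeBSD tool.

```python
import hashlib
import os
import re


def parse_distinfo(text):
    """Parse 'SHA256 (file) = hex' and 'SIZE (file) = n' lines into
    a dict of the form {filename: {"sha256": str, "size": int}}."""
    entries = {}
    for line in text.splitlines():
        m = re.match(r"\s*(SHA256|SIZE) \((.+?)\) = (\S+)\s*$", line)
        if not m:
            continue  # skip blank lines and anything unrecognized
        key, name, value = m.groups()
        entry = entries.setdefault(name, {})
        if key == "SIZE":
            entry["size"] = int(value)
        else:
            entry["sha256"] = value.lower()
    return entries


def verify(path, expected):
    """Return True if the file at `path` matches the expected size and SHA256."""
    if os.path.getsize(path) != expected["size"]:
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read in chunks so large distfiles are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected["sha256"]


distinfo = """\
SHA256 (libtextcat-2.2.tar.gz) = 5677badffc48a8d332e345ea4fe225e3577f53fc95deeec8306000b256829655
SIZE (libtextcat-2.2.tar.gz) = 540999
"""
expected = parse_distinfo(distinfo)["libtextcat-2.2.tar.gz"]
# verify("libtextcat-2.2.tar.gz", expected) would then check a local download.
```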

Configuration Options
===> The following configuration options are available for libtextcat-2.2_6:
     DOCS=on: Build and/or install documentation
===> Use 'make config' to modify these settings

USES:
libtool

Master Sites:
  1. http://software.wise-guys.nl/download/

Number of commits found: 17

Commit History (may be incomplete)
Date | By | Description
01 Dec 2016 11:18:04
Version: 2.2_6
Revision: 427491
Committer: mat
Don't quote {} in find -exec calls.

Braces are not shell metacharacters, and they do not need to be quoted.
By the time find parses its arguments and discovers them, the quoting
will have been removed by the shell anyway.

Sponsored by:	Absolight
08 Dec 2014 21:17:00
Version: 2.2_6
Revision: 374324
Committer: bapt
Cleanup plist
03 Jul 2014 08:29:15
Version: 2.2_6
Revision: 360343
Committer: tijl
- Convert to USES=libtool
- Add DOCS option
- Remove CONFLICTS_INSTALL on libexttextcat that is no longer needed
27 Apr 2014 19:15:43
Version: 2.2_5
Revision: 352452
Committer: thierry
- Stagify;

- Fix the license.
20 Sep 2013 23:17:32
Version: 2.2_5
Revision: 327773
Committer: bapt
Add NO_STAGE all over the place in preparation for the staging support (cat:
textproc)
09 Oct 2012 23:33:13
Version: 2.2_5
Revision: 305621
Committer: cy
Allow textproc/libtextcat and textproc/libexttextcat to coexist on the same
system.

Reviewed by:	thierry, crees, bapt
Approved by:	bapt (using his portmgr and office hats), thierry
13 May 2012 14:37:29
Version: 2.2_4
Committer: thierry
Register conflict with libexttextcat.

Reported by:    avg
31 Jan 2011 19:59:52
Version: 2.2_4
Committer: thierry
Adding another header needed for LibreOffice.

Requested by:   bapt
Feature safe:   yes
27 Jan 2011 22:30:42
Version: 2.2_3
Committer: thierry
Allow some parts to be used from LibreOffice (C++ code).

Submitted by:   bapt
Feature safe:   yes
09 Dec 2010 20:51:25
Version: 2.2_2
Committer: thierry
Adding more headers (1).

Since I'm there, define the license.

Requested by:   bapt (1)
X-Teaser:       LibreOffice is coming!
04 Dec 2010 07:34:27
Version: 2.2_1
Committer: ade
Sync to new bsd.autotools.mk
02 Aug 2009 19:36:34
Version: 2.2_1
Committer: mezz
-Repocopy devel/libtool15 -> libtool22 and libltdl15 -> libltdl22.
-Update libtool and libltdl to 2.2.6a.
-Remove devel/libtool15 and devel/libltdl15.
-Fix ports build with libtool22/libltdl22.
-Bump ports that depend on libltdl22 due to shared library version change.
-Explain what to do to update in UPDATING.

It has been tested with GNOME2, XFCE4, KDE3, KDE4 and many other window
managers/desktops and applications at runtime.

With help:      marcus and kwm
Pointyhat-exp:  a few times by pav
Tested by:      pgollucci, "Romain Tartière" <romain@blogreen.org>, and
                a few MarcusCom CVS users. Also, I might have missed a few.
Repocopy by:    marcus
Approved by:    portmgr
21 Aug 2008 06:18:49
Version: 2.2_1
Committer: rafan
Update CONFIGURE_ARGS for how we pass CONFIGURE_TARGET to the configure script.
Specifically, newer autoconf (> 2.13) has different semantics for the
configure target. In short, one should use --build=CONFIGURE_TARGET
instead of CONFIGURE_TARGET directly. Otherwise, you will get a warning
and the old semantics may be removed in later autoconf releases.

To work around this issue, many ports hack the CONFIGURE_TARGET variable
so that it contains the ``--build='' prefix.

To solve this issue, given that some ports still have a
configure script generated by the old autoconf, we use runtime detection
in the do-configure target so that the proper argument can be used.

Changes to Mk/*:
 - Add runtime detection magic in bsd.port.mk
(Only the first 15 lines of the commit message are shown above.)
01 Jun 2008 20:39:59
Version: 2.2_1
Committer: thierry
Fix plist with NOPORTDOCS.

Reported by:    itetcu
23 Aug 2007 22:13:35
Version: 2.2_1
Committer: thierry
Import patches (imported from OpenOffice.org 2.3 by Fedora, at least in
part). These patches, released under a BSD license, seem to improve the
accuracy of language detection, especially for languages that don't use a
Latin script.
01 Feb 2007 02:42:06
Version: 2.2
Committer: kris
Use libtool port instead of included version to avoid objformat a.out botch
04 Dec 2006 21:45:23
Version: 2.2
Committer: thierry
Libtextcat is a library with functions that implement the classification
technique described in Cavnar & Trenkle, "N-Gram-Based Text Categorization".
It was primarily developed for language guessing, a task on which it is known to
perform with near-perfect accuracy.

WWW: http://software.wise-guys.nl/libtextcat/
