irstlm
- Ebuilds: 2, Testing: 6.00.05-r1 Description:
The IRST Language Modeling Toolkit features algorithms
and data structures suitable to estimate, store, and access very large LMs.
The software has been integrated into a popular open source Statistical Machine
Translation decoder called Moses, and is compatible with language models created
with other tools, such as the SRILM Tooolkit.
Homepage:https://hlt-mt.fbk.eu/technologies/irstlm License: LGPL-3
mitlm
- Ebuilds: 1, Testing: 0.4.2 Description:
The MIT Language Modeling (MITLM) toolkit is a set of tools
designed for the efficient estimation of statistical n-gram
language models involving iterative parameter estimation.
It achieves much of its efficiency through the use of a compact
vector representation of n-grams.
Homepage:https://github.com/mitlm/mitlm License: MIT
openfst
- Ebuilds: 1, Testing: 1.8.2 Description: Finite State Transducer tools by Google et al
Homepage:http://www.openfst.org License: Apache-2.0
pqdump
- Ebuilds: 1, Testing: 0.1.0 Description: simple program to dump Parquet files
Homepage:https://github.com/Berrysoft/pqdump License: BSD Apache-2.0 Apache-2.0-with-LLVM-exceptions BSD CC0-1.0 MIT Unicode-DFS-2016 Unlicense ZLIB
stanford-parser
- Ebuilds: 1, Testing: 4.2.0 Description:
Stanford parser is a natural language parser implemented in Java and using
statistical methods. It includes PCFG and dependency parsers.
Homepage:https://www-nlp.stanford.edu/software/lex-parser.html License: GPL-2
stanford-tagger
- Ebuilds: 1, Testing: 4.2.0 Description:
University of Stanford’s Natural language pos tagger. Uses log linear
pos taggers such as Maximum Entropy model tagging.
Homepage:http://nlp.stanford.edu/software/tagger.shtml License: GPL-2