Categories: None [Edit]

tokenizer

https://rubygems.org/gems/tokenizer
https://github.com/arbox/tokenizer
A simple multilingual tokenizer for NLP tasks. This tool provides a CLI and a library for linguistic tokenization which is an anavoidable step for many HLT (human language technology) tasks in the preprocessing phase for further syntactic, semantic and other higher level processing goals. Use it for tokenization of German, English and French texts.

Total

Ranking: 10,545 of 189,455
Downloads: 270,137

Daily

Ranking: 7,232 of 189,432
Downloads: 230

Depended by

RankDownloadsName
8,248460,789metanorma-iso
69,95014,194social_tokenizer
81,01011,393jekyll-related-posts
88,5939,821smalltext
128,9004,934meiou
138,9674,360shalmaneser-frappe
150,5283,774TacTalk
163,2633,025chunkify
174,2482,312sentimentanalyzer

Depends on

RankDownloadsName

Owners

#GravatarHandle
1iconarbox