Categories: None [Edit]

tiny_segmenter

https://rubygems.org/gems/tiny_segmenter
https://github.com/6/tiny_segmenter
Ruby port of TinySegmenter.js for tokenizing Japanese text. Uses a Naive Bayes model that has been trained using the RWCP corpus and optimized using L1-norm regularization. The resultant model is quite compact, yet has a 95% accuracy rate.

Total

Ranking: 7,901 of 193,241
Downloads: 545,775

Daily

Ranking: 14,896 of 193,214
Downloads: 23

Depended by

RankDownloadsName
40,08231,540nhkore
181,5501,633kanji-translator

Depends on

RankDownloadsName
81,330,550,542rake
29983,169,856rspec

Owners

#GravatarHandle
1iconpag