Categories: None [Edit]

tiny_segmenter

https://rubygems.org/gems/tiny_segmenter
https://github.com/6/tiny_segmenter
Ruby port of TinySegmenter.js for tokenizing Japanese text. Uses a Naive Bayes model that has been trained using the RWCP corpus and optimized using L1-norm regularization. The resultant model is quite compact, yet has a 95% accuracy rate.

Total

Ranking: 7,736 of 187,591
Downloads: 507,790

Daily

Ranking: 5,823 of 187,571
Downloads: 251

Depended by

RankDownloadsName
40,83528,881nhkore
180,3971,174kanji-translator

Depends on

RankDownloadsName
101,165,649,814rake
28925,037,502rspec

Owners

#GravatarHandle
1iconpag