Categories: None [Edit]
tiny_segmenter
Ruby port of TinySegmenter.js for tokenizing Japanese text. Uses a Naive Bayes model that has been trained using the RWCP corpus and optimized using L1-norm regularization. The resultant model is quite compact, yet has a 95% accuracy rate.
Total
Ranking: 7,790 of 188,757
Downloads: 519,772
Daily
Ranking: 7,250 of 188,742
Downloads: 231
Downloads Trends
Ranking Trends
Num of Versions Trends
Popular Versions (Major)
Popular Versions (Major.Minor)
Depended by
| Rank | Downloads | Name |
|---|---|---|
| 40,300 | 29,982 | nhkore |
| 180,598 | 1,373 | kanji-translator |
Owners
| # | Gravatar | Handle |
|---|---|---|
| 1 | pag |