Categories: None [Edit]

maxixe

https://rubygems.org/gems/maxixe
https://github.com/rogerbraun/Maxixe
Maxixe is an implementation of the Tango algorithm describe in the paper "Mostly-unsupervised statistical segmentation of Japanese kanji sequences" by Ando and Lee. While the paper deals with Japanese characters, it should work on any unsegmented text given enough corpus data and a tuning of the algorithm parameters.

Total

Ranking: 94,709 of 192,861
Downloads: 9,075

Daily

Ranking: 43,140 of 192,676
Downloads: 3

Depended by

RankDownloadsName

Depends on

RankDownloadsName
29976,888,025rspec
83764,094,507text

Owners

#GravatarHandle
1iconrogerbraun