Categories: None [Edit]

wp2txt

https://rubygems.org/gems/wp2txt
https://github.com/yohasebe/wp2txt
WP2TXT extracts text and category data from Wikipedia dump files (encoded in XML / compressed with Bzip2), removing MediaWiki markup and other metadata.

Total

Ranking: 20,443 of 180,702
Downloads: 64,485

Daily

Ranking: 10,925 of 180,689
Downloads: 80

Depended by

RankDownloadsName

Depends on

RankDownloadsName
12,107,069,537bundler
16863,261,344rake
21797,830,115nokogiri
24772,716,171rspec
55457,247,303parallel
61433,399,305ruby-progressbar
205155,345,907htmlentities
36589,222,653optimist
42576,950,791tty-spinner
73152,106,046pastel

Owners

#GravatarHandle
1iconyohasebe