Categories: None [Edit]

wp2txt

https://rubygems.org/gems/wp2txt
https://github.com/yohasebe/wp2txt
WP2TXT extracts text and category data from Wikipedia dump files (encoded in XML / compressed with Bzip2), removing MediaWiki markup and other metadata.

Total

Ranking: 20,448 of 180,689
Downloads: 64,405

Daily

Ranking: 45,393 of 180,681
Downloads: 7

Depended by

RankDownloadsName

Depends on

RankDownloadsName
12,105,896,669bundler
16862,884,664rake
21797,443,133nokogiri
24772,514,050rspec
55456,993,046parallel
61433,178,407ruby-progressbar
205155,258,905htmlentities
36589,162,091optimist
42476,910,666tty-spinner
73152,038,403pastel

Owners

#GravatarHandle
1iconyohasebe