Categories: None

wp2txt

https://rubygems.org/gems/wp2txt
https://github.com/yohasebe/wp2txt
WP2TXT extracts text and category data from Wikipedia dump files (encoded in XML / compressed with Bzip2), removing MediaWiki markup and other metadata.
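
To make the description above concrete, here is a minimal Ruby sketch of what "removing MediaWiki markup" and "extracting category data" can look like for a single article's wikitext. It is not WP2TXT's implementation or API: the method name `strip_wikitext` and the constant `CATEGORY_RE` are hypothetical, and the sketch assumes the XML has already been decompressed and parsed. It handles only a small subset of markup (category links, non-nested templates, internal links, bold/italic quotes).

```ruby
# Illustrative only: a tiny subset of MediaWiki markup handling.
# The names below are hypothetical and do not come from wp2txt.

# Matches [[Category:Name]] and [[Category:Name|sort key]].
CATEGORY_RE = /\[\[Category:([^\]|]+)(?:\|[^\]]*)?\]\]/

# Returns [plain_text, categories] for one article's wikitext.
def strip_wikitext(wikitext)
  # Collect category names before removing the category links.
  categories = wikitext.scan(CATEGORY_RE).flatten.map(&:strip)

  text = wikitext.gsub(CATEGORY_RE, "")
  text = text.gsub(/\{\{[^{}]*\}\}/, "")                    # drop non-nested templates
  text = text.gsub(/\[\[(?:[^\]|]*\|)?([^\]]+)\]\]/, '\1')  # [[target|label]] -> label
  text = text.gsub(/'{2,}/, "")                             # remove bold/italic quote runs

  [text.strip, categories]
end

sample = "'''Ruby''' is a [[programming language|language]] by " \
         "[[Yukihiro Matsumoto]].{{citation needed}}\n" \
         "[[Category:Programming languages]]"

plain, cats = strip_wikitext(sample)
puts plain
puts cats.inspect
```

Real dumps contain nested templates, tables, references, and HTML entities, which regular expressions like these do not cover; that gap is the reason to use a dedicated tool.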

Total

Ranking: 22,078 of 193,767
Downloads: 73,946

Daily

Ranking: 50,827 of 193,752
Downloads: 2

Depended by

Rank   Downloads      Name

Depends on

Rank   Downloads      Name
1      3,495,674,061  bundler
8      1,346,467,157  rake
15     1,212,706,430  nokogiri
29     989,755,551    rspec
40     779,738,226    parallel
97     475,999,505    simplecov
118    402,874,348    webmock
217    232,519,870    htmlentities
279    186,046,291    sqlite3
332    153,253,893    tty-spinner
387    130,844,650    optimist
464    111,109,109    pastel
3,387  4,149,768      tty-progressbar

Owners

#  Handle
1  yohasebe