Mc's Gems

#	Total Rank	Daily Rank	Name	Summary
1	24,023	23,772	generalscraper	Scrapes Google
2	25,946	18,921	linkedindata	Scrapes all LinkedIn profiles including terms you specify.
3	28,037	22,059	jsontochart	Takes JSON files and outputs HTML for various types of charts
4	30,032	38,209	entityextractor	Extracts entities and terms from any JSON.
5	30,695	20,339	linkedincrawler	Crawls public LinkedIn profiles via Google
6	33,420	35,174	dircrawl	Runs a block on all files in a directory
7	40,326	50,011	wordcloud	Takes input and outputs the same text with word size changed based on frequency.
8	41,235	46,875	uploadconvert	Converts documents to the appropriate format for Transparency Toolkit.
9	42,599	26,665	linkedinparser	Parses public LinkedIn profiles
10	43,878	42,071	parsefile	OCRs a file and extracts metadata using Apache Tika and Tesseract
11	48,223	53,432	urlarchiver	Saves HTML and PDFs of websites.
12	49,406	46,875	twittercrawler	Crawls Twitter
13	52,154	53,432	extractpatterns	Extracts entities and terms from any JSON.
14	54,102	62,145	timelinegen	TimelineGen generates JSON files for use as TimelineJS data.
15	56,157	62,145	sunlightcongress	Access to Sunlight Foundation's congress data.
16	57,190	36,670	indeedparser	Parses Indeed resumes
17	60,980	46,875	jsontonetworkgraph	Generates node and link data from any JSON.
18	62,514	62,145	piplrequest	Gets data from Pipl
19	65,230	57,621	tsjobcrawler	Crawls job listing websites for jobs requiring security clearance.
20	70,794	67,903	requestmanager	Manages proxies, wait intervals, etc
21	75,177	93,128	effscraper	Scrapes EFF court documents then extracts the plaintext and metadata.
22	75,542	93,128	countryconvert	Converts 2-char ISO country codes to 3-char.
23	75,571	83,223	termextractor	Extracts entities and terms from any JSON.
24	79,701	53,432	indeedcrawler	Crawls Indeed resumes
25	80,123	62,145	jsontomap	Converts a JSON into a GeoJSON.
26	80,576	93,128	sunlightpartytime	Access to Sunlight Foundation's Party Time data.
27	87,791	125,342	acluscraper	Scrapes ACLU court documents then extracts the plaintext and metadata.
28	90,552	93,128	piplcollector	Gets data from Pipl for dir of files
29	91,625	74,353	jsoncrossreference	Cross-references JSONs and returns the matches
30	95,353	108,114	wlsearchscraper	Gets a list of documents from the WikiLeaks search that match certain terms.
31	106,372	125,342	datacalc	Some data calculation/manipulation for Transparency Toolkit.
32	112,032	93,128	jsoncombiner	Input multiple JSONs, get back one with all the data
33	112,909	108,114	doc_integrity_check	Encrypts, verifies, and checks hashes of files
34	115,022	93,128	jsontochoropleth	Converts a JSON to a world choropleth map.
35	116,596	125,342	ttcalc	Calculation functions for Transparency Toolkit.
36	117,152	125,342	sigadparse	Extracts SIGADs from documents
37	131,408	125,342	harvesterreporter	Incremental result reporting for Transparency Toolkit
38	148,509	155,721	guardianscraper	Scrapes Guardian articles.
39	150,236	125,342	indeedscraper	Gets resumes and job listings from Indeed based on search terms and locations.
40	150,283	155,721	nametoemail	Gets a list of possible email addresses.
41	172,576	155,721	docintegritycheck	Encrypts, verifies, and checks hashes of files