Mc's Gems

#   Total Rank  Daily Rank  Name                 Summary
1   22,332      63,432      generalscraper       Scrapes Google
2   24,156      63,432      linkedindata         Scrapes all LinkedIn profiles including terms you specify.
3   26,021      63,432      jsontochart          Takes JSON files and outputs HTML for various types of charts
4   28,088      63,432      entityextractor      Extracts entities and terms from any JSON.
5   28,869      63,432      linkedincrawler      Crawls public LinkedIn profiles via Google
6   31,616      41,916      dircrawl             Run block on all files in dir
7   38,444      63,432      wordcloud            Takes input and outputs the same text with word size changed based on frequency.
8   39,421      63,432      uploadconvert        Converts documents to the appropriate format for Transparency Toolkit.
9   41,165      63,432      linkedinparser       Parses public LinkedIn profiles
10  41,869      63,432      parsefile            OCR file and extract metadata using Apache Tika and Tesseract
11  45,819      63,432      urlarchiver          Saves HTML and PDFs of websites.
12  48,441      63,432      twittercrawler       Crawls Twitter
13  50,228      63,432      extractpatterns      Extracts entities and terms from any JSON.
14  51,737      63,432      timelinegen          TimelineGen generates JSON files for use as TimelineJS data.
15  53,679      63,432      sunlightcongress     Access to Sunlight Foundation's congress data.
16  55,212      63,432      indeedparser         Parses Indeed resumes
17  58,554      63,432      jsontonetworkgraph   Generates node and link data from any JSON.
18  61,145      63,432      piplrequest          Gets data from Pipl
19  64,485      41,916      tsjobcrawler         Crawls job listing websites for jobs requiring security clearance.
20  67,738      18,158      requestmanager       Manages proxies, wait intervals, etc
21  72,120      30,305      effscraper           Scrapes EFF court documents then extracts the plaintext and metadata.
22  72,728      63,432      countryconvert       Converts 2-char ISO country codes to 3-char.
23  72,959      63,432      termextractor        Extracts entities and terms from any JSON.
24  77,170      63,432      sunlightpartytime    Access to Sunlight Foundation's Party Time data.
25  77,320      63,432      jsontomap            Converts a JSON into a GeoJSON.
26  77,512      63,432      indeedcrawler        Crawls Indeed resumes
27  84,640      63,432      acluscraper          Scrapes ACLU court documents then extracts the plaintext and metadata.
28  87,758      63,432      jsoncrossreference   Cross-references JSONs and returns the matches
29  88,398      63,432      piplcollector        Gets data from Pipl for dir of files
30  90,305      63,432      wlsearchscraper      Gets a list of documents from the WikiLeaks search that match certain terms.
31  102,224     63,432      datacalc             Some data calculation/manipulation for Transparency Toolkit.
32  107,554     63,432      jsoncombiner         Input multiple JSONs, get back one with all the data
33  111,177     63,432      jsontochoropleth     Converts a JSON to a world choropleth map.
34  112,135     30,305      doc_integrity_check  Encrypts, verifies, and checks hashes of files
35  113,210     63,432      ttcalc               Calculation functions for Transparency Toolkit.
36  113,914     63,432      sigadparse           Extracts SIGADs from documents
37  129,071     63,432      harvesterreporter    Incremental result reporting for Transparency Toolkit
38  143,298     63,432      guardianscraper      Scrapes Guardian articles.
39  145,381     63,432      indeedscraper        Get resumes and job listings from Indeed based on search terms and locations.
40  147,632     63,432      nametoemail          Gets a list of possible email addresses.
41  168,357     30,305      docintegritycheck    Encrypts, verifies, and checks hashes of files