Mc's Gems

| # | Total Rank | Daily Rank | Name | Summary |
|---|---|---|---|---|
| 1 | 23,669 | 55,441 | generalscraper | Scrapes Google |
| 2 | 25,587 | 67,713 | linkedindata | Scrapes all LinkedIn profiles including terms you specify. |
| 3 | 27,612 | 26,915 | jsontochart | Take JSON files and outputs html for various types of charts |
| 4 | 29,631 | 123,501 | entityextractor | Extracts entities and terms from any JSON. |
| 5 | 30,337 | 34,326 | linkedincrawler | Crawls public LinkedIn profiles via Google |
| 6 | 33,043 | 123,501 | dircrawl | Run block on all files in dir |
| 7 | 39,969 | 29,855 | wordcloud | Takes input and outputs the same text with word size changed based on frequency. |
| 8 | 40,902 | 123,501 | uploadconvert | Converts documents to the appropriate format for Transparency Toolkit. |
| 9 | 42,210 | 67,713 | linkedinparser | Parses public LinkedIn profiles |
| 10 | 43,518 | 87,645 | parsefile | OCR file and extract metadata using Apache Tika and Tesseract |
| 11 | 47,782 | 67,713 | urlarchiver | Saves html and pdfs of websites. |
| 12 | 49,119 | 34,326 | twittercrawler | Crawls Twitter |
| 13 | 51,690 | 55,441 | extractpatterns | Extracts entities and terms from any JSON. |
| 14 | 53,601 | 123,501 | timelinegen | TimelineGen generates JSON files for use as TimelineJS data. |
| 15 | 55,641 | 123,501 | sunlightcongress | Access to Sunlight Foundation's congress data. |
| 16 | 56,738 | 123,501 | indeedparser | Parses Indeed resumes |
| 17 | 60,454 | 55,441 | jsontonetworkgraph | Generates node and link data from any JSON. |
| 18 | 62,093 | 123,501 | piplrequest | Gets data from Pipl |
| 19 | 64,955 | 41,725 | tsjobcrawler | Crawls job listing websites for jobs requiring security clearance. |
| 20 | 70,367 | 34,326 | requestmanager | Manages proxies, wait intervals, etc |
| 21 | 74,635 | 87,645 | effscraper | Scrapes EFF court documents then extracts the plaintext and metadata. |
| 22 | 74,936 | 123,501 | countryconvert | Converts 2-char ISO country codes to 3-char. |
| 23 | 75,096 | 87,645 | termextractor | Extracts entities and terms from any JSON. |
| 24 | 79,206 | 55,441 | indeedcrawler | Crawls Indeed resumes |
| 25 | 79,539 | 123,501 | jsontomap | Converts a JSON into a GeoJSON. |
| 26 | 79,918 | 123,501 | sunlightpartytime | Access to Sunlight Foundation's Party Time data. |
| 27 | 87,165 | 123,501 | acluscraper | Scrapes ACLU court documents then extracts the plaintext and metadata. |
| 28 | 90,025 | 123,501 | piplcollector | Gets data from Pipl for dir of files |
| 29 | 90,917 | 123,501 | jsoncrossreference | Crossreferences JSONs and returns the matches |
| 30 | 94,597 | 123,501 | wlsearchscraper | Gets a list of documents from the WikiLeaks search that match certain terms. |
| 31 | 105,549 | 87,645 | datacalc | Some data calculation/manipulation for Transparency Toolkit. |
| 32 | 111,239 | 87,645 | jsoncombiner | Input multiple JSONs, get back one with all the data |
| 33 | 112,557 | 123,501 | doc_integrity_check | Encrypts, verifies, and checks hashes of files |
| 34 | 114,212 | 87,645 | jsontochoropleth | Converts as JSON to a world choropleth map. |
| 35 | 115,840 | 55,441 | ttcalc | Calculation functions for Transparency Toolkit. |
| 36 | 116,449 | 123,501 | sigadparse | Extracts SIGADs from documents |
| 37 | 130,960 | 87,645 | harvesterreporter | Incremental result reporting for Transparency Toolkit |
| 38 | 147,621 | 123,501 | guardianscraper | Scrapes Guardian articles. |
| 39 | 149,096 | 123,501 | indeedscraper | Get resumes and job listings from indeed based on search terms and locations. |
| 40 | 149,368 | 123,501 | nametoemail | Gets a list of possible email addresses. |
| 41 | 171,581 | 123,501 | docintegritycheck | Encrypts, verifies, and checks hashes of files |