Mc's Gems

| # | Total Rank | Daily Rank | Name | Summary |
|---|---|---|---|---|
| 1 | 23,389 | 17,977 | generalscraper | Scrapes Google |
| 2 | 25,278 | 73,912 | linkedindata | Scrapes all LinkedIn profiles including terms you specify. |
| 3 | 27,323 | 93,934 | jsontochart | Takes JSON files and outputs HTML for various types of charts |
| 4 | 29,266 | 93,934 | entityextractor | Extracts entities and terms from any JSON. |
| 5 | 30,037 | 125,166 | linkedincrawler | Crawls public LinkedIn profiles via Google |
| 6 | 32,684 | 73,912 | dircrawl | Runs a block on all files in a directory |
| 7 | 39,554 | 93,934 | wordcloud | Takes input and outputs the same text with word size changed based on frequency. |
| 8 | 40,485 | 125,166 | uploadconvert | Converts documents to the appropriate format for Transparency Toolkit. |
| 9 | 41,860 | 93,934 | linkedinparser | Parses public LinkedIn profiles |
| 10 | 43,120 | 73,912 | parsefile | OCRs a file and extracts metadata using Apache Tika and Tesseract |
| 11 | 47,320 | 125,166 | urlarchiver | Saves HTML and PDFs of websites. |
| 12 | 48,847 | 73,912 | twittercrawler | Crawls Twitter |
| 13 | 51,205 | 93,934 | extractpatterns | Extracts entities and terms from any JSON. |
| 14 | 53,080 | 125,166 | timelinegen | TimelineGen generates JSON files for use as TimelineJS data. |
| 15 | 55,293 | 73,912 | sunlightcongress | Access to Sunlight Foundation's Congress data. |
| 16 | 56,282 | 125,166 | indeedparser | Parses Indeed resumes |
| 17 | 59,955 | 93,934 | jsontonetworkgraph | Generates node and link data from any JSON. |
| 18 | 61,734 | 73,912 | piplrequest | Gets data from Pipl |
| 19 | 64,663 | 93,934 | tsjobcrawler | Crawls job listing websites for jobs requiring security clearance. |
| 20 | 69,946 | 125,166 | requestmanager | Manages proxies, wait intervals, etc. |
| 21 | 73,941 | 93,934 | effscraper | Scrapes EFF court documents, then extracts the plaintext and metadata. |
| 22 | 74,353 | 125,166 | countryconvert | Converts 2-char ISO country codes to 3-char. |
| 23 | 74,535 | 125,166 | termextractor | Extracts entities and terms from any JSON. |
| 24 | 78,792 | 125,166 | indeedcrawler | Crawls Indeed resumes |
| 25 | 79,006 | 125,166 | jsontomap | Converts a JSON into a GeoJSON. |
| 26 | 79,493 | 125,166 | sunlightpartytime | Access to Sunlight Foundation's Party Time data. |
| 27 | 86,636 | 125,166 | acluscraper | Scrapes ACLU court documents, then extracts the plaintext and metadata. |
| 28 | 89,432 | 125,166 | piplcollector | Gets data from Pipl for a directory of files |
| 29 | 90,255 | 125,166 | jsoncrossreference | Cross-references JSONs and returns the matches |
| 30 | 93,757 | 125,166 | wlsearchscraper | Gets a list of documents from the WikiLeaks search that match certain terms. |
| 31 | 104,892 | 93,934 | datacalc | Some data calculation/manipulation for Transparency Toolkit. |
| 32 | 110,522 | 93,934 | jsoncombiner | Input multiple JSONs, get back one with all the data |
| 33 | 112,091 | 93,934 | doc_integrity_check | Encrypts, verifies, and checks hashes of files |
| 34 | 113,394 | 125,166 | jsontochoropleth | Converts a JSON to a world choropleth map. |
| 35 | 115,111 | 93,934 | ttcalc | Calculation functions for Transparency Toolkit. |
| 36 | 115,746 | 125,166 | sigadparse | Extracts SIGADs from documents |
| 37 | 130,522 | 125,166 | harvesterreporter | Incremental result reporting for Transparency Toolkit |
| 38 | 146,537 | 125,166 | guardianscraper | Scrapes Guardian articles. |
| 39 | 148,115 | 125,166 | indeedscraper | Gets resumes and job listings from Indeed based on search terms and locations. |
| 40 | 148,774 | 93,934 | nametoemail | Gets a list of possible email addresses. |
| 41 | 170,717 | 125,166 | docintegritycheck | Encrypts, verifies, and checks hashes of files |