Mc's Gems

icon
#Total RankDaily RankNameSummary
123,52049,272generalscraperScrapes Google
225,42718,397linkedindataScrapes all LinkedIn profiles including terms you specify.
327,45624,149jsontochartTake JSON files and outputs html for various types of charts
429,43588,565entityextractorExtracts entities and terms from any JSON.
530,19120,888linkedincrawlerCrawls public LinkedIn profiles via Google
632,85873,396dircrawlRun block on all files in dir
739,770112,136wordcloudTakes input and outputs the same text with word size changed based on frequency.
840,718112,136uploadconvertConverts documents to the appropriate format for Transparency Toolkit.
942,03324,984linkedinparserParses public LinkedIn profiles
1043,35031,227parsefileOCR file and extract metadata using Apache Tika and Tesseract
1147,562112,136urlarchiverSaves html and pdfs of websites.
1248,99773,396twittercrawlerCrawls Twitter
1351,460112,136extractpatternsExtracts entities and terms from any JSON.
1453,315112,136timelinegenTimelineGen generates JSON files for use as TimelineJS data.
1555,558149,536sunlightcongressAccess to Sunlight Foundation's congress data.
1656,50588,565indeedparserParses Indeed resumes
1760,20649,272jsontonetworkgraphGenerates node and link data from any JSON.
1861,92633,072piplrequestGets data from Pipl
1964,850112,136tsjobcrawlerCrawls job listing websites for jobs requiring security clearance.
2070,24544,662requestmanagerManages proxies, wait intervals, etc
2174,275112,136effscraperScrapes EFF court documents then extracts the plaintext and metadata.
2274,639112,136countryconvertConverts 2-char ISO country codes to 3-char.
2374,850112,136termextractorExtracts entities and terms from any JSON.
2479,039149,536indeedcrawlerCrawls Indeed resumes
2579,26962,843jsontomapConverts a JSON into a GeoJSON.
2679,80162,843sunlightpartytimeAccess to Sunlight Foundation's Party Time data.
2786,917112,136acluscraperScrapes ACLU court documents then extracts the plaintext and metadata.
2889,71662,843piplcollectorGets data from Pipl for dir of files
2990,57162,843jsoncrossreferenceCrossreferences JSONs and returns the matches
3094,245112,136wlsearchscraperGets a list of documents from the WikiLeaks search that match certain terms.
31105,20588,565datacalcSome data calculation/manipulation for Transparency Toolkit.
32110,85773,396jsoncombinerInput multiple JSONs, get back one with all the data
33112,327112,136doc_integrity_checkEncrypts, verifies, and checks hashes of files
34113,79088,565jsontochoroplethConverts as JSON to a world choropleth map.
35115,453149,536ttcalcCalculation functions for Transparency Toolkit.
36116,14688,565sigadparseExtracts SIGADs from documents
37130,779149,536harvesterreporterIncremental result reporting for Transparency Toolkit
38147,109149,536guardianscraperScrapes Guardian articles.
39148,641149,536indeedscraperGet resumes and job listings from indeed based on search terms and locations.
40148,871112,136nametoemailGets a list of possible email addresses.
41171,151149,536docintegritycheckEncrypts, verifies, and checks hashes of files