Mc's Gems

icon
#Total RankDaily RankNameSummary
123,46781,713generalscraperScrapes Google
225,36881,713linkedindataScrapes all LinkedIn profiles including terms you specify.
327,40881,713jsontochartTake JSON files and outputs html for various types of charts
429,35981,713entityextractorExtracts entities and terms from any JSON.
530,16181,713linkedincrawlerCrawls public LinkedIn profiles via Google
632,78381,713dircrawlRun block on all files in dir
739,67350,280wordcloudTakes input and outputs the same text with word size changed based on frequency.
840,60481,713uploadconvertConverts documents to the appropriate format for Transparency Toolkit.
941,96181,713linkedinparserParses public LinkedIn profiles
1043,27481,713parsefileOCR file and extract metadata using Apache Tika and Tesseract
1147,47481,713urlarchiverSaves html and pdfs of websites.
1248,92981,713twittercrawlerCrawls Twitter
1351,37381,713extractpatternsExtracts entities and terms from any JSON.
1453,21281,713timelinegenTimelineGen generates JSON files for use as TimelineJS data.
1555,45181,713sunlightcongressAccess to Sunlight Foundation's congress data.
1656,40381,713indeedparserParses Indeed resumes
1760,10081,713jsontonetworkgraphGenerates node and link data from any JSON.
1861,86481,713piplrequestGets data from Pipl
1964,74081,713tsjobcrawlerCrawls job listing websites for jobs requiring security clearance.
2070,17432,664requestmanagerManages proxies, wait intervals, etc
2174,13732,664effscraperScrapes EFF court documents then extracts the plaintext and metadata.
2274,51981,713countryconvertConverts 2-char ISO country codes to 3-char.
2374,71381,713termextractorExtracts entities and terms from any JSON.
2478,95781,713indeedcrawlerCrawls Indeed resumes
2579,17181,713jsontomapConverts a JSON into a GeoJSON.
2679,73581,713sunlightpartytimeAccess to Sunlight Foundation's Party Time data.
2786,79481,713acluscraperScrapes ACLU court documents then extracts the plaintext and metadata.
2889,62981,713piplcollectorGets data from Pipl for dir of files
2990,49281,713jsoncrossreferenceCrossreferences JSONs and returns the matches
3094,05981,713wlsearchscraperGets a list of documents from the WikiLeaks search that match certain terms.
31105,11481,713datacalcSome data calculation/manipulation for Transparency Toolkit.
32110,82881,713jsoncombinerInput multiple JSONs, get back one with all the data
33112,24181,713doc_integrity_checkEncrypts, verifies, and checks hashes of files
34113,64981,713jsontochoroplethConverts as JSON to a world choropleth map.
35115,31281,713ttcalcCalculation functions for Transparency Toolkit.
36115,95481,713sigadparseExtracts SIGADs from documents
37130,73481,713harvesterreporterIncremental result reporting for Transparency Toolkit
38146,89481,713guardianscraperScrapes Guardian articles.
39148,43081,713indeedscraperGet resumes and job listings from indeed based on search terms and locations.
40148,70681,713nametoemailGets a list of possible email addresses.
41171,01481,713docintegritycheckEncrypts, verifies, and checks hashes of files