Mc's Gems

# | Total Rank | Daily Rank | Name | Summary
1 | 23,748 | 13,648 | generalscraper | Scrapes Google
2 | 25,661 | 28,114 | linkedindata | Scrapes all LinkedIn profiles including terms you specify.
3 | 27,695 | 35,197 | jsontochart | Takes JSON files and outputs HTML for various types of charts
4 | 29,715 | 52,828 | entityextractor | Extracts entities and terms from any JSON.
5 | 30,433 | 75,671 | linkedincrawler | Crawls public LinkedIn profiles via Google
6 | 33,115 | 35,197 | dircrawl | Run block on all files in dir
7 | 40,059 | 61,973 | wordcloud | Takes input and outputs the same text with word size changed based on frequency.
8 | 40,988 | 46,466 | uploadconvert | Converts documents to the appropriate format for Transparency Toolkit.
9 | 42,310 | 52,828 | linkedinparser | Parses public LinkedIn profiles
10 | 43,595 | 46,466 | parsefile | OCR file and extract metadata using Apache Tika and Tesseract
11 | 47,888 | 38,050 | urlarchiver | Saves HTML and PDFs of websites.
12 | 49,206 | 32,895 | twittercrawler | Crawls Twitter
13 | 51,783 | 46,466 | extractpatterns | Extracts entities and terms from any JSON.
14 | 53,731 | 35,197 | timelinegen | TimelineGen generates JSON files for use as TimelineJS data.
15 | 55,781 | 52,828 | sunlightcongress | Access to Sunlight Foundation's congress data.
16 | 56,869 | 52,828 | indeedparser | Parses Indeed resumes
17 | 60,565 | 30,981 | jsontonetworkgraph | Generates node and link data from any JSON.
18 | 62,189 | 61,973 | piplrequest | Gets data from Pipl
19 | 65,035 | 52,828 | tsjobcrawler | Crawls job listing websites for jobs requiring security clearance.
20 | 70,476 | 98,294 | requestmanager | Manages proxies, wait intervals, etc
21 | 74,758 | 98,294 | effscraper | Scrapes EFF court documents then extracts the plaintext and metadata.
22 | 75,100 | 61,973 | countryconvert | Converts 2-char ISO country codes to 3-char.
23 | 75,236 | 75,671 | termextractor | Extracts entities and terms from any JSON.
24 | 79,360 | 61,973 | indeedcrawler | Crawls Indeed resumes
25 | 79,671 | 75,671 | jsontomap | Converts a JSON into a GeoJSON.
26 | 80,093 | 61,973 | sunlightpartytime | Access to Sunlight Foundation's Party Time data.
27 | 87,240 | 46,466 | acluscraper | Scrapes ACLU court documents then extracts the plaintext and metadata.
28 | 90,143 | 41,722 | piplcollector | Gets data from Pipl for dir of files
29 | 91,102 | 98,294 | jsoncrossreference | Cross-references JSONs and returns the matches
30 | 94,825 | 98,294 | wlsearchscraper | Gets a list of documents from the WikiLeaks search that match certain terms.
31 | 105,718 | 52,828 | datacalc | Some data calculation/manipulation for Transparency Toolkit.
32 | 111,456 | 75,671 | jsoncombiner | Input multiple JSONs, get back one with all the data
33 | 112,716 | 46,466 | doc_integrity_check | Encrypts, verifies, and checks hashes of files
34 | 114,528 | 98,294 | jsontochoropleth | Converts a JSON to a world choropleth map.
35 | 116,126 | 75,671 | ttcalc | Calculation functions for Transparency Toolkit.
36 | 116,738 | 98,294 | sigadparse | Extracts SIGADs from documents
37 | 131,249 | 75,671 | harvesterreporter | Incremental result reporting for Transparency Toolkit
38 | 148,003 | 75,671 | guardianscraper | Scrapes Guardian articles.
39 | 149,590 | 98,294 | indeedscraper | Get resumes and job listings from Indeed based on search terms and locations.
40 | 149,755 | 98,294 | nametoemail | Gets a list of possible email addresses.
41 | 171,901 | 98,294 | docintegritycheck | Encrypts, verifies, and checks hashes of files