Mc's Gems

| # | Total Rank | Daily Rank | Name | Summary |
|---|---|---|---|---|
| 1 | 23,583 | 78,785 | generalscraper | Scrapes Google |
| 2 | 25,494 | 19,230 | linkedindata | Scrapes all LinkedIn profiles including terms you specify. |
| 3 | 27,510 | 20,969 | jsontochart | Takes JSON files and outputs HTML for various types of charts |
| 4 | 29,512 | 78,785 | entityextractor | Extracts entities and terms from any JSON. |
| 5 | 30,284 | 20,465 | linkedincrawler | Crawls public LinkedIn profiles via Google |
| 6 | 32,934 | 126,223 | dircrawl | Run block on all files in dir |
| 7 | 39,847 | 60,827 | wordcloud | Takes input and outputs the same text with word size changed based on frequency. |
| 8 | 40,777 | 43,469 | uploadconvert | Converts documents to the appropriate format for Transparency Toolkit. |
| 9 | 42,118 | 25,546 | linkedinparser | Parses public LinkedIn profiles |
| 10 | 43,444 | 126,223 | parsefile | OCR file and extract metadata using Apache Tika and Tesseract |
| 11 | 47,644 | 78,785 | urlarchiver | Saves HTML and PDFs of websites. |
| 12 | 49,043 | 126,223 | twittercrawler | Crawls Twitter |
| 13 | 51,560 | 126,223 | extractpatterns | Extracts entities and terms from any JSON. |
| 14 | 53,414 | 78,785 | timelinegen | TimelineGen generates JSON files for use as TimelineJS data. |
| 15 | 55,650 | 78,785 | sunlightcongress | Access to Sunlight Foundation's congress data. |
| 16 | 56,607 | 126,223 | indeedparser | Parses Indeed resumes |
| 17 | 60,285 | 32,889 | jsontonetworkgraph | Generates node and link data from any JSON. |
| 18 | 62,041 | 126,223 | piplrequest | Gets data from Pipl |
| 19 | 64,892 | 126,223 | tsjobcrawler | Crawls job listing websites for jobs requiring security clearance. |
| 20 | 70,309 | 126,223 | requestmanager | Manages proxies, wait intervals, etc. |
| 21 | 74,422 | 78,785 | effscraper | Scrapes EFF court documents then extracts the plaintext and metadata. |
| 22 | 74,756 | 78,785 | countryconvert | Converts 2-char ISO country codes to 3-char. |
| 23 | 74,943 | 78,785 | termextractor | Extracts entities and terms from any JSON. |
| 24 | 79,096 | 126,223 | indeedcrawler | Crawls Indeed resumes |
| 25 | 79,362 | 38,834 | jsontomap | Converts a JSON into a GeoJSON. |
| 26 | 79,894 | 78,785 | sunlightpartytime | Access to Sunlight Foundation's Party Time data. |
| 27 | 86,961 | 20,028 | acluscraper | Scrapes ACLU court documents then extracts the plaintext and metadata. |
| 28 | 89,875 | 126,223 | piplcollector | Gets data from Pipl for dir of files |
| 29 | 90,656 | 43,469 | jsoncrossreference | Cross-references JSONs and returns the matches |
| 30 | 94,387 | 78,785 | wlsearchscraper | Gets a list of documents from the WikiLeaks search that match certain terms. |
| 31 | 105,324 | 78,785 | datacalc | Some data calculation/manipulation for Transparency Toolkit. |
| 32 | 111,013 | 50,436 | jsoncombiner | Input multiple JSONs, get back one with all the data |
| 33 | 112,437 | 126,223 | doc_integrity_check | Encrypts, verifies, and checks hashes of files |
| 34 | 113,956 | 50,436 | jsontochoropleth | Converts a JSON to a world choropleth map. |
| 35 | 115,573 | 78,785 | ttcalc | Calculation functions for Transparency Toolkit. |
| 36 | 116,243 | 78,785 | sigadparse | Extracts SIGADs from documents |
| 37 | 130,867 | 126,223 | harvesterreporter | Incremental result reporting for Transparency Toolkit |
| 38 | 147,189 | 126,223 | guardianscraper | Scrapes Guardian articles. |
| 39 | 148,775 | 126,223 | indeedscraper | Gets resumes and job listings from Indeed based on search terms and locations. |
| 40 | 148,991 | 78,785 | nametoemail | Gets a list of possible email addresses. |
| 41 | 171,320 | 126,223 | docintegritycheck | Encrypts, verifies, and checks hashes of files |