Mc's Gems

icon
#Total RankDaily RankNameSummary
123,302143,427generalscraperScrapes Google
225,16477,741linkedindataScrapes all LinkedIn profiles including terms you specify.
327,204143,427jsontochartTake JSON files and outputs html for various types of charts
429,16692,912entityextractorExtracts entities and terms from any JSON.
529,908143,427linkedincrawlerCrawls public LinkedIn profiles via Google
632,590111,035dircrawlRun block on all files in dir
739,40330,171wordcloudTakes input and outputs the same text with word size changed based on frequency.
840,32927,645uploadconvertConverts documents to the appropriate format for Transparency Toolkit.
941,71577,741linkedinparserParses public LinkedIn profiles
1042,974111,035parsefileOCR file and extract metadata using Apache Tika and Tesseract
1147,11735,503urlarchiverSaves html and pdfs of websites.
1248,70133,465twittercrawlerCrawls Twitter
1351,03277,741extractpatternsExtracts entities and terms from any JSON.
1452,85592,912timelinegenTimelineGen generates JSON files for use as TimelineJS data.
1555,08767,069sunlightcongressAccess to Sunlight Foundation's congress data.
1656,112111,035indeedparserParses Indeed resumes
1759,735111,035jsontonetworkgraphGenerates node and link data from any JSON.
1861,53392,912piplrequestGets data from Pipl
1964,51040,670tsjobcrawlerCrawls job listing websites for jobs requiring security clearance.
2069,733143,427requestmanagerManages proxies, wait intervals, etc
2173,734111,035effscraperScrapes EFF court documents then extracts the plaintext and metadata.
2274,158143,427countryconvertConverts 2-char ISO country codes to 3-char.
2374,31377,741termextractorExtracts entities and terms from any JSON.
2478,551111,035indeedcrawlerCrawls Indeed resumes
2578,764143,427jsontomapConverts a JSON into a GeoJSON.
2679,16992,912sunlightpartytimeAccess to Sunlight Foundation's Party Time data.
2786,42667,069acluscraperScrapes ACLU court documents then extracts the plaintext and metadata.
2889,195143,427piplcollectorGets data from Pipl for dir of files
2989,998143,427jsoncrossreferenceCrossreferences JSONs and returns the matches
3093,39359,042wlsearchscraperGets a list of documents from the WikiLeaks search that match certain terms.
31104,670143,427datacalcSome data calculation/manipulation for Transparency Toolkit.
32110,211143,427jsoncombinerInput multiple JSONs, get back one with all the data
33111,882111,035doc_integrity_checkEncrypts, verifies, and checks hashes of files
34113,120111,035jsontochoroplethConverts as JSON to a world choropleth map.
35114,81292,912ttcalcCalculation functions for Transparency Toolkit.
36115,462111,035sigadparseExtracts SIGADs from documents
37130,148143,427harvesterreporterIncremental result reporting for Transparency Toolkit
38146,191143,427guardianscraperScrapes Guardian articles.
39147,727111,035indeedscraperGet resumes and job listings from indeed based on search terms and locations.
40148,691143,427nametoemailGets a list of possible email addresses.
41170,43692,912docintegritycheckEncrypts, verifies, and checks hashes of files