Mc's Gems

icon
#Total RankDaily RankNameSummary
124,02323,772generalscraperScrapes Google
225,94618,921linkedindataScrapes all LinkedIn profiles including terms you specify.
328,03722,059jsontochartTake JSON files and outputs html for various types of charts
430,03238,209entityextractorExtracts entities and terms from any JSON.
530,69520,339linkedincrawlerCrawls public LinkedIn profiles via Google
633,42035,174dircrawlRun block on all files in dir
740,32650,011wordcloudTakes input and outputs the same text with word size changed based on frequency.
841,23546,875uploadconvertConverts documents to the appropriate format for Transparency Toolkit.
942,59926,665linkedinparserParses public LinkedIn profiles
1043,87842,071parsefileOCR file and extract metadata using Apache Tika and Tesseract
1148,22353,432urlarchiverSaves html and pdfs of websites.
1249,40646,875twittercrawlerCrawls Twitter
1352,15453,432extractpatternsExtracts entities and terms from any JSON.
1454,10262,145timelinegenTimelineGen generates JSON files for use as TimelineJS data.
1556,15762,145sunlightcongressAccess to Sunlight Foundation's congress data.
1657,19036,670indeedparserParses Indeed resumes
1760,98046,875jsontonetworkgraphGenerates node and link data from any JSON.
1862,51462,145piplrequestGets data from Pipl
1965,23057,621tsjobcrawlerCrawls job listing websites for jobs requiring security clearance.
2070,79467,903requestmanagerManages proxies, wait intervals, etc
2175,17793,128effscraperScrapes EFF court documents then extracts the plaintext and metadata.
2275,54293,128countryconvertConverts 2-char ISO country codes to 3-char.
2375,57183,223termextractorExtracts entities and terms from any JSON.
2479,70153,432indeedcrawlerCrawls Indeed resumes
2580,12362,145jsontomapConverts a JSON into a GeoJSON.
2680,57693,128sunlightpartytimeAccess to Sunlight Foundation's Party Time data.
2787,791125,342acluscraperScrapes ACLU court documents then extracts the plaintext and metadata.
2890,55293,128piplcollectorGets data from Pipl for dir of files
2991,62574,353jsoncrossreferenceCrossreferences JSONs and returns the matches
3095,353108,114wlsearchscraperGets a list of documents from the WikiLeaks search that match certain terms.
31106,372125,342datacalcSome data calculation/manipulation for Transparency Toolkit.
32112,03293,128jsoncombinerInput multiple JSONs, get back one with all the data
33112,909108,114doc_integrity_checkEncrypts, verifies, and checks hashes of files
34115,02293,128jsontochoroplethConverts as JSON to a world choropleth map.
35116,596125,342ttcalcCalculation functions for Transparency Toolkit.
36117,152125,342sigadparseExtracts SIGADs from documents
37131,408125,342harvesterreporterIncremental result reporting for Transparency Toolkit
38148,509155,721guardianscraperScrapes Guardian articles.
39150,236125,342indeedscraperGet resumes and job listings from indeed based on search terms and locations.
40150,283155,721nametoemailGets a list of possible email addresses.
41172,576155,721docintegritycheckEncrypts, verifies, and checks hashes of files