Mc's Gems

icon
#Total RankDaily RankNameSummary
123,93959,381generalscraperScrapes Google
225,866140,767linkedindataScrapes all LinkedIn profiles including terms you specify.
327,93583,579jsontochartTake JSON files and outputs html for various types of charts
429,958107,846entityextractorExtracts entities and terms from any JSON.
530,61869,558linkedincrawlerCrawls public LinkedIn profiles via Google
633,35183,579dircrawlRun block on all files in dir
740,21741,232wordcloudTakes input and outputs the same text with word size changed based on frequency.
841,131140,767uploadconvertConverts documents to the appropriate format for Transparency Toolkit.
942,49983,579linkedinparserParses public LinkedIn profiles
1043,792140,767parsefileOCR file and extract metadata using Apache Tika and Tesseract
1148,08183,579urlarchiverSaves html and pdfs of websites.
1249,311140,767twittercrawlerCrawls Twitter
1352,048107,846extractpatternsExtracts entities and terms from any JSON.
1453,962107,846timelinegenTimelineGen generates JSON files for use as TimelineJS data.
1556,02269,558sunlightcongressAccess to Sunlight Foundation's congress data.
1657,098107,846indeedparserParses Indeed resumes
1760,841140,767jsontonetworkgraphGenerates node and link data from any JSON.
1862,393140,767piplrequestGets data from Pipl
1965,182107,846tsjobcrawlerCrawls job listing websites for jobs requiring security clearance.
2070,66628,161requestmanagerManages proxies, wait intervals, etc
2175,028140,767effscraperScrapes EFF court documents then extracts the plaintext and metadata.
2275,384107,846countryconvertConverts 2-char ISO country codes to 3-char.
2375,490140,767termextractorExtracts entities and terms from any JSON.
2479,566140,767indeedcrawlerCrawls Indeed resumes
2579,969140,767jsontomapConverts a JSON into a GeoJSON.
2680,408107,846sunlightpartytimeAccess to Sunlight Foundation's Party Time data.
2787,595107,846acluscraperScrapes ACLU court documents then extracts the plaintext and metadata.
2890,371140,767piplcollectorGets data from Pipl for dir of files
2991,416140,767jsoncrossreferenceCrossreferences JSONs and returns the matches
3095,110140,767wlsearchscraperGets a list of documents from the WikiLeaks search that match certain terms.
31106,114140,767datacalcSome data calculation/manipulation for Transparency Toolkit.
32111,797140,767jsoncombinerInput multiple JSONs, get back one with all the data
33112,820107,846doc_integrity_checkEncrypts, verifies, and checks hashes of files
34114,754107,846jsontochoroplethConverts as JSON to a world choropleth map.
35116,330140,767ttcalcCalculation functions for Transparency Toolkit.
36116,91159,381sigadparseExtracts SIGADs from documents
37131,303140,767harvesterreporterIncremental result reporting for Transparency Toolkit
38148,228107,846guardianscraperScrapes Guardian articles.
39149,836140,767indeedscraperGet resumes and job listings from indeed based on search terms and locations.
40150,008140,767nametoemailGets a list of possible email addresses.
41172,25483,579docintegritycheckEncrypts, verifies, and checks hashes of files