Shidash's Gems

# | Total Rank | Daily Rank | Name | Summary
1 | 21,961 | 64,836 | generalscraper | Scrapes Google
2 | 23,735 | 143,384 | linkedindata | Scrapes all LinkedIn profiles including terms you specify.
3 | 25,514 | 49,088 | jsontochart | Takes JSON files and outputs HTML for various types of charts
4 | 27,581 | 64,836 | entityextractor | Extracts entities and terms from any JSON.
5 | 28,501 | 143,384 | linkedincrawler | Crawls public LinkedIn profiles via Google
6 | 31,165 | 64,836 | dircrawl | Run block on all files in dir
7 | 37,910 | 143,384 | wordcloud | Takes input and outputs the same text with word size changed based on frequency.
8 | 38,650 | 143,384 | uploadconvert | Converts documents to the appropriate format for Transparency Toolkit.
9 | 40,639 | 143,384 | linkedinparser | Parses public LinkedIn profiles
10 | 41,404 | 20,865 | parsefile | OCR file and extract metadata using Apache Tika and Tesseract
11 | 45,102 | 64,836 | urlarchiver | Saves HTML and PDFs of websites.
12 | 48,216 | 143,384 | twittercrawler | Crawls Twitter
13 | 49,858 | 64,836 | extractpatterns | Extracts entities and terms from any JSON.
14 | 50,850 | 143,384 | timelinegen | TimelineGen generates JSON files for use as TimelineJS data.
15 | 52,890 | 27,459 | sunlightcongress | Access to Sunlight Foundation's congress data.
16 | 54,944 | 64,836 | indeedparser | Parses Indeed resumes
17 | 57,833 | 49,088 | jsontonetworkgraph | Generates node and link data from any JSON.
18 | 60,594 | 27,459 | piplrequest | Gets data from Pipl
19 | 64,243 | 143,384 | tsjobcrawler | Crawls job listing websites for jobs requiring security clearance.
20 | 67,189 | 29,669 | requestmanager | Manages proxies, wait intervals, etc.
21 | 71,170 | 64,836 | effscraper | Scrapes EFF court documents then extracts the plaintext and metadata.
22 | 71,861 | 64,836 | countryconvert | Converts 2-char ISO country codes to 3-char.
23 | 72,176 | 143,384 | termextractor | Extracts entities and terms from any JSON.
24 | 76,127 | 36,098 | sunlightpartytime | Access to Sunlight Foundation's Party Time data.
25 | 76,545 | 49,088 | jsontomap | Converts a JSON into a GeoJSON.
26 | 77,317 | 64,836 | indeedcrawler | Crawls Indeed resumes
27 | 83,902 | 64,836 | acluscraper | Scrapes ACLU court documents then extracts the plaintext and metadata.
28 | 86,736 | 64,836 | jsoncrossreference | Cross-references JSONs and returns the matches
29 | 87,800 | 36,098 | piplcollector | Gets data from Pipl for dir of files
30 | 89,199 | 143,384 | wlsearchscraper | Gets a list of documents from the WikiLeaks search that match certain terms.
31 | 101,086 | 64,836 | datacalc | Some data calculation/manipulation for Transparency Toolkit.
32 | 106,396 | 49,088 | jsoncombiner | Input multiple JSONs, get back one with all the data
33 | 110,044 | 49,088 | jsontochoropleth | Converts a JSON to a world choropleth map.
34 | 111,840 | 143,384 | ttcalc | Calculation functions for Transparency Toolkit.
35 | 112,113 | 49,088 | doc_integrity_check | Encrypts, verifies, and checks hashes of files
36 | 112,569 | 49,088 | sigadparse | Extracts SIGADs from documents
37 | 128,980 | 64,836 | harvesterreporter | Incremental result reporting for Transparency Toolkit
38 | 141,707 | 64,836 | guardianscraper | Scrapes Guardian articles.
39 | 143,940 | 64,836 | indeedscraper | Gets resumes and job listings from Indeed based on search terms and locations.
40 | 146,147 | 143,384 | nametoemail | Gets a list of possible email addresses.
41 | 167,042 | 64,836 | docintegritycheck | Encrypts, verifies, and checks hashes of files