Web Crawling Gems

#Total RankDaily RankNameSummary
11,1301,359mechanizeThe Mechanize library is used for automating interaction with websites. Mechanize autom...
25,4736,853anemoneAnemone web-spider framework
36,2907,244link_thumbnailerRuby gem generating thumbnail images from a given URL.
48,3866,554spidrSpidr is a versatile Ruby web spidering library that can spider a site, multiple domain...
511,2257,647wombatGeneric Web crawler with a DSL that parses structured data from web pages
625,40278,133uptonDon't re-write web scrapers every time. Upton gives you a scraper template that's easy ...
772,30046,260fake_useragentSimple gem for generating valid web user agents.
876,12123,603ronin-web-spiderronin-web-spider is a collection of common web spidering routines using the spidr gem.