Web Crawling Gems

#Total RankDaily RankNameSummary
11,1411,977mechanizeThe Mechanize library is used for automating interaction with websites. Mechanize autom...
25,6125,062anemoneAnemone web-spider framework
36,4469,053link_thumbnailerRuby gem generating thumbnail images from a given URL.
48,4227,023spidrSpidr is a versatile Ruby web spidering library that can spider a site, multiple domain...
511,56816,506wombatGeneric Web crawler with a DSL that parses structured data from web pages
626,00926,833uptonDon't re-write web scrapers every time. Upton gives you a scraper template that's easy ...
770,80748,466ronin-web-spiderronin-web-spider is a collection of common web spidering routines using the spidr gem.
872,98665,180fake_useragentSimple gem for generating valid web user agents.