Easy Web Spidering in Ruby with Anemone

Courtesy Ruby Inside  Thu, 07/02/2009 - 14:40

anemone Anemone is a free, multi-threaded Ruby web spider framework from Chris Kite , which is useful for collecting information about websites.

With Anemone you can write tasks to generate some interesting statistics on a site just by giving it the URL.

Its only dependency is Nokogiri (an HTML and XML parser).

Other than that, you just need to install the gem to get started using Anemone's simple syntax which, among other things, allows you to tell...


 

More related items

PHPMaster.com: Bending XML to Your Will
PHPMaster.com has a new tutorial posted today about "bending XML to your will" - working with XML data using the XML Parser and SimpleXML functionality already included with PHP. If you've...

Christian Schaefer's Blog: Simply iterate over XML...
In a new post to his Test.ical.ly blog Christian Schaefer shows you how to iterate over XML in a more efficient way with the help of the XMLReader and Iterator features that come with PHP. ...

SemanticScuttle performance optimizations - Christian...
This week I worked on lowering the number of queries that SemanticScuttle needs to deliver a bookmark listing page. Just looking at the front page of a SemanticScuttle instance with 10 or...


 

Post new comment

The content of this field is kept private and will not be shown publicly.
computer-internet.marc8.com