Easy Web Spidering in Ruby with Anemone

Courtesy Ruby Inside  Thu, 07/02/2009 - 14:40

anemone Anemone is a free, multi-threaded Ruby web spider framework from Chris Kite , which is useful for collecting information about websites.

With Anemone you can write tasks to generate some interesting statistics on a site just by giving it the URL.

Its only dependency is Nokogiri (an HTML and XML parser).

Other than that, you just need to install the gem to get started using Anemone's simple syntax which, among other things, allows you to tell...


 

More related items

Abhinav Singh's Blog: How to use locks in PHP cron...
In this new post from Abhinav Singh on how to use file locking to keep your cron jobs from trying to use the same resources. Cron jobs are hidden building blocks for most of the websites....

Monitoring CPU Utilization of Virtual Machines in Xen...
Recently we have been experimenting with the Xen hypervisor. In my testing, I have found that Linux performance is better on Xen than VMWare and we are considering it for Linux rollouts....

Community News: Latest Releases from PHPClasses.org
phpMudnames AJAX Paginator PHP Mud Names Ray Feed Reader Currency Converter PHP Simple Large XML Parser QGoogleVisualizationAPI 2009 Send mail from any SMTP server PHP - GD Watermarker Option...

SanDisk SDMSM2-008G-A11M 8GB M2 Memory Stick Micro...
The Memory Stick Micro (M2) is approximately one-third the size of the existing Memory Stick PRO Duo. The Memory Stick M2 continues the Memory Stick PRO Duo legacy by supporting full...

Linksys WRT610N Simultaneous Dual-N Band Wireless...
The sleek Simultaneous Dual-N Band WRT610N Router sets a new standard for design, expanded bandwidth, and robust performance. It's the ideal router for all your current and future digital...


 

Post new comment

The content of this field is kept private and will not be shown publicly.
computer-internet.marc8.com