Using Scrapy to Crawl Websites

Scrapy homepage: http://scrapy.org/. The project also has a blog with the most recent updates: http://blog.scrapy.org/

Two parts of Scrapy have been split out into separate libraries: w3lib and scrapely. Both have their own PyPI pages.

After these changes, the Scrapy codebase was reduced by 4574 lines, including blank lines and comments (according to cloc).

The Scrapy team also offers a sample project that shows how to use Scrapy: https://github.com/scrapy/dirbot
