Scrapy is a fast and open source high-level screen scraping and web crawling framework, used to crawl websites and extract structured data from their pages.
Scrapy can be used for a wide range of purposes, from data mining to monitoring and automated testing.
Here are some key features of "Scrapy":
Simple:
· Scrapy was designed with simplicity in mind, by providing the features you need without getting in your way.
Productive:
· Just write the rules to extract the data from web pages and let Scrapy crawl the entire web site for you.
Fast:
· Scrapy is used in production crawlers to completely scrape more than 500 retailer sites daily, all in one server.
Extensible:
· Scrapy was designed with extensibility in mind and so it provides several mechanisms to plug new code without having to touch the framework core.
Portable:
· Scrapy runs on Mac, Linux, Windows, and BSD.
Open Source and 100% Python:
· Scrapy is completely written in Python, which makes it very easy to hack.
Well-tested:
· Scrapy has an extensive test suite with very good code coverage.
Requirements:
What's New in This Release: [ read full changelog ]
· forgot to include pydispatch license. #118 (commit fd85f9c)
· include egg files used by testsuite in source distribution. #118 (commit c897793)
· update docstring in project template to avoid confusion with genspider command, which may be considered as an advanced feature. refs #107 (commit 2548dcc)
· added note to docs/topics/firebug.rst about google directory being shut down (commit 668e352)
· Merge branch ‘0.14’ of github.com:scrapy/scrapy into 0.14 (commit 835d082)
· dont discard slot when empty, just save in another dict in order to recycle if needed again. (commit 8e9f607)
· do not fail handling unicode xpaths in libxml2 backed selectors (commit b830e95)
· fixed minor mistake in Request objects documentation (commit bf3c9ee)
· fixed minor defect in link extractors documentation (commit ba14f38)
· removed some obsolete remaining code related to sqlite support in scrapy (commit 0665175)
Free Download :http://mac.softpedia.com/get/Development/HTML/Scrapy.shtml
No comments:
Post a Comment