scrapinghub/portia · GitHub
Extracted Page: https://github.com/scrapinghub/portia
Visual scraping for Scrapy.
Portia is a tool for visually scraping web sites without any programming knowledge. Just annotate web pages with a point and click editor to indicate what data you want to extract, and portia will learn how to scrape similar pages from the site.
Portia has a web based UI served by a Twisted server, so you can install it on almost any modern platform.
- Python 2.7
- Works on Linux, Windows, Mac OSX, BSD
- Supported browsers: Latest versions of Chrome (recommended) or Firefox
There are two main components in this repository, slyd and
Additional text has been truncated due to copyright reasons. Things without URLs and private things don't get truncated.