Valdir Stumm Junior
2 min read ·
The Scrapy tutorial part III: How To Scrape Data From Multiple Web Pages
Developed by Pablo Hoffman and Shane Evans, Scrapy is an open-source python framework built specifically for web data extraction. With Scrapy spiders, you are able to download HTML, parse and process the data and save it in either CSV, JSON, or XML file formats.
So far you've learned how to build a Scrapy spider to extract an item from a page. Now you will learn how to scrape multiple items from the same page.
After watching this, you will know:
If you haven't yet, we recommend you to first watch part I and part II of our tutorial series.
Scrapy is an open-source Python framework designed for web data extraction.
Scrapy can save data in CSV, JSON, or XML file formats.
Iterating over page elements and extracting data from repeating elements.
Watch parts I and II of the tutorial series for foundational knowledge.
More learn articles
Use caseLearn what residential proxies are, how they compare to datacenter proxies, and why modern web scraping needs more than IP diversity.
10 min read
Use caseLearn how much rotating proxies cost, what affects pricing, and why total web scraping costs often go beyond proxy subscriptions.
10 min read
Use caseLearn how rotating proxies work, when to use them for web scraping, and why IP rotation alone is not enough for reliable data access.
10 min read
G2.com