Scrapy Tutorial: Web Scraping Follow Pagination Links

Valdir Stumm Junior

1 min read · January 14, 2021

The Scrapy tutorial part IV: Web Scraping Follow Pagination Links

Developed by Pablo Hoffman and Shane Evans, Scrapy is an open-source Python framework built specifically for web data extraction. With Scrapy spiders, you can download HTML, parse and process the data, and save it in CSV, JSON or XML format.

In our first tutorials, we've focussed on how to extract data from a website. In this video, we'll show you how to crawl a website using Scrapy.

After watching this, you will know:

How to find links in a page
How to create requests to other pages

If you haven't yet, we recommend first watching part I, part II and part III of our tutorial series.

Watch the fourth part of the Scrapy tutorial course.

