Explore resources by topic or category
Learn
A data scientist's guide to stress-free product scraping
John Rooney
March 4, 2026
As a data scientist, your job is to find patterns, build models, and generate insights. To do that, you first need to reliably acquire web data. Competitor pricing, product specifications, consumer reviews - you name it, data scientists need it.
Learn
Why Python Requests gets "403 Forbidden"
John Rooney
March 2, 2026
If you’ve had an HTTP request blocked despite using correct headers, cookies, and good IPs, there’s a chance you are running into one of the simplest forms of blocking, and one of the most confusing for beginners.
Learn
Building a production-style web scraper with Scrapy, Docker, and PostgreSQL
Ayan Pahwa
March 2, 2026
Demo project scrape2postgresql shows how to scrape structured data with Scrapy, store it in PostgreSQL, and run both the spider and database in separate containers using Docker Compose.
Blog
Teaching AI to scrape like a pro: how we measure LLMs’ data quality
Theresia Tanzil
10 mins
February 23, 2026
AI-enabled code editors can now conjure scraping code on command. But is it any good? Here’s how Zyte re-engineered LLMs with Web Scraping Copilot to drive best-in-class output.
Learn
Web scraping for pricing intelligence: how to track competitor prices at scale
Mitch Holt
10 mins
February 2, 2026
Learn
Best headless browsers for web scraping in 2026
10 mins
January 27, 2026
Compare the best headless browsers for web scraping in 2026. Learn when to use Playwright, Puppeteer, Selenium, or Zyte API’s managed CDP browser for scalable, anti-ban scraping.
Learn
Best proxy providers for web scraping in 2026
10 mins
January 16, 2026
Compare the best proxy providers for web scraping in 2026. Learn which residential, ISP, and mobile proxies work best—and when teams move beyond proxies to automation.
Learn
The Modern Scrapy Developer's Guide (Part 3): Auto-Generating Page Objects with the Web Scraping Copilot
John Rooney
December 16, 2025
In this guide, we'll show you how to use Web Scraping Copilot (our VS Code extension) to automatically write 100% of your Items, Page Objects, and even your unit tests.
Learn
The Modern Scrapy Developer's Guide (Part 2): Page Objects with scrapy-poet
John Rooney
December 16, 2025
In this guide, we'll fix this by refactoring our spider to a professional, modern standard using Scrapy Items and Page Objects (via scrapy-poet). We will completely separate our crawling logic from our parsing logic.
Learn
The Modern Scrapy Developer's Guide (Part 1): Building Your First Spider
John Rooney
December 16, 2025
In this definitive guide, we will walk you through, step by step, how to build a real, multi-page crawling spider. You will go from an empty folder to a clean JSON file of structured data in about 15 minutes.
Blog
AI Web Scraping as the Future of Scalable Data Collection
Karlo Jedud
5 mins
September 4, 2025
AI-powered web scraping is transforming data collection by making it faster, smarter, and highly scalable. Learn how it overcomes traditional scraping challenges and unlocks new opportunities for businesses across industries.
Learn
How to Scrape Search Engine Results
Karlo Jedud
5 mins
August 25, 2025
From SEO audits to market intelligence, lead generation, and even brand monitoring, structured SERP data can give you the insights you need to make smarter, faster business decisions. But scraping search engines isn't as simple as sending a GET request and collecting some HTML.