Explore resources by topic or category
Browse by Category
Browse by topic
Blog
More data, more trouble: How a perfect corpus corrupted my AI dream
Neha Setia Nagpal
10 min
March 13, 2026
What a failed experiment taught me about curated data, prompting, and when scraping actually matters.
Blog
Claude skills, MCP or Web Scraping Copilot: Which should you choose?
John Rooney
10 min
March 11, 2026
Three ways to bring Zyte-powered web data into your AI workflow — from production spiders to conversational extraction.
Blog
Supercharging web scraping with Claude skills
John Rooney
10 min
March 11, 2026
Learn how Claude skills can automate HTML fetching, AI parsing, selector generation, and structured data extraction to build faster, smarter web scraping workflows.
Learn
Building a production-style web scraper with Scrapy, Docker, and PostgreSQL
Ayan Pahwa
March 2, 2026
Demo project scrape2postgresql shows how to scrape structured data with Scrapy, store it in PostgreSQL, and run both the spider and database in separate containers using Docker Compose.
Blog
Scrapy in 2026: New release brings modern async crawling standards
Robert Andrews
6 min
January 12, 2026
Scrapy 2.14.0 is released with a major under-the-hood modernization. Say goodbye to Twisted Deferreds.
Blog
The new economics of web data: Smaller scraping just got cheaper
Theresia Tanzil
2 mins
October 6, 2025
Smarter tools and AI-driven automation are rewriting the rules of web scraping. As costs fall and setup barriers vanish, smaller teams can now compete at scale, reshaping how the web’s data economy works.
Blog
The future of Scrapy: Smarter, faster and ready for AI-powered scraping
Robert Andrews
6 min
June 23, 2025
What does the future hold for the tool some describe as “the gift that revolutionised web scraping”?
Blog
Rise of the Data Vendor: How Outsourcing is Transforming Supply and Fuelling Businesses
Robert Andrews
6 min
June 20, 2025
With the emergence of managed data extraction vendors, businesses no longer need to gather web data themselves.
Blog
Quality, focus and scale: Three ways data outsourcing benefits businesses
Theresia Tanzil
8 min
June 11, 2025
The Strategic Case for Buying Web Data: Quality, Focus, and Scale
Blog
What AI Builders Need to Know About the Training Data Copyright Debate
Sanaea Daruwalla
6 min
June 9, 2025
The generative AI gold rush is upon us, with astounding new products and capabilities emerging that are fuelled by web data.
Blog
Ten years since Scrapy 1.0: The stats and stories behind your favorite framework
Cleber Alexandre
5 mins
June 5, 2025
See what 10 years of Scrapy 1.0 has produced — in milestones and metrics - as it became the most-used open source web scraping framework in the world.
Blog
A Deep Dive into Zyte's Open-Source Libraries
Neha Setia Nagpal
1 mins
December 19, 2024
Discover how Zyte’s open-source libraries like ClearHTML, Extruct, Chomp.js, and more simplify web data extraction and processing.