Learn how Claude skills can automate HTML fetching, AI parsing, selector generation, and structured data extraction to build faster, smarter web scraping workflows.
The demo project scrape2postgresql shows how to scrape structured data with Scrapy, store it in PostgreSQL, and run the spider and the database in separate containers with Docker Compose.
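The container split described above can be sketched as a minimal Docker Compose file. This is an illustrative assumption, not the demo project's actual configuration: service names, credentials, and the build context are all made up here.

```yaml
# Hypothetical sketch: one container for the Scrapy spider, one for PostgreSQL.
# Service names, credentials, and the build context are assumptions.
services:
  db:
    image: postgres:16
    environment:
      POSTGRES_USER: scraper        # assumed credentials
      POSTGRES_PASSWORD: scraper
      POSTGRES_DB: items
  spider:
    build: .                        # assumes a Dockerfile for the Scrapy project
    depends_on:
      - db
    environment:
      # The spider reaches PostgreSQL via the Compose service hostname "db".
      DATABASE_URL: postgresql://scraper:scraper@db:5432/items
```

With a layout like this, `docker compose up` starts both containers on a shared default network, and the spider connects to the database by service name rather than a hard-coded IP.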
What does the future hold for the tool some describe as “the gift that revolutionised web scraping”?
With the emergence of managed data extraction vendors, businesses no longer need to gather web data themselves.
The Strategic Case for Buying Web Data: Quality, Focus, and Scale
Learn how successful open-source projects balance community value with sustainable growth. Industry leaders share insights on monetization, maintenance, and building thriving communities.
Discover how Zyte’s open-source libraries like ClearHTML, Extruct, Chomp.js, and more simplify web data extraction and processing.
Discover the strengths and limitations of Selenium, Puppeteer, and Playwright for web scraping at scale.
Here are four essential Scrapy plugins we use to build efficient web crawlers for our customers.
Web scraping tools save hours of work by automating data extraction, testing web applications, and performing repetitive tasks.
When crawling the web, there’s always a speed limit: a spider can’t fetch pages faster than the host is willing to serve them.
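That speed limit can be made concrete with a back-of-the-envelope bound: sustained throughput can never exceed the number of in-flight requests divided by the host's average response time. The function below is a hypothetical illustration of this arithmetic, not part of any article's code; in Scrapy the analogous knobs are settings such as `CONCURRENT_REQUESTS` and the AutoThrottle extension, which adapts delays to observed latency.

```python
def max_pages_per_second(concurrent_requests: int, avg_response_time_s: float) -> float:
    """Upper bound on fetch rate imposed by the host's response latency.

    With N requests in flight and each taking T seconds to complete,
    the crawler can finish at most N / T pages per second, no matter
    how fast the client side is.
    """
    return concurrent_requests / avg_response_time_s

# With 16 concurrent requests and the host taking 0.5 s per page,
# the ceiling is 32 pages/second regardless of spider-side speed.
print(max_pages_per_second(16, 0.5))  # → 32.0
```

Doubling concurrency doubles the ceiling only as long as the host's response time stays flat; once the server slows down under load, the two effects cancel, which is exactly why polite throttling matters.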
If you’ve been using Scrapy for any length of time, you know the capabilities a well-designed spider can give you.