Open-source

Articles from the Zyte blog about Open-source.

Treat your AI skills like software, starting with evals

Most AI skills are never tested — and it shows. Here's how Zyte evaluates scraping skills like real software, catching failures demos miss.

Neha Setia Nagpal · 14 min read · July 8, 2026

Scraping practice

AI won’t fix your data quality (until you answer these three questions)

In our interview, a QA expert warns: before you delegate web scraping quality assurance to AI, make sure you can describe what ‘good’ looks like for yourself.

Neha Setia Nagpal · 10 min read · May 13, 2026

Open-source

The future of Scrapy: Smarter, faster and ready for AI-powered scraping

What does the future hold for the tool some describe as “the gift that revolutionised web scraping”?

Robert Andrews · 6 min read · June 23, 2025

Open-source

Ten years since Scrapy 1.0: The stats and stories behind your favorite framework

See what 10 years of Scrapy 1.0 has built — in milestones and metrics.

Cleber Alexandre · 5 min read · June 5, 2025

Developer interest

The rise of Scrapy: How an open-source scraping framework conquered the web

The story of Scrapy reflects the broader evolution of the web itself and the ongoing quest to harness its ever-expanding ocean of information.

Theresia Tanzil · 10 min read · May 14, 2025

Open-source

A Deep Dive into Zyte's Open-Source Libraries

Discover how Zyte’s open-source libraries like ClearHTML, Extruct, Chomp.js, and more simplify web data extraction and processing.

Neha Setia Nagpal · 1 min read · December 19, 2024

Open-source

4 essential Scrapy plugins for building efficient and effective spiders

Here are four essential Scrapy plugins we use to build efficient web crawlers for our customers.

Neha Setia Nagpal · 1 min read · August 15, 2024

Open-source

The Scraper’s System Part 2: Explorer’s Compass to analyze websites

Learn about the Scraper’s System: the Explorer’s Compass for analyzing websites.

Neha Setia Nagpal · 8 min read · February 16, 2024

Open-source

How Scrapy makes web crawling easy and accurate

Get the best value from your web crawling project by using Scrapy, a framework worth learning and adopting for easy and accurate web crawling.

Attila Toth · 5 min read · July 27, 2021

Open-source

Dateparser: A Little But Powerful Date Parsing Library

Meet Dateparser, a potent date parsing library simplifying date extraction from HTML pages. Ideal for various applications like command-line tools, chatbots, and more.

Marc Hernandez Cabot · 3 min read · May 6, 2021

Open-source

Scrapy Update: Better Broad Crawl Performance

Understand which Scrapy settings help you honor these limits and how to achieve better performance during broad crawls in the presence of these limits.

Nikita Vostretsov · 3 min read · February 18, 2021

Open-source

Building Spiders Made Easy | GUI For Scrapy Shell

We are introducing a new open-source project, Scrapy-GUI. It provides a GUI for Scrapy Shell and makes it easier to write spiders.

Roy Healy · 4 min read · March 3, 2020