PINGDOM_CHECK

#ExtractSummit2026 The world's largest web scraping conference returns. Austin Oct 7–8 · Dublin Nov 10–11.

Register now
Data Services
Pricing
Login
Try Zyte APIContact Sales
  • Unblocking and Extraction

    Zyte API

    The ultimate API for web scraping. Avoid website bans and access a headless browser or AI Parsing

    Ban Handling

    Headless Browser

    AI Extraction

    SERP

    Enterprise

    DocumentationSupport

    Hosting and Deployment

    Scrapy Cloud

    Run, monitor, and control your Scrapy spiders however you want to.

    Coding Agent Add-Ons

    Agentic Web Data

    Plugins that give coding agents the context to build production Scrapy projects. Starts with Claude Code.

  • Data Services
  • Pricing
  • Browse

    • BlogArticles, podcasts, videos
    • Case studiesCustomer outcomes
    • White papersIn-depth reports
    • EventsConferences, webinars, recordings

    Subscribe

    • NewsletterSwiftly delivered
    • Discord communityExtract Data community
  • Product and E-commerce

    From e-commerce and online marketplaces

    Data for AI

    Collect and structure web data to feed AI

    Job Posting

    From job boards and recruitment websites

    Real Estate

    From Listings portals and specialist websites

    News and Article

    From online publishers and news websites

    Search

    Search engine results page data (SERP)

    Social Media

    From social media platforms online

  • Meet Zyte

    Our story, people and values

    Contact us

    Get in touch

    Support

    Knowledge base and raise support tickets

    Terms and Policies

    Accept our terms and policies

    Open Source

    Our open source projects and contributions

    Web Data Compliance

    Guidelines and resources for compliant web data collection

    Join the team building the future of web data
    We're Hiring
    Trust Center
    Security, compliance & certifications
Login
Try Zyte APIContact Sales

Zyte Developers

Coding tools & hacks straight to your inbox

Become part of the community and receive a bi-weekly dosage of all things code.

Join us
    • Zyte Data
    • News & Articles
    • Search
    • Social Media
    • Product
    • Data for AI
    • Job Posting
    • Real Estate
    • Zyte API - Ban Handling
    • Zyte API - Headless Browser
    • Zyte API - AI Extraction
    • Web Scraping Copilot
    • Zyte API Enterprise
    • Scrapy Cloud
    • Solution Overview
    • Blog
    • Webinars
    • Case Studies
    • White Papers
    • Documentation
    • Web Scraping Maturity Self-Assesment
    • Web Data compliance
    • Meet Zyte
    • Jobs
    • Terms and Policies
    • Trust Center
    • Support
    • Contact us
    • Pricing
    • Do not sell
    • Cookie settings
    • Sign up
    • Talk to us
    • Cost estimator
All articles
AI60, 60 articles
Data quality13, 13 articles
Developer interest57, 57 articles
Integration2, 2 articles
Open-source40, 40 articles
Proxies29, 29 articles
Scraping practice17, 17 articles
Scraping strategy26, 26 articles
Web data60, 60 articles
Web scraping APIs33, 33 articles
Zyte API59, 59 articles
Scrapy48, 48 articles
Scrapy Cloud10, 10 articles
Web Scraping Copilot12, 12 articles
AI & Machine Learning1, 1 articles
Automotive2, 2 articles
E-commerce & retail26, 26 articles
Entertainment & Streaming2, 2 articles
Financial Services8, 8 articles
Government2, 2 articles
Market Research & Intelligence3, 3 articles
Media & publishing8, 8 articles
Real Estate2, 2 articles
Recruitment & HR3, 3 articles
Transportation & Logistics2, 2 articles
Travel & hospitality2, 2 articles
Extract Summit25, 25 articles
PyCon1, 1 articles

Appearance

Discord Community
BlogOpen-sourceThe future of Scrapy: Smarter, faster and ready for AI-powered scraping
ArticleViewpointOpen-sourceTool-assisted coding

The future of Scrapy: Smarter, faster and ready for AI-powered scraping

What does the future hold for the tool some describe as “the gift that revolutionised web scraping”?

Robert Andrews · Senior editor

6 min read · June 23, 2025

The future of Scrapy: Smarter, faster and ready for AI-powered scraping

It’s the little framework that was created for a single task but grew into the web scraping tool of choice for millions of developers.

Ten years after it reached version 1.0, Scrapy has amassed 82 million downloads and almost 11,000 commits, thanks to more than 575 open-source contributors.

But what does the future hold for the tool some describe as “the gift that revolutionised web scraping”?

Recent updates

In the 2020s, Scrapy has already been evolving as a modern framework, with more recent additions including:

  1. Asynchronous powerhouse: Scrapy has been adding support for asyncio, allowing use of myriad asyncio-based libraries.

  2. Headless browsers and beyond: Modern web scraping often involves rendering JavaScript-heavy pages. Scrapy already plays nicely with tools like Playwright, making it competitive with newer, browser-based scraping frameworks. The Scrapy API supports custom downloaders, making it easy to develop support for Playwright and future browser automation tools.

  3. Ecosystem of extensions: Scrapy’s plugin architecture has evolved, with powerful middlewares and add-ons that extend its functionality, from handling CAPTCHAs to integrating with cloud-based storage and data pipelines.

That is thanks partly to Scrapy’s steward, Zyte, but also to its active community of developers building on what was always an inherent extensibility.

Meanwhile, to celebrate the tenth anniversary of Scrapy’s 1.0 version, June 2025 also saw a major facelift for Scrapy’s own website.

Principles for the future

The future will build on Scrapy as a foundational scraping framework, in a largely evolutionary approach.

“The focus ahead is on modernizing Scrapy while staying true to its original principles,” according to Scrapy’s co-originator Shane Evans, CEO, Zyte.

Mikhail Korobov, head of development, Zyte, agrees. “We will keep doing what it’s doing, but better,” he says. That means:

  • Continuing Scrapy’s heritage of extensibility.

  • Out-of-the-box usability - effective default options to speed up time to results.

  • Compatibility with the general Python ecosystem.

  • A simple programmatic API for developers.

“In general, we prefer extensibility over hard-coded, out-of-box features,” Korobov adds.

Future features

The team has some specific developments in mind. As Evans puts it: “That means enhancing support for modern web technologies (especially JavaScript-heavy sites), improving integrations with headless browsers like Playwright, and continuing to streamline the developer experience – particularly around configuration and observability.”

Execution control

“We are planning enhancements to retry logic, rate limiting, delay handling, etc,” Korobov says.

The changes would improve the way Scrapy deals with failure, speed, and politeness so it can scrape the web more smoothly, more reliably, and more respectfully.

Modernised engine

“We’re rewriting Scrapy core from Twisted to asyncio primitives; the plan is to eventually make the Twisted reactor optional,” says Korobov.

Such a move would help make Scrapy simpler, faster, and more compatible with asynchronous Python code that has become the popular modern go-to.

Better organization

Mikhail Korobov reveals: “We’re exploring different ways to organize the web scraping code, such as page objects (web-poet library), spider templates, etc.

“Scrapy spiders are easy to get started, but having all the code in a single spider class also can get in the way.

“These “new” paradigms will allow developers to make web scraping projects more maintainable in the long run. They also turn out to be a better fit for various AI tools.”

AI-ready, closer to the user

Beyond those updates, two key motions are on the horizon, each aimed at making scraping easier.

Artificial intelligence is now a key area of focus. As Shane Evans says: “We’re thinking about how Scrapy can stay relevant in a world increasingly shaped by AI.”

The world is witnessing an explosion of tools like Cursor, Windsurf, OpenAI’s Codex and Claude Code for accelerating and improving code writing.

At Zyte, AI-assisted coding capabilities are already making Zyte’s own developers up to twice as productive.

Now the company wants to bring the same powers to writing scraping code.

So Zyte is working to:

  1. Enhance its AI tools for easy spider code creation, available to both non-technical users and in developer tooling.

  2. Make Scrapy features more immediately accessible in code editors.

“We’d like to help people to generate web scraping code, and Scrapy is a foundation framework for this, alongside libraries like web-poet,” Korobov explains.

Ready for the future

Scrapy, then, is ready for a future in which its historic extensibility and its central utility at the heart of scraping operations get augmented by a new wave of services, making its capabilities more immediate than ever.

“I think it plays really well with a lot of the AI tooling that we're seeing nowadays,” Evans says.

While the future may see scraping developers conjure spider code effortlessly, Evans believes, underneath the hood, it will be Scrapy “at the core of it”.

Try Zyte API

Build your first scraper in minutes

Free trial, no credit card. From a single request to production in an afternoon.

Get started
Open-sourceTool-assisted coding

Robert Andrews

Senior editor

More from this author

In this article

  • Recent updates
  • Principles for the future
  • Future features
  • Execution control
  • Modernised engine
  • Better organization
  • AI-ready, closer to the user
  • Ready for the future

Follow

Get the latest

Zyte and the data web in your inbox — or wherever you already are.

Subscribe

Or follow elsewhere

Continue reading

Scrapy in 2026: New release brings modern async crawling standards
Open Source

Scrapy in 2026: New release brings modern async crawling standards

Scrapy 2.14.0 is released with a major under-the-hood modernization. Say goodbye to Twisted Deferreds.

Robert Andrews·6 min·January 12, 2026
The new economics of web data: Smaller scraping just got cheaper
Open Source

The new economics of web data: Smaller scraping just got cheaper

Smarter tools and AI-driven automation are rewriting the rules of web scraping. As costs fall and setup barriers vanish, smaller teams can now compete at scale, reshaping how the web’s data economy works.

Theresia Tanzil·2 mins·October 6, 2025
A Deep Dive into Zyte's Open-Source Libraries
Open Source

A Deep Dive into Zyte's Open-Source Libraries

Discover how Zyte’s open-source libraries like ClearHTML, Extruct, Chomp.js, and more simplify web data extraction and processing.

Neha Setia Nagpal·1 mins·December 19, 2024

The Community · Newsletter

The best of Zyte and the data web, in your inbox.

One curated edition — new articles, product updates, and the stories shaping the data web. No noise.

G2.com

Capterra.com

Proxyway.com

EWDCI logoMost loved workplace certificateZyte rewardISO 27001 iconG2 rewardG2 rewardG2 reward

© Zyte Group Limited 2026