PINGDOM_CHECK

#ExtractSummit2026 The world's largest web scraping conference returns. Austin Oct 7–8 · Dublin Nov 10–11.

Register now
Data Services
Pricing
Login
Try Zyte APIContact Sales
  • Unblocking and Extraction

    Zyte API

    The ultimate API for web scraping. Avoid website bans and access a headless browser or AI Parsing

    Ban Handling

    Headless Browser

    AI Extraction

    Enterprise

    DocumentationSupport

    Hosting and Deployment

    Scrapy Cloud

    Run, monitor, and control your Scrapy spiders however you want to.

    Coding Agent Add-Ons

    Agentic Web Data

    Plugins that give coding agents the context to build production Scrapy projects. Starts with Claude Code.

  • Data Services
  • Pricing
  • Blog

    Learn

    Case Studies

    Webinars

    Videos

    White Papers

    Join our Community
    Web scraping APIs vs proxies: A head-to-head comparison
    Blog Post
    The seven habits of highly effective data teams
    Blog Post
  • Product and E-commerce

    From e-commerce and online marketplaces

    Data for AI

    Collect and structure web data to feed AI

    Job Posting

    From job boards and recruitment websites

    Real Estate

    From Listings portals and specialist websites

    News and Article

    From online publishers and news websites

    Search

    Search engine results page data (SERP)

    Social Media

    From social media platforms online

  • Meet Zyte

    Our story, people and values

    Contact us

    Get in touch

    Support

    Knowledge base and raise support tickets

    Terms and Policies

    Accept our terms and policies

    Open Source

    Our open source projects and contributions

    Web Data Compliance

    Guidelines and resources for compliant web data collection

    Join the team building the future of web data
    We're Hiring
    Trust Center
    Security, compliance & certifications
Login
Try Zyte APIContact Sales

Zyte Developers

Coding tools & hacks straight to your inbox

Become part of the community and receive a bi-weekly dosage of all things code.

Join us
    • Zyte Data
    • News & Articles
    • Search
    • Social Media
    • Product
    • Data for AI
    • Job Posting
    • Real Estate
    • Zyte API - Ban Handling
    • Zyte API - Headless Browser
    • Zyte API - AI Extraction
    • Web Scraping Copilot
    • Zyte API Enterprise
    • Scrapy Cloud
    • Solution Overview
    • Blog
    • Webinars
    • Case Studies
    • White Papers
    • Documentation
    • Web Scraping Maturity Self-Assesment
    • Web Data compliance
    • Meet Zyte
    • Jobs
    • Terms and Policies
    • Trust Center
    • Support
    • Contact us
    • Pricing
    • Do not sell
    • Cookie settings
    • Sign up
    • Talk to us
    • Cost estimator
Home
Blog
Data in 2026: Bets and forecasts from web experts
Light
Dark

Data in 2026: Bets and forecasts from web experts

Posted on
December 31, 2025
What will 2026 have in store for the world of web data? We asked eight experts for their predictions.
By
Robert Andrews
Agents, agents everywhereWebsites tighten up and lock downReasoning gets a re-think?Market-defining legal cases
×

Try Zyte API

Zyte proxies and smart browser tech rolled into a single API.
Start FreeFind out more
Subscribe to our Blog
Table of Contents

It was a 2025 in which AI-assisted web scraping truly arrived, quantum computing took a leap forward and large parts of the web routinely went dark as sites’ reliance on a few big infrastructure providers became clear to see.


But what will 2026 have in store for the web we know and love? Arguably, the biggest shake-up in its three-decade history.


I asked a variety of web data thinkers to give me their predictions for the year ahead.

Agents, agents everywhere

The rise of AI-assisted web scraping in 2025 sets the stage for a new era. “Agent” hype may already have reached fever pitch. In 2026, though, experts predict a shift toward truly automated web processes, powered by sophisticated AI agents that handle everything.

Jan Seidler, Chief Technology Officer, Zyte

“I think 2026 is the year web data becomes truly automated. Not just faster scrapers, but AI creating, fixing and scaling them - from a site name to working production code, and then keeping it running as the web changes.


“Agent-based workflows will move from demos to everyday tools, including real browser control, app data access and self-healing maintenance.”


Read more: Why your agent deserves a wallet

Ayan Pahwa, Developer Advocate, Zyte

“2026 will be prominent for agentic AI workflows - armies of AI agents doing the heavy lifting - from research to execution to reporting - talking among each other using MCP, A2A etc. Imagine a business running 12 different agents managing leads, accounting, ad ops etc - the founder wakes up to dashboards instead of to-do lists.


“Maintaining these agents will probably be a full-time job. I’m thinking, reliability engineering for AI agents.”

Daniel Cave, Product Marketing Manager, Zyte

“We may see a shift away from building sites exclusively for human consumption and more adoption of agent-first design practices, such as using OpenAI's new shopping cart protocol. How this plays into scraping will be interesting.”


Read more: Why AI agents struggle with web scraping (and how to help them)

Websites tighten up and lock down

As data-gathering technology evolves, so, too, do the defenses. Experts predict new automated solutions and access policies will fuel a different relationship between site and scraper.

Fabien Vauchelles, web scraping expert, Scrapoxy

"The barrier to entry is getting higher and higher every year. I think we will move toward a closed internet in the coming years.


“The future is pretty clear. We will have major websites which will be accessible for AI agents, but everyone else will be locked out."

Akshay Philar, Head of Engineering, Zyte

"The growing adoption of RASP, polymorphic obfuscation, and WebAssembly will further complicate reverse engineering. 


“As agents become more adept at handling CAPTCHAs, vendors that currently rely on proof-of-work may pivot toward alternative mechanisms that preserve user experience while distinguishing humans from bots.”

Iain Lennon, Chief Product Officer, Zyte

“Anti-bot solutions will continue to increase the rate of changes to their configurations. Software automation to respond and handle these will increasingly become vital.


“Separately, businesses using web data will increasingly expect to integrate web data pipelines into enterprise architectures, and with enterprise levels of engineering control.”


Read more: Beyond the block: The front line of data access

Reasoning gets a re-think?

Large Language Models, including those with “reasoning” capability, are predicated on “token” prediction - but some experts now think that approach may not catapult AI to the much-vaunted echelons of super-intelligence.

Iván Sánchez, Senior Data Scientist, Zyte

“I'm pretty sure we will get some new AI models from China.


“The only question that I have for 2026 is if the current method of AI reasoning via token prediction will be the paradigm that will stay, the new norm, or will we have something new, a new way of thinking, or maybe parallel thinking?”


Read more: AI and the web: What 2025 changed and what comes next

Market-defining legal cases

While 2025 brought some legal clarity to the issue of scraping for AI systems, experts are looking to 2026 to bring further certainty.

Sanaea Daruwalla, Chief Legal & People Officer, Zyte

“We will continue to see a lot of copyright lawsuits as it relates to data to train AI. Web scrapers will need to continue to follow these cases and ensure compliance as new rulings take place. 


“Additionally, we will also see copyright holders try to monetize and license their content, with things like Really Simple Licensing, so we will see more on the legality of this and how it will play out in 2026.”


Read more: Balancing innovation and regulation in data scraping

×

Try Zyte API

Zyte proxies and smart browser tech rolled into a single API.
Start FreeFind out more

Get the latest posts straight to your inbox

No matter what data type you're looking for, we've got you

G2.com

Capterra.com

Proxyway.com

EWDCI logoMost loved workplace certificateZyte rewardISO 27001 iconG2 rewardG2 rewardG2 reward

© Zyte Group Limited 2026