The latest from Theresia Tanzil

Open Source
The new economics of web data: Smaller scraping just got cheaper
Smarter tools and AI-driven automation are rewriting the rules of web scraping. As costs fall and setup barriers vanish, smaller teams can now compete at scale, reshaping how the web’s data economy works.
October 6, 2025

How to Plan Your Web Scraping Project Like a Product Manager
Most web scraping projects that collapse don't fail because of technical incompetence. They fail because teams treat data extraction like a coding sprint rather than a product launch.
September 15, 2025

Leadership
The DQ playbook: How ‘data quality’ fuels business’ pursuit of precision
The practice of data quality (DQ) is emerging as a key discipline businesses can use to understand and improve the provenance of the content they collect.
August 14, 2025

Use case
How price extraction is fuelling insights for modern retailers
Retail pricing has long combined data, experience, and instinct – but today’s market volatility demands a faster, smarter approach.
July 23, 2025

Leadership
Four sweet spots for AI in web scraping
Discover how AI and LLMs are enhancing web scraping with smarter crawling, fuzzy data extraction, automated spider generation, and intelligent QA.
July 14, 2025

Leadership
From script to system: 10 building blocks to scale web scraping
Scaling your business’ web data gathering – acquiring, monitoring and storing a growing amount of data from a growing number of sources over time – requires holistic planning.
June 30, 2025

Open Source
Quality, focus and scale: Three ways data outsourcing benefits businesses
The Strategic Case for Buying Web Data: Quality, Focus, and Scale
June 11, 2025

Leadership
What’s your data type? Solving the procurement problem
Engagements with data suppliers break down when buyers don’t have a clear project concept. Understanding and articulating your needs is paramount. Meet the three types of data buyers. Which one are you?
May 22, 2025

Leadership
The rise of Scrapy: How an open-source scraping framework conquered the web
The story of Scrapy reflects the broader evolution of the web itself and the ongoing quest to harness its ever-expanding ocean of information.
May 14, 2025

How To
Browser bother: Three painkillers for headless scraping headaches
This article shares three strategies to operationalize large-scale browser automation yourself and what alternatives exist.
March 19, 2025

Leadership
Buy or Build? The Four Roads to Acquiring Web Data
Weighing your options from full control to full service
February 21, 2025

Use case
Beyond Hello World: The Operational Gaps in LLM-Powered Scraping Tools
The difference between writing a scraper and running a scraping operation
February 7, 2025

Announcement
Handling Bans
Has Your Google Scraper Stopped Working? Here’s What You Need to Know
Discover how to adapt to Google’s January 2025 SERP changes requiring JavaScript rendering.
January 23, 2025

Announcement
Web Data Extract Summit 2024: What Did You Miss?
Business and technical insights from key players and experts in web data extraction.
January 3, 2025