Articles, interviews and analysis on how data is gathered, used and fought over — written by the people closest to it.

New Changes to Our Scrapy Cloud Platform - Stay up-to-date with the latest changes to Scrapy Cloud. Enhance your web scraping workflow with new features.

Introducing ScrapyRT: An API for Scrapy Spiders - Make the most of your Scrapy spiders with ScrapyRT. Explore its functionalities as an API for seamless integration.

Looking Back at 2014 - Take a trip down memory lane and see the milestones and breakthroughs at Zyte in 2014.

XPath is helpful for web scraping, allowing you to write specifications more flexibly than with CSS selectors. This tutorial is packed with XPath tips and examples.

Introducing Data Reviews - Empower your decision-making with data reviews. Gain insights into data quality and credibility for your web scraping projects.

Web pages are full of data. Microdata markup helps machines understand pages. Schema.org supports a set of schemas for structured data markup on web pages.

As you can see, Portia allows you to visually configure what’s crawled and extracted in a very natural way. It provides immediate feedback, making the process

We use the scikit-learn library for various machine-learning tasks at Zyte. For example, for text classification we'd typically build a statistical

Open Source at Zyte - Embrace the open-source movement at Zyte. Learn how we contribute to the community and promote transparency.

Looking Back at 2013 - Join us as we reflect on the highlights and achievements of Zyte in 2013. See how far we've come in the web scraping industry.

Marcos Campal is a Zyteber - Get to know one of our talented team members, Marcos Campal. Learn about his contributions to the world of web scraping.

Introducing Dash - Discover Dash, a new tool designed to simplify web scraping. Learn how to leverage its capabilities for better data extraction.