CASE STUDY

Kinzen: Providing structured news data for personalized feeds with AI

Curated personalized news content

News sources are abundant today: they are available at readers' fingertips, and readers have even become publishers themselves. The way people consume news is changing, with users seeking alternatives to noise, privacy issues, and social media. There is demand for trusted journalism and for information that fits readers' interests. According to the Digital News Report 2019, 41% of readers have checked accuracy by comparing multiple sources, and 24% said they stopped using news sources with poor reputations altogether.


With more and more people accessing the news through smart devices and hungry for personalized content, categorizing the colossal amount of news data produced every day is a problem.

Zyte has provided us with over 10 million articles for our technology to process. The data is there, constant and reliable. Collaboration with Zyte has been easy and customer support was very responsive throughout our journey. Feel like we struck gold with Zyte.

Ru Hickson

Data Engineer at Kinzen

About

Kinzen is a technology company that helps readers engage with the publishers who inform, inspire and empower them. They build tools for individuals and publishers to access and present personalized and trusted news. These tools provide structured news data for personalized feeds, backed by artificial intelligence and algorithms, based on a user's preferences and on what experts are interested in.

Using multiple APIs that focus on user signals, Kinzen powers newsletters on behalf of publishers, personalizing the connection between publisher and reader. This gives readers the content they want and helps publishers better connect with their subscribers.

In this case study, learn how Zyte became a crucial part of building and maintaining Kinzen's news data pipeline using Zyte API with Extraction. Zyte's recently launched Automatic Extraction provides customers with AI-enabled, automated web data extraction at scale. Using machine learning, Automatic Extraction can extract millions of news articles in a fraction of the time it would take a developer to do manually.

Challenges

Due to the nature of its business, Kinzen needs to deliver quality sources of information that the reader can trust and help their publisher partners better connect with their readers. This requires gathering a lot of news data from thousands of different sources across the web. It is the data engineers at Kinzen who are responsible for maintaining quality data pipelines so that their APIs run effectively. Kinzen must source millions of news items daily, which requires extracting the world’s news data accurately and reliably.


For their product and APIs to be successful, quick access to news data was paramount for Kinzen. The key challenges facing them were therefore speed and scale. They also had to consider that the list of sources in their directory is constantly growing and evolving. News publishers’ websites are updated in real time, and a wide range of measures is needed to wrangle the data into a consistent, uniform format.
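As an illustration of what wrangling news data into a uniform format can involve, here is a minimal sketch, assuming each source delivers article records as dictionaries with differing key names. The `Article` type, the `normalize_article` helper, and every field name below are hypothetical and are not taken from Kinzen's or Zyte's actual schema.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Optional


@dataclass(frozen=True)
class Article:
    """Hypothetical uniform record that downstream APIs would consume."""
    url: str
    headline: str
    body: str
    published_at: Optional[datetime]


# Hypothetical alternative key names that different sources might use.
_HEADLINE_KEYS = ("headline", "title", "name")
_BODY_KEYS = ("body", "articleBody", "text")
_DATE_KEYS = ("published_at", "datePublished", "pubDate")


def _first_present(record: dict, keys: tuple) -> Optional[str]:
    """Return the first non-empty string found under any of the keys."""
    for key in keys:
        value = record.get(key)
        if isinstance(value, str) and value.strip():
            return value.strip()
    return None


def _parse_date(raw: Optional[str]) -> Optional[datetime]:
    """Parse an ISO 8601 timestamp into UTC; return None if unparseable."""
    if raw is None:
        return None
    try:
        parsed = datetime.fromisoformat(raw.replace("Z", "+00:00"))
    except ValueError:
        return None
    if parsed.tzinfo is None:
        # Assumption: timestamps without an offset are treated as UTC.
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed.astimezone(timezone.utc)


def normalize_article(record: dict) -> Optional[Article]:
    """Map a source-specific record to Article, or None if essentials are missing."""
    url = record.get("url")
    headline = _first_present(record, _HEADLINE_KEYS)
    body = _first_present(record, _BODY_KEYS)
    if not url or headline is None or body is None:
        return None
    collapsed_body = " ".join(body.split())  # collapse runs of whitespace
    published = _parse_date(_first_present(record, _DATE_KEYS))
    return Article(url, headline, collapsed_body, published)
```

Records that lack a URL, headline, or body are dropped rather than guessed at, which keeps the uniform records trustworthy at the cost of some coverage; a real pipeline might log such rejects for review instead.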

Solution

To do this, Kinzen had a choice: hire a dedicated team of web scraping experts internally, or seek out a third-party provider.

After considering the costs of hiring an internal team, including training, onboarding, and setup, they decided on the latter and began to assess the types of businesses that could provide the data extraction capability they needed. Zyte was the clear choice based on our ability to provide data extraction on a grand scale, accessibility to the data, the quality of the data, and the speed at which the data was received.


Zyte provided Kinzen with an easily manageable solution that let them identify sources and maintain a consistently high level of data quality when collecting news data. This also allowed Kinzen to precisely tweak the kind of data they required, efficiently and at scale.


With Zyte, Kinzen was immediately able to scale its data extraction efforts to match the quantity of news content produced daily, providing their data engineers with millions of extracted news articles for their APIs to process.


Offloading the collection and maintenance of news data to Automatic Extraction allowed Kinzen’s team to focus on product development and business strategy. The provision of such data has allowed Kinzen to accelerate its product development process without worrying about the maintenance of its data pipeline. This has enabled them to become one of the most innovative technology companies today.


Using Zyte Automatic Extraction news data APIs, Kinzen is able to reduce the refresh time of all their publisher partners' latest content to 1 minute. Any amendment to or removal of content on a publisher’s website is therefore reflected in Kinzen’s content capture within 1 minute of publication. This gives Kinzen's publishing partners more breathing room and gives readers the most up-to-date newsletters.
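To make the idea of catching amendments and removals between refreshes concrete, here is a minimal sketch, assuming each refresh yields a snapshot that maps article URLs to their extracted text. The `Changes` type and the `fingerprint` and `diff_snapshots` helpers are hypothetical names for illustration; the source does not describe how Kinzen or Zyte actually detect changes, and scheduling the refresh itself is left out.

```python
import hashlib
from typing import Dict, List, NamedTuple


class Changes(NamedTuple):
    added: List[str]
    amended: List[str]
    removed: List[str]


def fingerprint(text: str) -> str:
    """Stable content hash used to notice amendments between refreshes."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def diff_snapshots(previous: Dict[str, str], current: Dict[str, str]) -> Changes:
    """Compare two {url: article_text} snapshots from consecutive refreshes."""
    added = sorted(url for url in current if url not in previous)
    removed = sorted(url for url in previous if url not in current)
    amended = sorted(
        url
        for url in current
        if url in previous
        and fingerprint(current[url]) != fingerprint(previous[url])
    )
    return Changes(added, amended, removed)


if __name__ == "__main__":
    before = {"https://example.com/a": "First draft", "https://example.com/b": "Story B"}
    after = {"https://example.com/a": "Corrected text", "https://example.com/c": "Story C"}
    print(diff_snapshots(before, after))
```

Comparing hashes rather than full texts means a stored snapshot only needs to keep one short fingerprint per URL, which matters when the refresh runs every minute across many publishers.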

Results - Automatic Data Extraction at Scale

Data extraction at scale: 10x cleaner data daily
Quality & Accessibility: 30+ news articles per day
Speed and Reliability: 90% reduction in refresh time

Summary

Kinzen chose an external provider with expertise in data extraction: Zyte API. It provided the data extraction at scale, accessibility to the data, data quality, and speed that Kinzen needed.
