Pablo Hoffman

Pablo Hoffman | 6 min read | February 7, 2019

The Zyte Smart Proxy Manager Story: Enhancing Scraping Efficiency

Discover Smart Proxy Manager (Crawlera), the world's smartest proxy network tailored for web scraping, eliminating proxy management hassles.

Pablo Hoffman | 4 min read | July 13, 2016

Improving Access to Peruvian Congress Bills with Scrapy

Learn how Scrapy is improving access to Peruvian Congress bills for greater transparency.

Pablo Hoffman | 4 min read | August 3, 2015

The Road to Loading JavaScript in Portia: A Technical Journey

Learn about the journey of adding JavaScript support to Portia. Extract data from dynamic websites more efficiently.

Pablo Hoffman | 3 min read | July 21, 2015

EuroPython 2015: Uniting Pythonistas in Europe

Calling all Python enthusiasts! Join us at EuroPython 2015 for insightful talks and networking opportunities.

Scraping strategy

StartupChats: Embracing Remote Working for Success

Tune in to StartupChats as they discuss the advantages and challenges of remote working.

Pablo Hoffman | 1 min read | July 17, 2015

A Practical Guide To Web Data QA Part IV

Pablo Hoffman | 3 min read | July 15, 2015

PyCon Philippines 2015: Celebrating Python and Community

Join us at PyCon Philippines 2015. Discover the latest trends and innovations in the Python community.

Pablo Hoffman | 1 min read | June 25, 2015

Google Summer of Code 2015: Empowering Open Source Projects

Get involved in the Google Summer of Code with Zyte. Explore exciting projects and opportunities for students.

Embracing The Future Of Work: How To Communicate Remotely

Pablo Hoffman | 3 min read | June 8, 2015

GitHub Repository: Manage Vacations for a Distributed Team

Use a GitHub repository to manage employees' personal vacations and track each country's public holidays. Efficiently manage vacations for distributed teams.

Pablo Hoffman | 2 min read | May 27, 2015

Gender Inequality Across Programming Languages

Gender inequality across programming languages is a hot topic. This study uses UK profiles to determine the gender of each profile, covering 80% of the users.

Want To Predict Fitbit’s Quarterly Revenue? Eagle Alpha Did It Using Web Scraped Product Data

Pablo Hoffman | 5 min read | April 22, 2015

Frontera: The Brain Behind The Crawls

Frontera, formerly Crawl Frontier, is an open-source framework for managing our crawling logic and sharing it between spiders in our Scrapy projects.

A Practical Guide to Web Data QA (Part V): Navigating Broad Crawls

Pablo Hoffman | 4 min read | April 7, 2015

Scrape Data Visually With Portia And Scrapy Cloud

Note: Portia is no longer available for new users. It has been disabled for all new organizations since August 20, 2018.

Chats with Rinar Solutions: Insights into Remote Working

Pablo Hoffman | 3 min read | March 16, 2015

Why We Moved To Slack

We are veterans in the chat group arena. We have been using one form or another since we started Zyte in 2010 and I've been personally using corporate

Pablo Hoffman | 2 min read | March 16, 2015

History of Zyte: A Journey of Innovation

Learn about the journey of Zyte and how we evolved into a leading web scraping and data extraction platform.

Pablo Hoffman | 5 min read | March 2, 2015

Handling JavaScript In Scrapy With Splash

Handling modern websites that run entirely on JavaScript? In this article, learn how to use Splash to render JavaScript-based pages in your Scrapy spiders.

Pablo Hoffman | 3 min read | January 23, 2015

New Changes to Our Scrapy Cloud Platform: Enhanced Performance and Features

Stay up-to-date with the latest changes to Scrapy Cloud. Enhance your web scraping workflow with new features.

Pablo Hoffman | 1 min read | January 22, 2015

Introducing ScrapyRT: An API for Scrapy Spiders

Make the most of your Scrapy spiders with ScrapyRT. Explore its functionalities as an API for seamless integration.

Pablo Hoffman | 3 min read | December 31, 2014

Looking Back at 2014: Highlights and Milestones

Take a trip down memory lane and see the milestones and breakthroughs at Zyte in 2014.

Pablo Hoffman | 2 min read | January 18, 2014

Open source at Zyte

Embrace the open-source movement at Zyte. Learn how we contribute to the community and promote transparency.

Pablo Hoffman | 1 min read | October 1, 2013

Marcos Campal Is A ScrapingHubber!

Get to know one of our talented team members, Marcos Campal. Learn about his contributions to the world of web scraping.

Proxy management: In-house or off-the-shelf proxy solutions?

Pablo Hoffman | 1 min read | May 11, 2013

Introducing Smart Proxy Manager

Enhance your web scraping with Smart Proxy Manager. Explore its powerful features and benefits.

4 simple Steps for effective Automated Data QA Process

How To

Git Workflow For Scrapy Projects

Streamline your Scrapy projects with an efficient Git workflow. Improve collaboration and project management.

Pablo Hoffman | 2 min read | March 6, 2013

Zyte Blog — field notes from the world of data extraction

Pablo Hoffman | 3 min read | October 26, 2012

How To Fill Login Forms Automatically

We often have to write spiders that need to fill in login forms on websites. Our customers provide us with the site, username and password, and we do the rest.

Pablo Hoffman | 2 min read | August 25, 2012

Spiders Activity Graphs

Visualize your spiders' performance with activity graphs. Optimize your web scraping process with actionable insights.