Scrapy Tutorial: How To Scrape JavaScript with Python & Splash
Developed by Pablo Hoffman and Shane Evans, Scrapy is an open-source Python framework built specifically for web data extraction. With Scrapy spiders, you can download HTML, parse and process the data, and save it in CSV, JSON, or XML format.
This video shows how to scrape JavaScript-based websites using Scrapy and Splash. It covers:
- How to identify pages that rely on JavaScript
- How to run Splash
- How to integrate your Scrapy spiders with Splash
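As a rough illustration of the three topics above, here is a minimal sketch using only the Python standard library. It assumes Splash is running locally on port 8050 (for example, started from the `scrapinghub/splash` Docker image) and that its `render.html` HTTP endpoint is used. The helper names `looks_js_rendered` and `splash_render_url`, the sample HTML, and the example.com URL are hypothetical and not taken from the video.

```python
from urllib.parse import urlencode

# Assumption: Splash is running locally on its default port, e.g. started with
#   docker run -p 8050:8050 scrapinghub/splash
SPLASH_URL = "http://localhost:8050"


def looks_js_rendered(raw_html: str, expected_text: str) -> bool:
    """Heuristic check (hypothetical helper): if text that is visible in the
    browser is missing from the raw HTML, the page probably builds that
    content with JavaScript."""
    return expected_text not in raw_html


def splash_render_url(page_url: str, wait: float = 0.5) -> str:
    """Build a URL for Splash's render.html endpoint, which returns the
    page's HTML after JavaScript has had `wait` seconds to run."""
    query = urlencode({"url": page_url, "wait": wait})
    return f"{SPLASH_URL}/render.html?{query}"


if __name__ == "__main__":
    # Raw HTML as a plain HTTP client would see it: an empty app container.
    raw = "<html><body><div id='app'></div><script src='app.js'></script></body></html>"
    print(looks_js_rendered(raw, "Price: 10"))
    print(splash_render_url("https://example.com/products"))
```

In a spider, the URL built this way could be passed to a regular `scrapy.Request` so that the response contains the rendered HTML. The `scrapy-splash` plugin provides a `SplashRequest` class that wraps this kind of call more conveniently.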
If you haven't yet, we recommend that you first watch part I, part II, part III, part IV, part V, part VI, and part VII of our tutorial series.