PYCON UK

Tickets

An Introduction to web scraping using Python

Manoj Pandey | Saturday 16:30 | Ferrier Hall

Web scraping is a technique for gathering data or information on web pages. You could revisit your favorite web site every time it updates for new information. Or you could write a web scraper to have it do it for you!

Want to learn how to scrape the web (and/or organized data sets and APIs) for content? This talk will give you the building blocks (and code) to begin your own scraping adventures. We will review basic data scraping, API usage, form submission as well as how to scrape pesky bits like JavaScript-usage for DOM manipulation.

Besides looking at how websites are put together, we will also discuss the ethics of scraping. What is legal? How can you be a friendly scraper, so that the administrator of the website you are scraping won’t try to shut you down?

Slides for the presentation are already drafted here: https://slides.com/manojp/introws

Link to video