Introduction to Web Scraping with Python

Web scraping is a method of extracting and restructuring information from web pages. This workshop will introduce basic techniques for web scraping using the popular Python libraries. Participants will practice accessing websites, parsing information, and storing data in a CSV file. This workshop is intended for social scientists who are new to web scraping, but have some familiarity with Python or have attended the Introduction to Python workshop.

Workshop Preparation

The duration of the workshop is 3 hours. Computers with Python pre-loaded are available on a first-come, first-served basis. If you with to use your own laptop, please install the Anaconda distribution of Python 3.6 from https://www.continuum.io/downloads. Notes and materials for this workshop are available at https://dss.iq.harvard.edu/workshop-materials#widget-0.

This workshop is being conducted in partnership with the Data Science Services group at IQSS. For additional information, please contact us at research@hbs.edu.