Web scraping is a method of extracting and restructuring information from web pages. This workshop will introduce basic techniques for web scraping using popular Python libraries. Participants will practice accessing websites, parsing information, and storing data in a CSV file. This workshop is intended for social scientists who are new to web scraping, but have some familiarity with Python or have attended the Introduction to Python workshop.
The duration of the workshop is 3 hours. Computers with R installed are available on a first-come, first-served basis.
Setup instructions must be completed prior to starting the workshop: http://bit.ly/dss_pythoninstall
Class notes for this workshop are available at: http://bit.ly/dss_pythonwebscrape
PLEASE NOTE: This workshop is being delivered in a FLIPPED CLASSROOM format. This means that participants will be responsible for working through the online materials at their own pace IN ADVANCE of the scheduled meeting time. During the scheduled meeting time, the instructor will demonstrate how to complete the example exercises and will be available to answer questions related to the workshop materials. The instructor WILL NOT walk through all the online materials during the scheduled meeting.