Web-scraping in Python Workshop

This workshop is interactive and designed to be completed in Google Colab. We will show you examples of how to do things in Python, then give you a challenge so you can try writing the code yourself. Take a moment to duplicate this document to your own Google Drive; your copy is where you will do your own editing. Once you’re ready, run the code blocks one at a time, from the top down. Later code blocks may depend on earlier ones, so run every block above your current one to get the intended results.

Prereqs: This workshop assumes basic Python knowledge, although you are welcome either way. If you are not familiar with Python, working through our Intro Python Workshop notebook should bring you up to speed. That notebook takes roughly 15 to 60 minutes to complete on your own. If you have questions about it or get stuck, post a message in the Data Science Club Discord and @ one of the admins. (As of Nov. 8, 2020, Tajen and Kori are the best people to ask about the notebooks.)

Note: The notes and activities in this document are at times pretty verbose and are meant to be friendly to someone who knows very little about programming. Feel free to follow along with whichever activities you feel would be most helpful for you.

Main Challenge

