Week 3: Further building your scraping skills
Learning goals
- Identifying a strategy to generating seeds (“sampling”)
- Navigating on a website programmatically
- Improving extraction design
- Scraping more advanced, dynamic websites
Lecture
Laptop required!
- In-clss tutorial ( slides, download tutorial, view tutorial in Google Colab)
Would you like to advance your scraping skills for dynamic websites such as Instagram that involves a lot of user interaction like scrolling and clicking? Then also follow the “Web Scraping Advanced” tutorial.
Coaching session
- Please check out the project page.
After the lecture and coaching session
- Complete the exercises in the tutorial of this week
- Please work through challenges #2.1-#2.4 in “Fields of Gold”.