Draft

Week 2) Learning how to scrape

Week 2: Workflow & learning how to scrape

Learning goals

  • Differentiate between retrieving data from websites and APIs.
  • Retrieve and store web data in various formats using Python’s requests library and browser inspection tools.
  • Extract and manipulate data from websites and APIs using BeautifulSoup and JSON handling techniques.
  • Apply programming concepts to automate data collection and understand the use of Jupyter Notebooks vs. raw Python files.

Lecture

Coaching session

After the lecture and coaching session

Note

This paper will provide a guiding framework for the rest of this course, and chance is you’ll have to read it a couple of times (e.g., first to get an overview, and later to appreciate and use the details in your project). The web appendix contains valuable tables, so don’t skip them.