Week 2) Foundations of web scraping

Learning goals

  • Retrieve and store web data in various formats using Python’s requests library and browser inspection tools.
  • Extract and manipulate data from websites and APIs using BeautifulSoup and JSON handling techniques.
  • Apply programming concepts to automate data collection and understand the use of Jupyter Notebooks vs. raw Python files.
  • Learn how to extract data from static and dynamic websites

Preparation before class

  • Ensure you have the necessary tools installed (e.g., BeautifulSoup, Selenium).

Tutorials

  • Web scraping 101
  • Web scraping Advanced

After class

  • Work through the tutorials at your own pace.
  • Register project teams on Canvas.