Online Data Collection and Management (oDCM)
Instructor: dr. Hannes Datta
Course codes: 328061-M3 (fall, block 3) and 328060-M3 (spring, block 1)
This edition: August - October 2021 | Next edition: February - April 2022
Learn how to mine the web
Welcome to the course website of oDCM.
This course teaches you the nuts and bolts of collecting web data using web scraping and Application Programming Interfaces (APIs). Unlike most other courses on this topic, it not only covers the technicalities of web scraping and APIs, but also introduces a comprehensive framework that helps you think about scraping, specifically with regard to its application in empirical marketing research.
This website is the backbone of the course and features the following main sections:
The course section features the syllabus, schedule, and grading details.
The module section contains (weekly) collections of live streams, self-study material, and activities.
The tutorial section offers a workflow for collecting online data, along with self-guided Jupyter Notebooks that teach the basics of data retrieval via web scraping and APIs. Use these to start your own scraping projects!
Finally, the example section offers links to publicly available data collection projects, which you can use as inspiration.