Overview

Learning Goals

Collecting data via web scraping and APIs requires practice. Together with your team members, you plan and execute an online data collection throughout the course by closely following the recommendations in “Fields of Gold”.

At the end of the course, you will submit a data package, consisting of:

  • Your collected data
  • All source code
  • A documentation (“readme”)

The project teams are finalized in week 3. The project starts in week 4.

Note

The focus lies on completing an entire data collection project. Keep each stage of your project manageable and feasible. Your project will ultimately be written up as a proper data documentation (readme), following this paper and corresponding template download. Ensure to also check the grading details.

Organization

Coaching sessions

During the course, you will have the opportunity to meet up with the course instructor for coaching sessions. These sessions are meant for you to receive feedback on your ideas and code. Frequently, this also entails problem-solving & debugging.

  • Participation: All teams attend the full session. Teams typically collaborate on their projects while the course instructor provides support (in-person by walking around or via Zoom breakout rooms).
  • Session Format:
    • First Half: Each team gets 5–10 minutes to provide a progress update and seek assistance from the coach.
    • Second Half: Time is allocated on a needs basis to address specific issues or questions raised by teams.

Deliverables: Most coaching sessions will help the team work on some deliverables, which are always due before the next coaching session. Please refer to Canvas for exact due dates each week.

Note

How to Prepare for Coaching Sessions?

  1. Complete Weekly Tasks: Work on your project and submit weekly deliverables to receive constructive feedback.
  2. Preparation: Review lecture materials and relevant academic literature (e.g., Boegershausen et al., 2022; Guyt et al., 2024).
  3. Solve Technical Issues:
    • Check for solutions on Canvas, as well as YouTube or YouTube channel.
    • Perform your own research using the course chatbot or Google/StackOverflow.
  4. Set Up Jupyter Notebook:
    • Load scripts and display any issues (e.g., error messages) on your screen for troubleshooting. Screenshots alone are insufficient.
  5. For Online Sessions:
    • Share your screen (Jupyter Notebook) and ensure your microphone is functional.
    • Be ready for the instructor to take over your screen if needed.

Team composition

  • 4-5 students per team
  • Enroll your team on Canvas by week 3 (project starts in week 4)

Deadline & submission

  • Deadline: 5 December, 2025 (23:59)
  • Submission of your data package on Canvas in one zip file.

Permitted Level of AI Use

Level of AI allowed for this assignment: AI-assisted idea generation and structuring (Level 3 on AI Index Tilburg University)

  • You are allowed to use generative AI tools to develop or refine initial ideas, materials, paraphrasing, structures, or outlines. This includes generating code, e.g., for R.
  • Failing to declare AI use, or using AI beyond what is allowed in the syllabus, may be considered fraud and will be reported to the Examination Board.
  • Mandatory for students: For this assignment, please keep a simple “logbook” documenting which AI tools you have used and for what purposes. Submit this logbook with the assignment (e.g., as a PDF). Examiners may inspect the logbook at any time. Failing to declare AI use, or using AI beyond what is allowed in the syllabus, may be considered fraud and will be reported to the Examination Board.