Weekly Deliverables

The activities below help you to structure your project. Stick to the plan to ensure you can finish the project in time.

Coaching 1: Idea & Extraction Design

Prework

  • Sign up to teams on Canvas.
  • Pick a data context of interest and identify a potential data source (website/API) that contains data that maybe valuable to the broader research community and industry. Use Table 2 in Boegershausen et al. 2022 for refinement.
  • Write preliminary code to connect to the data source and determine its accessibility.
  • Design your data extraction plan: decide what to retrieve, which seeds to use, how frequently to extract, and how to process the data.
  • Use Table 3 in Boegershausen et al. (2022) as guidance throughout.

Deliverable

  • Sections 1 and 2 of your data documentation.

Coaching 2: Prototype Development

Prework

  • Finalise a functional prototype that can extract at least a sample dataset end-to-end.
  • Implement monitoring tools (e.g., logging, rate-limit handling) and decide on infrastructure.
  • Use Table 4 in Boegershausen et al. (2022) throughout.

Deliverable

  • Section 3 of your data documentation.

Coaching 3: Refinement & Data Quality

Prework

  • Continue refining your data collection process and, if feasible, complete the extraction.
  • Inspect the data and perform any necessary preprocessing (see Section 4 of your data documentation).

Deliverable

  • Sections 4 and 5 of your data documentation.

Coaching 4: Finalise Your Data Package

Prework

  • Finalise your data documentation (Sections 1–5).
  • Improve source code quality and ensure reproducibility.
  • Organise the final submission package in a clear and consistent directory structure.

Deliverable

No deliverables. Prepare for final submission.