Hannes Datta
We're about to start with the first lecture of this class.
If you haven't done so, please
Substantive interests
Methodological interests
With web scraping, you can capture anything you can view in a web browser
With APIs, you obtain official data from a firm in a programmatic way
import requests
import requests
url = 'https://music-to-scrape.org/'
webrequest = requests.get(url)
from bs4 import BeautifulSoup
soup = BeautifulSoup(webrequest.text)
weekly15 = soup.find('section', {'name':'weekly_15'})
for song in weekly15.find_all('h5'): print(song.text)
Gabriel Yared
Danny Williams
Pascal Obispo
Vangelis
Solas
Joi
Stevie Ray Vaughan And Double Trouble
Magnatune Compilation
Mint Condition
Enslavement Of Beauty
Gerald Veasley
Sofi Marinova
Elmore James
Snow Patrol
Converge
APIs are official interfaces by firms for programmers to extract or submit data, or obtain access to an algorithm
They work like websites (i.e., you can call them with the same snippets as before), but usually you need to pay or at least sign up for the service
# let's get some data from the API of music-to-scrape
api_request = requests.get('https://api.music-to-scrape.org/charts/top-tracks')
api_request_json = api_request.json()
for song in api_request_json.get('chart'): print(song.get('name'))
Drugs
Decide
Breaking Down The Walls Of Heartache
Watt's Cradle Song
Now everybody's gone
What’s the last app/website that made you say ‘wow’?
If you could only use three apps/websites for the next year, which ones would you pick and why?
What's a niche online community you're part of that most people don’t know about?
What’s the last thing you saw on TikTok that made you stop scrolling?
Imagine an AI tool that could make you internet-famous overnight. What would it do?
Let's talk about it right now…
Why do we need a course on this? Isn't this how research is always done?
Yes, but collecting web data is different from other datasets!
Each project is totally unique - that's why there is no universal “best way” to approach things…
Weekly modules, structured along the methodological framework
Project in which you put into practice your skills (40% of your grade)
On-campus computer exam (60% of your grade)
Specifics
Evaluation
Visit https://odcm.hannesdatta.com!
Course website is your #1 resource, Canvas used for
Do all students have access to Canvas?
→ quick feedback loops in first few weeks
When you run into trouble, this is your way out!
Investment in research skills
Essential skills for entrepreneurs
Showcast expertise in coding