Hannes Datta
Using virtual environment '/Users/hannesdatta/.virtualenvs/r-reticulate' ...
Using virtual environment '/Users/hannesdatta/.virtualenvs/r-reticulate' ...
We're about to start with the first lecture of this class.
If you haven't done so, please
Substantive interests
Methodological interests
With web scraping, you can capture anything you can view in a web browser
With APIs, you obtain official data from a firm in a programmatic way
import requests
url = 'https://music-to-scrape.org/'
webrequest = requests.get(url)
from bs4 import BeautifulSoup
soup = BeautifulSoup(webrequest.text)
weekly15 = soup.find('section', {'name':'weekly_15'})
for song in weekly15.find_all('h5'): print(song.text)
Tito Puente
DJ Quik
Babylon Disco
Stevie Ray Vaughan And Double Trouble
Billie Jo Spears
Charlie McCoy
Les Bonapartes
Muse
Bare Jr.
Spectra Soul
Little Joe & The Thrillers
Charlie Byrd Trio
Johnny Pearson
Stevie Ray Vaughan And Double Trouble
Chris Farlowe
APIs are official interfaces by firms for programmers to extract or submit data, or obtain access to an algorithm
They work like websites (i.e., you can call them with the same snippets as before), but usually you need to pay or at least sign up for the service
# let's get some data from the API of music-to-scrape
api_request = requests.get('https://api.music-to-scrape.org/charts/top-tracks')
#api_request_json = api_request.json()
#for song in api_request_json.get('chart'): print(song.get('name'))
What are cool websites/services you're using often?!
What are important societal issues right now that directly or indirectly affect your lives?
As a marketer, how could you use the API of OpenAI to automate/invent something new?
Let's talk about it right now…
Why do we need a course on this? Isn't this how research is always done?
Yes, but collecting web data is different from other datasets!
Each project is totally unique - that's why there is no universal “best way” to approach things…
Weekly modules, structured along the methodological framework
Project in which you put into practice your skills (50% of your grade)
On-campus computer exam (50% of your grade)
Bonus points available (max .5 on final course grade; contribute to course or Tilburg Science Hub using source code: e.g., solutions for assignments, writing a tutorial for a new package, debugging source code, maintaining course's issue board); needs GitHub for this
Goals:
In the context of retail-based platforms,
Find your sweet spot!
Specifics
Evaluation
Visit https://odcm.hannesdatta.com!
Course website is your #1 resource, Canvas only used for
Do all students have access to Canvas?
→ quick feedback loops in first few weeks
When you run into trouble, this is your way out!
Investment in research skills
Essential skills for entrepreneurs
Showcast expertise in coding
Please tell me what would make this course a success for you
Any questions so far?!