Small project to try to crawl the highoff pages
MSCHF released a project in collaboration with Future. This crawler helps you find the page the song is currently on.
Install the Chrome WebDriver (chromedriver). On macOS, the easiest way I have found is Homebrew:
brew install --cask chromedriver
Install and run the crawler in headless mode from the command line:
virtualenv -p python3 env
source env/bin/activate
pip install -r requirements.txt
python mschf.py
On the command line, you will be given the URL of the currently active playing session. Each session lasts 10 minutes before moving to a new domain.
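For a rough idea of what a headless check like this can look like, here is a minimal sketch using Selenium with Chrome. It is not the code in mschf.py. The names `has_audio` and `find_active_page` are hypothetical, and the sketch assumes the active page can be recognized by an `<audio>` tag in its rendered HTML, which may not match how the real pages behave.

```python
import re

# Assumption: the active page embeds an <audio> element. This marker is
# hypothetical and only illustrates the idea.
AUDIO_TAG = re.compile(r"<audio\b", re.IGNORECASE)


def has_audio(html):
    """Return True if the HTML contains an <audio> tag."""
    return AUDIO_TAG.search(html) is not None


def find_active_page(candidate_urls):
    """Return the first URL whose rendered page contains an <audio> tag, else None."""
    # Imported here so has_audio() can be used without Selenium installed.
    from selenium import webdriver
    from selenium.common.exceptions import WebDriverException
    from selenium.webdriver.chrome.options import Options

    options = Options()
    options.add_argument("--headless")  # run Chrome without opening a window
    driver = webdriver.Chrome(options=options)
    try:
        for url in candidate_urls:
            try:
                driver.get(url)
            except WebDriverException:
                continue  # skip pages that fail to load
            if has_audio(driver.page_source):
                return url
        return None
    finally:
        driver.quit()  # always shut the browser down
```

Since each session only lasts 10 minutes, a result from a check like this goes stale quickly and would need to be re-run after the session moves.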
With some digging, if you want to download the *.mp3 file they are streaming, you can open your browser's developer tools, go to the Network tab, and find the request being made.
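Once you have copied the request URL from the Network tab, saving the file takes only the standard library. The sketch below is one way to do it. `download_stream` is a hypothetical helper name, and it assumes the URL can be fetched with a plain GET without cookies or extra headers.

```python
import shutil
import urllib.request
from pathlib import Path


def download_stream(url, destination):
    """Save the resource at `url` to `destination` and return its size in bytes."""
    destination = Path(destination)
    # Copy in chunks so a large audio file is never held fully in memory.
    with urllib.request.urlopen(url) as response, destination.open("wb") as out:
        shutil.copyfileobj(response, out)
    return destination.stat().st_size
```

Call it with the URL you copied and an output path of your choice, for example `download_stream(url, "song.mp3")`.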
I have already done this; here is where the file is stored in S3. Quarantine got me real bored, so I hope you find this useful.