Web application that scrapes information of mars from various NASA related pages and displays the information in one page.
Since the objective was to show web scraping, each data was retrive from different sources:
Information Retrived | Source |
---|---|
Mars News | https://mars.nasa.gov/news/ |
Featured Image & image details | https://www.jpl.nasa.gov/spaceimages/?search=&category=Mars |
Mars Weather | https://twitter.com/marswxreport?lang=en |
General Facts | https://space-facts.com/mars/ |
Mars Hemispheres | https://astrogeology.usgs.gov/search/results?q=hemisphere+enhanced&k1=target&v1=Mars |
From this, a dictionary was created and uploaded into a MongoDB database through Python's Flask library so that when you open the site you'll see the latest scrape made.
In this repository you'll see two folders. The one name Original, is the original code written for this exercise, which works locally with the appropiate keys for twitter and mongodb. The second folder called Simple, is a simpler version of the code that only displays the latest result of the scrape that was originally in the Heroku application since this code was writen using a chromedriver and it's not compatible with Heroku.
I'm currently working on deployment site with the scrape function with the aid of buildpacks, the site is deployed but the scrape is still not working properly.
Mars image not own, retrived from Space.com on 29/06/2020.