Skip to content

Latest commit

 

History

History
29 lines (28 loc) · 1.65 KB

README.md

File metadata and controls

29 lines (28 loc) · 1.65 KB

📊 Applied Data Science Capstone

This Capstone project is the 10th (final) course in IBM Data Science Professional Certificate. It summarizes in the form of a project all materials that have been learned during this specialization.

📋 Project Background

SpaceX is the most successful company of the commercial space age, making space travel affordable. The company advertises Falcon 9 rocket launches on its website, with a cost of 62 million dollars; other providers cost upward of 165 million dollars each, much of the savings is because SpaceX can reuse the first stage. Therefore, if we can determine if the first stage will land, we can determine the cost of a launch. Based on public information and machine learning models, we are going to predict if SpaceX will reuse the first stage.

📂 Questions to be answered

  • How do variables such as payload mass, launch site, number of flights, and orbits affect the success of the first stage landing?
  • Does the rate of successful landings increase over the years?
  • What is the best algorithm that can be used for binary classification in this case?

📂 Methodology

1. Data collection

  • Using SpaceX Rest API
  • Using Web Scrapping from Wikipedia

2. Data wrangling

  • Filtering the data
  • Dealing with missing values
  • Using One Hot Encoding to prepare the data to a binary classification

3. Exploratory data analysis (EDA) using visualization and SQL

4. Interactive visual analytics using Folium and Plotly Dash

5. Predictive analysis using machine learning