Familiarize yourself with scientific programming through the SKLearn / Scikitlearn library
- Exercise 1:
- Split the DelayedFlights.csv dataset into train and test. Study the two sets separately, at a descriptive level.
- Exercise 2:
- Apply some transformation process (standardize numerical data, create dummy columns, polynomials.
- Exercise 3:
- Summarize the new columns generated statistically and graphically
- Preprocessing
- Transformation
- Dummy Variable
- Train and Test