In this repository you could find solution of the 2019 Data Science Bowl competition on Kaggle platform. We took 39-th place out of 3497 participants (Top 2%, silver medal).
Big respect to my teammates - Tatiana Glazkova and Andrew Ponomarev
It was really hard to avoid shake-up and get stable model. Look at right picture (thanks to James Trotman for pic.). It clearly shows the problem..
How to reproduce solution:
- Download input data from https://www.kaggle.com/c/data-science-bowl-2019/data
- Create directory with name "input" in root directory
- Unzip all files to "input" directory
- Open ./scripts/final_solution.ipynb
- Run this notebook
- Wait...
- Answers will be stored in "submission.csv"