Surrender analysis in life insurance

Research project 2019-2021.
Accompanying code for the paper "Modelling surrender risk in life insurance: theoretical and experimental insight" by M. Kiermayer.
Paper available on https://www.tandfonline.com/doi/full/10.1080/03461238.2021.2013308 (publication) or https://arxiv.org/abs/2101.11590 (preprint).

Goal

Perform extensive experiments to analyze the capabilities of several models (including logistic regression, random forest, XGBoost, neural networks bagged or boosted) to estimate surrender probabilities.
Check the effect of resampling on model performance and predicted surrender probabilities
Investigate a time-dependent evaluation of surrender rates, including confidence bands

The simulated data can be found in the directory "./Data", including the portfolio "Portfolio.csv" at time t=0.
* Sub-directories "./Data/profile_{i}" include time-series data for years t>=0 wich is unique to surrender assumptions in the ith-profile, i = 0,1,2,3.
* All data in these (sub-)directories is generated (and analyzed) by the scripts "_data{i}_{..}.py"
The files "HPSearch_{..}.py" implement an automated hyperparameter-tuning (based on the python package 'hyperopt')
The directories "./profile_{i}" contain the results for trials of HPTuning for all models and the resulting best model-parametrizations
The main files analyzes all models (given the parametrization after HPTuning)
Visual and statistical analyses of our experiments are saved in either "./Plots" or "./Tables"
All helper-functions can be found in "./functions"

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.vscode		.vscode
DATA		DATA
Plots		Plots
Tables		Tables
__pycache__		__pycache__
functions		functions
profile_0		profile_0
profile_1		profile_1
profile_2		profile_2
profile_3		profile_3
HPSearch_keras.py		HPSearch_keras.py
HPSearch_sklearn.py		HPSearch_sklearn.py
HPSearch_sklearn_visualization.py		HPSearch_sklearn_visualization.py
LICENSE.txt		LICENSE.txt
Main_surrender_modeling.py		Main_surrender_modeling.py
README.md		README.md
__data1__createInitPortfolio.py		__data1__createInitPortfolio.py
__data2__createTimeSeriesData.py		__data2__createTimeSeriesData.py
__data3__analyzeImbalanceOfData.py		__data3__analyzeImbalanceOfData.py
global_vars.py		global_vars.py