Homeworks from the Statistics for Insurance course of accademic year 2018/2019 at University of Trieste.
The project consists of two parts:
- Developing a Motor Third Party Liability (MTPL) Insurance Pricing model using General Linear Models (GLM);
- Developing a Reservation Model for MTPL Insurance claims using GLM.
All the SAS code is provided in the notebook statAss_provaPratica.ipynb
. To open and compile the file I suggest to use SAS University Edition, that provides a Jupyter Lab environment where you can run SAS code. SAS University Edition is free and can be downloaded from here.
The dataset provided in the directory /data/
has been described in La tariffazione nei rami danni con modelli lineari generalizzati (P. Gigante, L. Picech, L. Sigalotti).
The techniques used has been asked by the teacher. I am aware that considering all the explanatory variables as qualitative and merging the modalities in few levels based just on fitness on training set could cause overfitting and bad model predictiveness.
The models could be significantly improved by using Generalized Additive Models (GAM) for the quantitative variables (such as policyholder's age
and vehicle's power
) and geospatial modeling techniques for location variables (such as policyholder's province
of residence).