Detection and Prevention of Vehicle Insurance Claim Fraud

Project Overview

This project tackles vehicle insurance fraud, which causes substantial financial losses for insurance companies and erodes consumer trust. Fraudulent claims range from staged accidents to exaggerated injuries, complicating the claims process and increasing costs. Using historical vehicle and policy data, our objective is to develop a predictive model that detects fraudulent claims accurately enough to help prevent them. The model is intended to help insurance companies minimize financial losses, process claims more efficiently, and maintain fair premium pricing for customers.

Dataset

Name: Vehicle Claim Fraud Detection

Source: Kaggle
Size: 15,420 records
Variables: 33 (both categorical and numerical)
Key Features:
- Month of the accident
- Day of the week
- Make of the vehicle
- Accident area
- Age of the policyholder
- Various policy details
- Indicator of whether the claim was fraudulent

With 15,420 records and an indicator of whether each claim was fraudulent, the dataset is large enough to train and evaluate a classification model.
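Because the dataset carries a fraud indicator, a useful first check is how often that indicator is set. The snippet below is a minimal sketch of that check, not the notebook's own code. The target column name `FraudFound_P` is an assumption about the Kaggle file, and the `demo` frame is made-up data standing in for the real records.

```python
import pandas as pd


def fraud_rate(df: pd.DataFrame, target: str = "FraudFound_P") -> float:
    """Return the share of rows whose target column equals 1 (fraud)."""
    # The default column name is an assumption; pass the real one if it differs.
    return float((df[target] == 1).mean())


# Made-up rows for illustration only; these are not taken from the dataset.
demo = pd.DataFrame(
    {
        "Make": ["Honda", "Toyota", "Ford", "Mazda"],
        "FraudFound_P": [0, 0, 1, 0],
    }
)

print(f"Fraud rate: {fraud_rate(demo):.2%}")
```

On the real data, the same function can be called on the frame returned by `pd.read_csv` for the downloaded CSV file.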

Project Structure

Data Preparation: Handle missing values, convert data types, and scale numerical features.
Feature Engineering: Select relevant features, one-hot encode categorical variables, and apply SMOTE to handle class imbalance.
Model Training and Evaluation: Train and evaluate models (Isolation Forest, Gradient Boosting, Decision Tree, XGBoost, Random Forest, K-Nearest Neighbor, Logistic Regression, and CatBoost) with hyperparameter tuning.
Model Comparison: Compare models based on performance metrics, highlighting CatBoost as the top performer.
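The stages above can be sketched with scikit-learn alone. This is an illustrative outline under stated assumptions, not the notebook's implementation: the data is synthetic, the column names (`AccidentArea`, `Make`, `Age`, `is_fraud`) are hypothetical, only two of the listed models are shown, F1 is an assumed comparison metric, and SMOTE and hyperparameter tuning are left out to keep the example short.

```python
import numpy as np
import pandas as pd
from sklearn.base import clone
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Synthetic stand-in data; column names and values are hypothetical.
rng = np.random.default_rng(0)
n = 400
df = pd.DataFrame(
    {
        "AccidentArea": rng.choice(["Urban", "Rural"], size=n),
        "Make": rng.choice(["Honda", "Toyota", "Ford"], size=n),
        "Age": rng.integers(18, 80, size=n),
    }
)
# Made-up labeling rule plus a little noise, so the models have a signal to learn.
noise = rng.random(n)
rule = (df["AccidentArea"] == "Rural") & (df["Age"] < 35)
df["is_fraud"] = (rule | (noise < 0.05)).astype(int)

X = df.drop(columns="is_fraud")
y = df["is_fraud"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Data preparation and feature engineering: one-hot encode categorical
# columns and scale numerical ones.
preprocess = ColumnTransformer(
    [
        ("cat", OneHotEncoder(handle_unknown="ignore"), ["AccidentArea", "Make"]),
        ("num", StandardScaler(), ["Age"]),
    ]
)

# Model training: two of the model families named above, default settings.
models = {
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=0),
}

# Model comparison: fit each pipeline and score it on the held-out split.
scores = {}
for name, model in models.items():
    clf = Pipeline([("prep", clone(preprocess)), ("model", model)])
    clf.fit(X_train, y_train)
    scores[name] = f1_score(y_test, clf.predict(X_test))

for name, score in sorted(scores.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: F1 = {score:.3f}")
```

Wrapping preprocessing and model in one `Pipeline` means the encoder and scaler are fitted on the training split only. If SMOTE is added to a sketch like this, it is usually applied to the training split as well, so that synthetic rows do not leak into evaluation.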

Installation and Usage

Prerequisites

Python 3.x
Required libraries:
- pandas
- numpy
- scikit-learn
- xgboost
- matplotlib
- seaborn
- scikit-optimize (for Bayesian Optimization)
- Jupyter Notebook

Instructions

Clone the repository:

git clone https://github.com/oxayavongsa/aai-510-ml-group-1
cd aai-510-ml-group-1

Install the required packages:
```
pip install -r requirements.txt
```
Open the Jupyter Notebook for Exploratory Data Analysis (EDA):
```
jupyter notebook "Final Project SectionA-Team 1.ipynb"
```
Follow the notebook steps to perform data cleaning, feature selection, and model training.

Team Members

Team Leader/Representative: Outhai Xayavongsa (Thai)
Technical Lead: Aaron Ramirez
Members:
- Aaron Ramirez
- Muhammad Haris
- Outhai Xayavongsa (Thai)

YouTube: Related Video

License

This project is licensed under the MIT License - see the LICENSE file for details.
