This project uses LexisNexis's HPCC Systems platform and ECL to analyze credit card transaction data and detect fraud and anomalies. The project will first conduct data preprocessing and deterministic modeling with a static ruleset to mark the most obvious anomalous factors. Two machine learning models will then be implemented: one supervised, using Logistic Regression, and one unsupervised, using Agglomerative Hierarchical Clustering. The results will be compared to determine which method performs better, and Python data visualization libraries will then be used to visualize and interpret the output into a “Client Report”.
Technologies Used: ECL with the ML_Core, LogisticRegression, & LearningTrees libraries; HPCC Systems; SQL; Python with the libraries listed below.
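To give a sense of the deterministic pass described above, here is a minimal ECL sketch of a static rule that flags high-value transactions. The record layout, field names, and threshold are illustrative placeholders, not the project's actual definitions.

```ecl
// Minimal sketch of one static anomaly rule.
// NOTE: TransactionRec and its fields are hypothetical placeholders.
TransactionRec := RECORD
    UNSIGNED8 transaction_id;
    REAL8     transaction_amt;
    UNSIGNED1 rule_flag;        // 1 = flagged by the static ruleset
END;

// Stand-in for the cleaned transactions dataset produced by the clean job.
transactions := DATASET([], TransactionRec);

// Example static rule: flag transactions above an illustrative amount threshold.
flagged := PROJECT(transactions,
                   TRANSFORM(TransactionRec,
                             SELF.rule_flag := IF(LEFT.transaction_amt > 10000, 1, 0),
                             SELF := LEFT));

OUTPUT(flagged, NAMED('Rule_Flagged_Transactions'));
```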
First import transactions & identity (doesn't matter which you import first, but this HAS to be complete before running job 05)
For transactions:
1. 01_Data_Import
2. 02_Data_Import_Validate_Job
3. OPTIONAL: 03_Data_Patterns_Job
4. 04_Clean_Job
For identity:
1. identity/01_Data_Import
2. identity/02_Import_Validate_Job
3. OPTIONAL: identity/03_Data_Patterns_Job
4. identity/04_Clean_Job
Then you can run 05_Enrich_Data. After this, any of the models can be run.
This is the run order when running from scratch. If you have already imported, validated, and cleaned both the transactions and identity data with the current code version, you can skip those steps and go straight to 05_Enrich_Data and the models; a minimal model-invocation sketch follows.
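For reference, here is a minimal sketch of what invoking the supervised model on the enriched data might look like, assuming the GetModel/Classify pattern of the ML_Core and LogisticRegression bundles. The record layout, feature numbering, and field names are illustrative placeholders rather than the project's actual definitions.

```ecl
IMPORT ML_Core;
IMPORT LogisticRegression AS LR;

// Hypothetical layout standing in for the output of 05_Enrich_Data.
EnrichedRec := RECORD
    UNSIGNED8 id;               // record id (first field, used by ToField)
    REAL8     transaction_amt;  // feature 1
    REAL8     risk_score;       // feature 2 (hypothetical engineered feature)
    UNSIGNED1 is_fraud;         // label (becomes field 3 after ToField)
END;
enriched := DATASET([], EnrichedRec);   // stand-in for the enriched dataset

// Convert to ML_Core's NumericField matrix form.
ML_Core.ToField(enriched, enrichedNF);

independents := enrichedNF(number <= 2);    // feature columns
dependentsNF := PROJECT(enrichedNF(number = 3),
                        TRANSFORM(ML_Core.Types.NumericField,
                                  SELF.number := 1,   // dependent is renumbered to 1
                                  SELF := LEFT));
dependents   := ML_Core.Discretize.ByRounding(dependentsNF);

// Train and classify with the LogisticRegression bundle.
model       := LR.BinomialLogisticRegression().GetModel(independents, dependents);
predictions := LR.BinomialLogisticRegression().Classify(model, independents);
OUTPUT(predictions, NAMED('LR_Predictions'));
```

The LearningTrees ClassificationForest follows the same GetModel/Classify pattern, so swapping in the Random Forest learner should only require changing the learner definition.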
ML Core ECL library: hpcc-systems/ML_Core
Logistic Regression: hpcc-systems/LogisticRegression
Random Forest: hpcc-systems/LearningTrees
Python libraries: Pandas, NumPy, Seaborn, and matplotlib.pyplot