Fraud-Prediction

Identify hosts with a substantial amount of fraudulent traffic, using the impression logs for selected IP addresses.

Requirements

A one-class SVM is used to predict fraudulent traffic (labeled +1).
Using bucketed time_stamps as features increases test accuracy by more than 20%, a good indication that fraudulent activity is well clustered in time. This observation could be extended to find botnets.
Oversampling with SMOTE was tried to balance the highly imbalanced dataset, but it did not improve the results and was therefore omitted.
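
As a rough illustration of what bucketing timestamps into features might look like, here is a minimal sketch. It is not the notebook's actual code: the one-hour bucket width, the `(ip, unix_timestamp)` log layout, and the names `time_bucket` and `bucket_features` are all assumptions made for this example.

```python
from collections import Counter

BUCKET_SECONDS = 3600                      # assumed bucket width: one hour
BUCKETS_PER_DAY = 86400 // BUCKET_SECONDS  # 24 buckets with the width above


def time_bucket(ts):
    """Map a Unix timestamp (seconds, UTC) to its bucket index within the day."""
    seconds_into_day = int(ts) % 86400
    return seconds_into_day // BUCKET_SECONDS


def bucket_features(timestamps):
    """Return a fixed-length vector of impression counts per time bucket."""
    counts = Counter(time_bucket(ts) for ts in timestamps)
    return [counts.get(b, 0) for b in range(BUCKETS_PER_DAY)]


# Hypothetical impression log: (ip, unix_timestamp) pairs.
impressions = [
    ("10.0.0.1", 1_700_000_000),
    ("10.0.0.1", 1_700_000_030),
    ("10.0.0.1", 1_700_000_090),
    ("10.0.0.2", 1_700_040_000),
]

# Group timestamps by IP, then build one feature vector per IP.
by_ip = {}
for ip, ts in impressions:
    by_ip.setdefault(ip, []).append(ts)

features = {ip: bucket_features(ts_list) for ip, ts_list in by_ip.items()}
```

Each IP ends up with a 24-element count vector, so a burst of impressions inside a single hour shows up as one large entry. Vectors like these could then be passed to a one-class SVM, for example scikit-learn's `OneClassSVM`.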
