GitHub - JanAndrosiuk/fraud-detection-transaction-data: Pipeline for analyzing fraud in card transaction data-sets with an addition of graph features, modeled using Random Forest

About the project

Although the number of transaction fraud events grows slower than the number of transactions in total, it is still a problem for many institutions. Detecting fraudulent transactions is challenging for multiple reasons, including a general lack of labels, class imbalance, and hidden and evolving fraud patterns. Even more difficulties emerge while modeling public transaction datasets, namely feature anonymization, missing information, and data aggregation. This work suggests a pipeline of modeling fraudulent transactions, which accounts for most of those concerns based on other researchers’ experience. From the modeling approaches, one can distinguish those based on transaction features and those using graph anomaly detection methods. This research combines both methods and presents cross-validation results over two datasets. Performance scores did not indicate the superior predictive power of any presented approach. Nevertheless, the addition of graph features in the case of the second dataset significantly improved validation scores and therefore indicated the direction for further research.

Links

[Vesta raw dataset]

[Elliptic raw dataset]

[project directory structure]

[miceforest imputation method]

[Explanation of HITS algorithm]

[Great YouTube channel explaining centrality and community algorithms]

Further research

Optimize hyperparameter tuning using cuML API to train models
Entity embedding method applied within cross validation function
Evaluate Graph Neural Network (GNN) methods

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
models		models
notebooks		notebooks
reports/figures		reports/figures
src		src
CITATION.cff		CITATION.cff
LICENSE		LICENSE
README.md		README.md
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About the project

Links

Further research

About

Releases

Packages

Languages

License

JanAndrosiuk/fraud-detection-transaction-data

Folders and files

Latest commit

History

Repository files navigation

About the project

Links

Further research

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages