A plug-in to create decision tree algorithms in Neo4j with the following splitting criteria
- Gini Index
- Information Gain
- Gain Ratio
DTP comprises of 19 procedures, which read CSV-files, map nodes, split data, generate decision tree using 3 different metrics , perform k-fold cross validation, validate the classifier and visualize tree.
Data (check wiki)
Citation
Davide Chicco, Giuseppe Jurman:
"Machine learning can predict survival of patients with
heart failure from serum creatinine and ejection fraction alone."
BMC Medical Informatics and Decision Making 20, 16 (2020).
(https://www.kaggle.com/andrewmvd/heart-failure-clinical-data)
License
CC BY 4.0
Citation
Lehmann, T et al.
“Metaproteomics of fecal samples of Crohn's disease and Ulcerative Colitis.”
Journal of proteomics vol. 201 (2019): 93-103.
doi:10.1016/j.jprot.2019.04.009
(https://pubmed.ncbi.nlm.nih.gov/31009805/)
Citation
Li, W., Ma, J., Shende, N. et al.
"Using machine learning of clinical data to diagnose COVID-19:
a systematic review and meta-analysis."
BMC Med Inform Decis Mak 20, 247 (2020).
https://doi.org/10.1186/s12911-020-01266-z
(https://github.com/yoshihiko1218/COVID19ML)
Citation
Alex Teboul
(https://www.kaggle.com/alexteboul/diabetes-health-indicators-dataset)
BRFSS 2015
(https://www.kaggle.com/cdc/behavioral-risk-factor-surveillance-system)