benchm-ml/x1-data-higgs at master · szilard/benchm-ml

History

Name		Name	Last commit message	Last commit date
parent directory ..
1-getdata.txt		1-getdata.txt
2a-glmnet.R		2a-glmnet.R
2b-rf-h2o.R		2b-rf-h2o.R
2c-DL-h2o-v3_6.R		2c-DL-h2o-v3_6.R
3a-AUC.R		3a-AUC.R
3a-AUC.csv		3a-AUC.csv
3a-AUC.png		3a-AUC.png
README.md		README.md

README.md

Experiments on the Higgs dataset

Binary classification of Higgs vs background with 11 million records and 28 numeric features (simulated data) by Baldi etal https://archive.ics.uci.edu/ml/datasets/HIGGS.

Learning curves of linear (logistic regression) vs non-linear models (random forest and deep learning):

The linear model is less accurate than the more complex ones and it tops-off in accuracy quickly. (The DL models have been trained without extensive search in the hyper-parameter space and with the same hyper-parameter values accross all the different sizes, therefore not necessarily optimal. TBD if better models can be obtained for the smaller data sizes.)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

x1-data-higgs

x1-data-higgs

README.md

Experiments on the Higgs dataset

Files

x1-data-higgs

Directory actions

More options

Directory actions

More options

Latest commit

History

x1-data-higgs

Folders and files

parent directory

README.md

Experiments on the Higgs dataset