README

The Higgs Machine Learning Challenge (HiggsML) is put on by CERN to help improve the methods used to detect the Higgs boson.

From the challenge website: http://higgsml.lal.in2p3.fr/...

This script takes the test, training, and random_submission datasets (CSV files) from the HiggsML challenge and imports each of them into its own MongoDB database.

How do I get set up?

  • Summary of set up
  • Configuration
  • Dependencies
  • Database configuration
    • To start MongoDB (using the default configuration and install guidelines from the MongoDB install guide above), run:
      • /bin/mongod --conf=
  • Deployment instructions
    • Start up mongo as shown above
    • Copy the script hml_to_mdb.py to a directory on your hard drive, and place the dataset files test.csv, training.csv, and random_submission.csv in the same directory.
    • Run python hml_to_mdb.py <test | training | random_submission>. This creates a database named after the chosen dataset: higgsml_test, higgsml_training, or higgsml_random_submission. Each row of the .csv file becomes a document in that database's "events" collection, with the row's "EventId" value used as the document's "_id".
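The import step described above can be sketched roughly as follows. This is not the contents of hml_to_mdb.py, only a minimal illustration of the row-to-document mapping. It assumes the pymongo driver is installed and that mongod is listening on localhost:27017. The helper names (row_to_document, load_documents, import_dataset) and the float conversion of numeric columns are assumptions made for the example.

```python
import csv


def row_to_document(row):
    """Turn one CSV row (a dict of strings) into a MongoDB-style document."""
    doc = {}
    for key, value in row.items():
        if key == "EventId":
            # The EventId column takes the place of MongoDB's "_id" key.
            doc["_id"] = int(value)
        else:
            try:
                doc[key] = float(value)  # assumption: store numbers as floats
            except ValueError:
                doc[key] = value  # non-numeric columns stay as strings
    return doc


def load_documents(csv_file):
    """Read every row of an open CSV file into a list of documents."""
    return [row_to_document(row) for row in csv.DictReader(csv_file)]


def import_dataset(name, host="localhost", port=27017):
    """Load <name>.csv into the "events" collection of higgsml_<name>."""
    # Imported here so the helpers above work without pymongo installed.
    from pymongo import MongoClient

    with open(name + ".csv", newline="") as f:
        docs = load_documents(f)
    if docs:  # insert_many rejects an empty list
        client = MongoClient(host, port)
        client["higgsml_" + name]["events"].insert_many(docs)
    return len(docs)
```

Calling import_dataset("training") from the directory that holds training.csv would then fill the events collection of higgsml_training. Because "_id" must be unique, running the import twice against the same database would raise a duplicate key error unless the collection is dropped first.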

Contribution guidelines

  • Do what you wish...

Who do I talk to?