Skip to content

DeepLearnPhysics/spine_workshop_2024

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

74 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SPINE Workshop 2024

This repository contains all necessary resources to participate in the 2024 SPINE workshop organized for the SBN and 2x2 ML reconstruction groups. This workshop aims to train new comers to use SPINE, our machine-learning-based particle imaging detector reconstruction chain. You can find the workshop agenda here.

Software environment

For the workshop, we will use this "Docker container".

Some notes below:

  • The image is fairly large (multiple GBs). Please download in advance if you are using it locally. It is used in both NVIDIA GPU and CPU running mode of our software.
  • Supported GPUs include those with NVIDIA Volta (e.g. V100), Turing (e.g. RTX 2080Ti), and Ampere architectures (e.g. A100, RTX 3080). If you want an older architectures to be supported, such as Pascal, please contact Kazu.
  • We assume basic knowledge about software container, in particular Docker. If you are learning for the first time, we recommend to use/learn about Apptainer (website) instead of Docker.
    • You can pull a apptainer image as follows
$ apptainer pull docker://deeplearnphysics/larcv2:ub20.04-cuda11.6-pytorch1.13-larndsim

You can now launch a shell inside the apptainer with

$ apptainer exec --bind /path/to/workshop/folder/ larcv2_ub20.04-cuda11.6-pytorch1.13-larndsim.sif bash

For nersc:

$ salloc --nodes 1 --qos shared_interactive --time 00:30:00 --constraint gpu --gpus 1 --account=dune --image=deeplearnphysics/larcv2:ub20.04-cuda11.6-pytorch1.13-larndsim shifter /bin/bash

Docker alternative

You can also pull the docker image using docker (easier on Mac and Windows) directly with:

$ docker pull deeplearnphysics/larcv2:ub20.04-cuda11.6-pytorch1.13-larndsim

To see which images are present on your system, you can use docker images. It will look something like this:

$ docker images
REPOSITORY                TAG                                     IMAGE ID       CREATED        SIZE
deeplearnphysics/larcv2   ub20.04-cuda11.6-pytorch1.13-larndsim   cd28cb3cd04b   2 months ago   20.8GB

to run a shell in your image, simply do:

$ docker run -i -t cd28cb3cd04b bash
  • Ask Francois for questions or a request for a separate tutorial if interested.

Resources

  1. The configuration files are packages with this repository.

  2. You can find data files for the examples used in this workshop under:

  • S3DF
/sdf/data/neutrino/public_html/spine_workshop/larcv/ # Example MPV/MPR LArCV files prior to reconstruction
/sdf/data/neutrino/public_html/spine_workshop/reco/  # Reconstructed HDF5 files
  • NERSC
/global/cfs/cdirs/dune/users/drielsma/spine_workshop/larcv/ # Example MPV/MPR LArCV files prior to reconstruction
/global/cfs/cdirs/dune/users/drielsma/spine_workshop/reco/  # Reconstructed HDF5 files
  1. The network model parameters for the inference tutorial can be found at:
  • S3DF
/sdf/data/neutrino/public_html/spine_workshop/weights/generic_snapshot-4999.ckpt # Generic
/sdf/data/neutrino/public_html/spine_workshop/weights/icarus_snapshot-7999.ckpt # ICARUS
/sdf/data/neutrino/public_html/spine_workshop/weights/sbnd_snapshot-1999.ckpt # SBND
/sdf/data/neutrino/public_html/spine_workshop/weights/2x2_snapshot-3999.ckpt # 2x2
  • NERSC
/global/cfs/cdirs/dune/users/drielsma/spine_workshop/weights/generic_snapshot-4999.ckpt # Generic
/global/cfs/cdirs/dune/users/drielsma/spine_workshop/weights/icarus_snapshot-7999.ckpt # ICARUS
/global/cfs/cdirs/dune/users/drielsma/spine_workshop/weights/sbnd_snapshot-1999.ckpt # SBND
/global/cfs/cdirs/dune/users/drielsma/spine_workshop/weights/2x2_snapshot-3999.ckpt # 2x2

Computing resource

Most of the notebooks can be run strictly on CPU. The following notebooks will run significantly slower on CPU, however:

  • Training/validation notebook
  • Inference and HDF5 file making notebook

For all other notebooks, you can run them locally, provided that you download:

To gain access to GPUs:

  • SBN collaborators also have access to the Wilson Cluster at FNAL, equipped with GPUs. Below is a few commands to log-in and load Apptainer with which you can run a container image for the workshop (see the previous section). For how-to utilize the Wilson Cluster, refer to their website as well as this and that documentation from NOvA (replace nova with icarus or sbnd and most commands should just work).
$ ssh $USER@wc.fnal.gov
$ module load apptainer
$ apptainer --version
apptainer version 3.6.4

About

Resources for the 2024 SPINE ML Workshop at Tufts

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •