Model Card for mlpf-clic-clusters-v1.6
This model reconstructs particles in a detector, based on the tracks and calorimeter clusters recorded by the detector.
Model Details
Model Description
- Developed by: Joosep Pata, Eric Wulff, Farouk Mokhtar, Mengke Zhang, David Southwick, Maria Girone, David Southwick, Javier Duarte
- Model type: graph neural network with learnable structure in locality-sensitive hashing bins
- License: Apache License
Model Sources
- Repository: https://github.com/jpata/particleflow/releases/tag/v1.6
- Paper: https://doi.org/10.48550/arXiv.2309.06782
Uses
Direct Use
This model may be used to study the physics and computational performance on ML-based reconstruction in simulation.
Out-of-Scope Use
This model is not intended for physics measurements on real data.
Bias, Risks, and Limitations
The model has only been trained on simulation data and has not been validated against real data.
How to Get Started with the Model
Use the code below to get started with the model.
git clone https://github.com/jpata/particleflow/releases/tag/v1.6
cd particleflow
#Download the software image
wget https://hep.kbfi.ee/~joosep/tf-2.14.0.simg
#Download the checkpoint
wget https://huggingface.co/jpata/particleflow/resolve/clic_clusters_v1.6/weights-96-5.346523.hdf5
wget https://huggingface.co/jpata/particleflow/resolve/clic_clusters_v1.6/opt-96-5.346523.pkl
#Launch a shell in the software image
apptainer shell --nv tf-2.14.0.simg
#Continue the training from a checkpoint
python3 mlpf/pipeline.py train --config parameters/clic.yaml --weights weights-96-5.346523.hdf5 --batch-multiplier 0.5
#Run the evaluation for a given training directory, loading the best weight file in the directory
python3 mlpf/pipeline.py evaluate --train-dir experiments/clic-REPLACEME
Training Details
Training Data
Trained on the following dataset: Pata, J., Wulff, E., Duarte, J., Mokhtar, F., Zhang, M., Girone, M., & Southwick, D. (2023). Simulated datasets for detector and particle flow reconstruction: CLIC detector, machine learning format (v1.5.0) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.8409592
Training Procedure
python3 mlpf/pipeline.py train --config parameters/clic.yaml
Evaluation
python3 mlpf/pipeline.py evaluate --train-dir experiments/clic-REPLACEME
Citation
BibTeX:
@misc{pata2023scalable,
title={Scalable neural network models and terascale datasets for particle-flow reconstruction},
author={Joosep Pata and Eric Wulff and Farouk Mokhtar and David Southwick and Mengke Zhang and Maria Girone and Javier Duarte},
year={2023},
eprint={2309.06782},
archivePrefix={arXiv},
primaryClass={physics.data-an}
}
Glossary
PF - particle flow reconstruction
Model Card Contact
Joosep Pata, [email protected]