|
# Model Card for mlpf-clic-clusters-v1.6 |
|
|
|
This model reconstructs particles in a detector, based on the tracks and calorimeter clusters recorded by the detector. |
|
|
|
## Model Details |
|
|
|
### Model Description |
|
|
|
<!-- Provide a longer summary of what this model is. --> |
|
|
|
- **Developed by:** Joosep Pata, Eric Wulff, Farouk Mokhtar, Mengke Zhang, David Southwick, Maria Girone, David Southwick, Javier Duarte |
|
- **Model type:** graph neural network with learnable structure in locality-sensitive hashing bins |
|
- **License:** Apache License |
|
|
|
### Model Sources |
|
|
|
<!-- Provide the basic links for the model. --> |
|
|
|
- **Repository:** https://github.com/jpata/particleflow/releases/tag/v1.6 |
|
- **Paper:** https://doi.org/10.48550/arXiv.2309.06782 |
|
|
|
## Uses |
|
|
|
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. --> |
|
|
|
### Direct Use |
|
|
|
<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. --> |
|
|
|
This model may be used to study the physics and computational performance on ML-based reconstruction in simulation. |
|
|
|
### Out-of-Scope Use |
|
|
|
<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. --> |
|
|
|
This model is not intended for physics measurements on real data. |
|
|
|
## Bias, Risks, and Limitations |
|
|
|
<!-- This section is meant to convey both technical and sociotechnical limitations. --> |
|
|
|
The model has only been trained on simulation data and has not been validated against real data. |
|
|
|
## How to Get Started with the Model |
|
|
|
Use the code below to get started with the model. |
|
|
|
``` |
|
git clone https://github.com/jpata/particleflow/releases/tag/v1.6 |
|
cd particleflow |
|
|
|
#Download the software image |
|
wget https://hep.kbfi.ee/~joosep/tf-2.14.0.simg |
|
|
|
#Download the checkpoint |
|
wget https://huggingface.co/jpata/particleflow/resolve/clic_clusters_v1.6/weights-96-5.346523.hdf5 |
|
wget https://huggingface.co/jpata/particleflow/resolve/clic_clusters_v1.6/opt-96-5.346523.pkl |
|
|
|
#Launch a shell in the software image |
|
apptainer shell --nv tf-2.14.0.simg |
|
|
|
#Continue the training from a checkpoint |
|
python3 mlpf/pipeline.py train --config parameters/clic.yaml --weights weights-96-5.346523.hdf5 --batch-multiplier 0.5 |
|
|
|
#Run the evaluation for a given training directory, loading the best weight file in the directory |
|
python3 mlpf/pipeline.py evaluate --train-dir experiments/clic-REPLACEME |
|
``` |
|
|
|
## Training Details |
|
|
|
### Training Data |
|
|
|
Trained on the following dataset: |
|
Pata, J., Wulff, E., Duarte, J., Mokhtar, F., Zhang, M., Girone, M., & Southwick, D. (2023). Simulated datasets for detector and particle flow reconstruction: CLIC detector, machine learning format (v1.5.0) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.8409592 |
|
|
|
### Training Procedure |
|
|
|
``` |
|
python3 mlpf/pipeline.py train --config parameters/clic.yaml |
|
``` |
|
|
|
## Evaluation |
|
|
|
``` |
|
python3 mlpf/pipeline.py evaluate --train-dir experiments/clic-REPLACEME |
|
``` |
|
|
|
## Citation |
|
|
|
**BibTeX:** |
|
|
|
``` |
|
@misc{pata2023scalable, |
|
title={Scalable neural network models and terascale datasets for particle-flow reconstruction}, |
|
author={Joosep Pata and Eric Wulff and Farouk Mokhtar and David Southwick and Mengke Zhang and Maria Girone and Javier Duarte}, |
|
year={2023}, |
|
eprint={2309.06782}, |
|
archivePrefix={arXiv}, |
|
primaryClass={physics.data-an} |
|
} |
|
``` |
|
## Glossary |
|
|
|
PF - particle flow reconstruction |
|
|
|
## Model Card Contact |
|
|
|
Joosep Pata, [email protected] |
|
|
|
|
|
|
|
|