Joosep Pata committed
Commit 1742c16 · 1 Parent(s): e1748d1

added clic clusters v1.9.0

This view is limited to 50 files because it contains too many changes.
Files changed (50)
  1. clic/clusters/v1.9.0/.gitattributes +5 -0
  2. clic/clusters/v1.9.0/README.md +133 -0
  3. clic/clusters/v1.9.0/checkpoints/checkpoint-26-2.004527.pth +3 -0
  4. clic/clusters/v1.9.0/history/epoch_1.json +1 -0
  5. clic/clusters/v1.9.0/history/epoch_10.json +1 -0
  6. clic/clusters/v1.9.0/history/epoch_11.json +1 -0
  7. clic/clusters/v1.9.0/history/epoch_12.json +1 -0
  8. clic/clusters/v1.9.0/history/epoch_13.json +1 -0
  9. clic/clusters/v1.9.0/history/epoch_14.json +1 -0
  10. clic/clusters/v1.9.0/history/epoch_15.json +1 -0
  11. clic/clusters/v1.9.0/history/epoch_16.json +1 -0
  12. clic/clusters/v1.9.0/history/epoch_17.json +1 -0
  13. clic/clusters/v1.9.0/history/epoch_18.json +1 -0
  14. clic/clusters/v1.9.0/history/epoch_19.json +1 -0
  15. clic/clusters/v1.9.0/history/epoch_2.json +1 -0
  16. clic/clusters/v1.9.0/history/epoch_20.json +1 -0
  17. clic/clusters/v1.9.0/history/epoch_21.json +1 -0
  18. clic/clusters/v1.9.0/history/epoch_22.json +1 -0
  19. clic/clusters/v1.9.0/history/epoch_23.json +1 -0
  20. clic/clusters/v1.9.0/history/epoch_24.json +1 -0
  21. clic/clusters/v1.9.0/history/epoch_25.json +1 -0
  22. clic/clusters/v1.9.0/history/epoch_26.json +1 -0
  23. clic/clusters/v1.9.0/history/epoch_3.json +1 -0
  24. clic/clusters/v1.9.0/history/epoch_4.json +1 -0
  25. clic/clusters/v1.9.0/history/epoch_5.json +1 -0
  26. clic/clusters/v1.9.0/history/epoch_6.json +1 -0
  27. clic/clusters/v1.9.0/history/epoch_7.json +1 -0
  28. clic/clusters/v1.9.0/history/epoch_8.json +1 -0
  29. clic/clusters/v1.9.0/history/epoch_9.json +1 -0
  30. clic/clusters/v1.9.0/hyperparameters.json +1 -0
  31. clic/clusters/v1.9.0/mlpf_losses.pkl +3 -0
  32. clic/clusters/v1.9.0/model_kwargs.pkl +3 -0
  33. clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_eta.pdf +0 -0
  34. clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_eta.png +0 -0
  35. clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_pt.pdf +0 -0
  36. clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_pt.png +0 -0
  37. clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_pt_log.pdf +0 -0
  38. clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_pt_log.png +0 -0
  39. clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_res.pdf +0 -0
  40. clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_res.png +0 -0
  41. clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_res_bins_0p5_1p5.pdf +0 -0
  42. clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_res_bins_0p5_1p5.png +0 -0
  43. clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_res_bins_0p5_1p5_vstarget.pdf +0 -0
  44. clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_res_bins_0p5_1p5_vstarget.png +0 -0
  45. clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_res_vstarget.pdf +0 -0
  46. clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_res_vstarget.png +0 -0
  47. clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_response_binned.pdf +0 -0
  48. clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_response_binned.png +0 -0
  49. clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_response_binned_eta.pdf +0 -0
  50. clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_response_binned_eta.png +0 -0
clic/clusters/v1.9.0/.gitattributes ADDED
@@ -0,0 +1,5 @@
1
+ checkpoints filter=lfs diff=lfs merge=lfs -text
2
+ plots_checkpoint-26-2.004527 filter=lfs diff=lfs merge=lfs -text
3
+ runs filter=lfs diff=lfs merge=lfs -text
4
+ model_kwargs.pkl filter=lfs diff=lfs merge=lfs -text
5
+ mlpf_losses.pkl filter=lfs diff=lfs merge=lfs -text
clic/clusters/v1.9.0/README.md ADDED
@@ -0,0 +1,133 @@
1
+ # Model Card for mlpf-clic-clusters-v1.9.0
2
+
3
+ This model reconstructs particles in a detector, based on the tracks and calorimeter clusters recorded by the detector.
4
+
5
+ ## Model Details
6
+
7
+ ### Model Description
8
+
9
+ - **Developed by:** Joosep Pata, Eric Wulff, Farouk Mokhtar, Mengke Zhang, David Southwick, Maria Girone, Javier Duarte
10
+ - **Model type:** transformer
11
+ - **License:** Apache License
12
+
13
+ ### Model Sources
14
+
15
+ - **Repository:** https://github.com/jpata/particleflow/releases/tag/v1.9.0
16
+
17
+ ## Uses
18
+ ### Direct Use
19
+
20
+ This model may be used to study the physics and computational performance of ML-based reconstruction in simulation.
21
+
22
+ ### Out-of-Scope Use
23
+
24
+ This model is not intended for physics measurements on real data.
25
+
26
+ ## Bias, Risks, and Limitations
27
+
28
+ The model has only been trained on simulation data and has not been validated against real data.
29
+ The model has not been peer reviewed or published in a journal.
30
+
31
+ ## How to Get Started with the Model
32
+
33
+ Use the code below to get started with the model.
34
+
35
+ ```
36
+ git clone --branch v1.9.0 https://github.com/jpata/particleflow.git
37
+ cd particleflow
38
+ ```
39
+
40
+ ## Training Details
41
+ Trained on 8x MI250X for 26 epochs over ~3 days.
42
+ The training was continued twice from a checkpoint due to the 24h time limit.
43
+
44
+ ### Training Data
45
+ The following datasets were used:
46
+ ```
47
+ clic_edm_qq_pf:2.2.0
48
+ clic_edm_ttbar_pf:2.2.0
49
+ clic_edm_ww_fullhad_pf:2.2.0
50
+ ```
51
+ They have an updated truth and target definition: https://github.com/jpata/particleflow/pull/345
52
+
53
+ ### Training Procedure
54
+
55
+ ```bash
56
+ #!/bin/bash
57
+ #SBATCH --job-name=mlpf-train
58
+ #SBATCH --account=project_465000301
59
+ #SBATCH --time=1-00:00:00
60
+ #SBATCH --nodes=1
61
+ #SBATCH --ntasks-per-node=1
62
+ #SBATCH --cpus-per-task=32
63
+ #SBATCH --mem=200G
64
+ #SBATCH --gpus-per-task=8
65
+ #SBATCH --partition=standard-g
66
+ #SBATCH --no-requeue
67
+ #SBATCH -o logs/slurm-%x-%j-%N.out
68
+
69
+ cd /scratch/project_465000301/particleflow
70
+
71
+ module load LUMI/24.03 partition/G
72
+
73
+ export IMG=/scratch/project_465000301/pytorch-rocm6.2.simg
74
+ export PYTHONPATH=hep_tfds
75
+ export TFDS_DATA_DIR=/scratch/project_465000301/tensorflow_datasets
76
+ #export MIOPEN_DISABLE_CACHE=true
77
+ export MIOPEN_USER_DB_PATH=/tmp/${USER}-${SLURM_JOB_ID}-miopen-cache
78
+ export MIOPEN_CUSTOM_CACHE_DIR=${MIOPEN_USER_DB_PATH}
79
+ export TF_CPP_MAX_VLOG_LEVEL=-1 #to suppress ROCm fusion is enabled messages
80
+ export ROCM_PATH=/opt/rocm
81
+ #export NCCL_DEBUG=INFO
82
+ #export MIOPEN_ENABLE_LOGGING=1
83
+ #export MIOPEN_ENABLE_LOGGING_CMD=1
84
+ #export MIOPEN_LOG_LEVEL=4
85
+ export KERAS_BACKEND=torch
86
+
87
+ env
88
+
89
+ #PyTorch training
90
+ singularity exec \
91
+ --rocm \
92
+ -B /scratch/project_465000301 \
93
+ -B /tmp \
94
+ --env LD_LIBRARY_PATH=/opt/rocm/lib/ \
95
+ --env CUDA_VISIBLE_DEVICES=$ROCR_VISIBLE_DEVICES \
96
+ $IMG python3 mlpf/pyg_pipeline.py --dataset clic --gpus 8 \
97
+ --data-dir $TFDS_DATA_DIR --config parameters/pytorch/pyg-clic.yaml \
98
+ --train --gpu-batch-multiplier 128 --num-workers 8 --prefetch-factor 100 --checkpoint-freq 1 --conv-type attention --dtype bfloat16 --lr 0.0001 --num-epochs 30
99
+ ```
100
+
101
+ ## Evaluation
102
+ ```bash
103
+ #!/bin/bash
104
+ #SBATCH --partition gpu
105
+ #SBATCH --gres gpu:mig:1
106
+ #SBATCH --mem-per-gpu 200G
107
+ #SBATCH -o logs/slurm-%x-%j-%N.out
108
+
109
+ IMG=/home/software/singularity/pytorch.simg:2024-08-18
110
+ cd ~/particleflow
111
+
112
+ WEIGHTS=experiments/pyg-clic_20241001_215132_345408/checkpoints/checkpoint-26-2.004527.pth
113
+ singularity exec -B /scratch/persistent --nv \
114
+ --env PYTHONPATH=hep_tfds \
115
+ --env KERAS_BACKEND=torch \
116
+ $IMG python3 mlpf/pyg_pipeline.py --dataset clic --gpus 1 \
117
+ --data-dir /scratch/persistent/joosep/tensorflow_datasets --config parameters/pytorch/pyg-clic.yaml \
118
+ --test --make-plots --gpu-batch-multiplier 100 --load $WEIGHTS --dtype bfloat16 --prefetch-factor 10 --num-workers 8 --ntest 50000
119
+
120
+ ```
121
+
122
+ ## Citation
123
+
124
+ ## Glossary
125
+
126
+ PF - particle flow reconstruction
127
+
128
+ ## Model Card Contact
129
+
130
+ Joosep Pata, [email protected]
131
+
132
+
133
+
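The README's statement that training was continued twice from a checkpoint is consistent with the per-epoch timings recorded in the history files of this commit. The following is a rough check, not part of the repository. It assumes the 24 h Slurm limit from `#SBATCH --time=1-00:00:00`, a typical epoch duration of about 8,700 s (the `epoch_total_time` values shown in this view range from roughly 8,460 s to 8,850 s), and ignores job start-up overhead.

```python
import math

# Values taken from the README and history files; EPOCH_TOTAL_S is an
# assumed typical value, not an exact measurement.
TIME_LIMIT_S = 24 * 3600   # `#SBATCH --time=1-00:00:00`
EPOCH_TOTAL_S = 8700.0     # approximate `epoch_total_time` per epoch
NUM_EPOCHS_DONE = 26       # epochs completed according to the README

# Only whole epochs produce a checkpoint (`--checkpoint-freq 1`).
epochs_per_job = int(TIME_LIMIT_S // EPOCH_TOTAL_S)
jobs_needed = math.ceil(NUM_EPOCHS_DONE / epochs_per_job)
restarts = jobs_needed - 1
total_days = NUM_EPOCHS_DONE * EPOCH_TOTAL_S / 86400

print(f"{epochs_per_job} epochs per job, {jobs_needed} jobs, "
      f"{restarts} restarts, about {total_days:.1f} days of compute")
```

Under these assumptions about nine epochs fit into one job, so 26 epochs need three jobs, that is two restarts, and roughly 2.6 days of compute, in line with the "~3 days" quoted in the README.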
clic/clusters/v1.9.0/checkpoints/checkpoint-26-2.004527.pth ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:19b21ceb8ac376d3335e2ecd6cea2bcc6194bb4ebf3e6b183229298016c617e1
3
+ size 770400022
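The three lines above are a Git LFS pointer, not the checkpoint itself: `oid` is the SHA-256 digest of the real file and `size` is its length in bytes. A minimal sketch of how a downloaded checkpoint could be checked against this pointer is shown below. The helper names `sha256_of_file` and `matches_lfs_pointer` are hypothetical and not part of the particleflow repository.

```python
import hashlib
from pathlib import Path

# Values copied from the LFS pointer in this commit.
EXPECTED_OID = "19b21ceb8ac376d3335e2ecd6cea2bcc6194bb4ebf3e6b183229298016c617e1"
EXPECTED_SIZE = 770400022


def sha256_of_file(path, chunk_size=1 << 20):
    """Hash the file in chunks so a ~770 MB checkpoint is never fully in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_lfs_pointer(path, expected_oid=EXPECTED_OID, expected_size=EXPECTED_SIZE):
    """Return True if the file has the size and SHA-256 digest given in the pointer."""
    path = Path(path)
    # Compare the cheap size first; hash only if the size already agrees.
    return path.stat().st_size == expected_size and sha256_of_file(path) == expected_oid
```

For example, `matches_lfs_pointer("checkpoints/checkpoint-26-2.004527.pth")` would return `False` if the working copy still holds the small pointer file rather than the downloaded weights.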
clic/clusters/v1.9.0/history/epoch_1.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.28855742637612025, "Regression_eta": 0.0002762005724093934, "Regression_sin_phi": 0.00019368763134128313, "Regression_cos_phi": 0.00019232766693894507, "Regression_energy": 0.28378834009797993, "Classification_binary": 2.918180747208194, "Classification": 0.0343223749230364, "MET": 9.124021423109753, "Sliced_Wasserstein_Loss": 47.84192510135856, "Total": 3.5255116273383598}, "valid": {"Regression_pt": 0.24141291619568261, "Regression_eta": 0.00021294264955190934, "Regression_sin_phi": 0.00015518042555844432, "Regression_cos_phi": 0.0001549997895231015, "Regression_energy": 0.23510851597511204, "Classification_binary": 2.452336310119238, "Classification": 0.026016032649979242, "MET": 5.834425141045134, "Sliced_Wasserstein_Loss": 42.11706746158771, "Total": 2.9553976022327144}, "epoch_train_time": 8197.158108472824, "epoch_valid_time": 308.4101564884186, "epoch_total_time": 8511.027235984802}
clic/clusters/v1.9.0/history/epoch_10.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.1570690580211075, "Regression_eta": 0.00013947051045768962, "Regression_sin_phi": 0.00010044414141208826, "Regression_cos_phi": 0.00010078813470234971, "Regression_energy": 0.1515172943043602, "Classification_binary": 1.7574343503271925, "Classification": 0.013576094867986587, "MET": 4.725379427768689, "Sliced_Wasserstein_Loss": 35.40573209332101, "Total": 2.0799385069172773}, "valid": {"Regression_pt": 0.1653812813850432, "Regression_eta": 0.0001462171004462639, "Regression_sin_phi": 0.00010218210852252994, "Regression_cos_phi": 0.0001026289525624274, "Regression_energy": 0.15951390577797395, "Classification_binary": 1.7903946450364117, "Classification": 0.013539620764582167, "MET": 4.723702522307138, "Sliced_Wasserstein_Loss": 35.08691781370038, "Total": 2.129180087628041}, "epoch_train_time": 8185.318641901016, "epoch_valid_time": 308.51724004745483, "epoch_total_time": 8498.987010478973}
clic/clusters/v1.9.0/history/epoch_11.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.15499489595188135, "Regression_eta": 0.00013651650443220555, "Regression_sin_phi": 9.736275213339122e-05, "Regression_cos_phi": 9.780949655607862e-05, "Regression_energy": 0.14958861983426985, "Classification_binary": 1.7408203541770395, "Classification": 0.013324897365080308, "MET": 4.67430004801195, "Sliced_Wasserstein_Loss": 35.210896934348106, "Total": 2.05906233884878}, "valid": {"Regression_pt": 0.16356486028654169, "Regression_eta": 0.00014207559839253542, "Regression_sin_phi": 9.994619977916859e-05, "Regression_cos_phi": 0.00010007039838197442, "Regression_energy": 0.15774530271561898, "Classification_binary": 1.7749692401568502, "Classification": 0.013310355505167903, "MET": 4.633513661871799, "Sliced_Wasserstein_Loss": 34.864684198943664, "Total": 2.109930802856914}, "epoch_train_time": 8427.653780937195, "epoch_valid_time": 307.3030478954315, "epoch_total_time": 8739.897525548935}
clic/clusters/v1.9.0/history/epoch_12.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.15327184557601004, "Regression_eta": 0.00013454063525960644, "Regression_sin_phi": 9.546070301021449e-05, "Regression_cos_phi": 9.590576854824926e-05, "Regression_energy": 0.1479123345699374, "Classification_binary": 1.7266554397983498, "Classification": 0.01311810870149904, "MET": 4.642330424461199, "Sliced_Wasserstein_Loss": 35.04252613983925, "Total": 2.0412836083647488}, "valid": {"Regression_pt": 0.16172016842264525, "Regression_eta": 0.0001399508855101699, "Regression_sin_phi": 9.79120767986576e-05, "Regression_cos_phi": 9.878569314788154e-05, "Regression_energy": 0.15579385794079106, "Classification_binary": 1.7655068371879001, "Classification": 0.013171486604228979, "MET": 4.601477472791293, "Sliced_Wasserstein_Loss": 34.80858374679897, "Total": 2.096529014284571}, "epoch_train_time": 8400.829313993454, "epoch_valid_time": 309.892941236496, "epoch_total_time": 8714.384973287582}
clic/clusters/v1.9.0/history/epoch_13.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.15174396530358666, "Regression_eta": 0.00013281082443919982, "Regression_sin_phi": 9.398443768921376e-05, "Regression_cos_phi": 9.442353385959481e-05, "Regression_energy": 0.14641779592920906, "Classification_binary": 1.714163163943026, "Classification": 0.012943207314584407, "MET": 4.626692087808522, "Sliced_Wasserstein_Loss": 34.89128183014439, "Total": 2.0255891466320506}, "valid": {"Regression_pt": 0.1606722745028409, "Regression_eta": 0.00013824486473000463, "Regression_sin_phi": 9.698694555158042e-05, "Regression_cos_phi": 9.774587447450958e-05, "Regression_energy": 0.1548126455153149, "Classification_binary": 1.7547027548815621, "Classification": 0.013000990150832642, "MET": 4.577718619958387, "Sliced_Wasserstein_Loss": 34.71425606193982, "Total": 2.083521570652609}, "epoch_train_time": 8393.153034448624, "epoch_valid_time": 311.04902839660645, "epoch_total_time": 8707.63015127182}
clic/clusters/v1.9.0/history/epoch_14.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.15038838485912048, "Regression_eta": 0.00013126841275872933, "Regression_sin_phi": 9.267101570030086e-05, "Regression_cos_phi": 9.311190946435037e-05, "Regression_energy": 0.1450879857875738, "Classification_binary": 1.7030145558450103, "Classification": 0.012790050706022944, "MET": 4.591329119692012, "Sliced_Wasserstein_Loss": 34.75487676932926, "Total": 2.0115977476616402}, "valid": {"Regression_pt": 0.15958323460382323, "Regression_eta": 0.00013686762347569386, "Regression_sin_phi": 9.537931822631484e-05, "Regression_cos_phi": 9.583486256007196e-05, "Regression_energy": 0.15380082759295474, "Classification_binary": 1.7452042466089148, "Classification": 0.01284685391317409, "MET": 4.5404810789452625, "Sliced_Wasserstein_Loss": 34.525233074583866, "Total": 2.0717631204485434}, "epoch_train_time": 8388.53110742569, "epoch_valid_time": 310.5029287338257, "epoch_total_time": 8703.827124118805}
clic/clusters/v1.9.0/history/epoch_15.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.14918763461683796, "Regression_eta": 0.00012993866828013827, "Regression_sin_phi": 9.154266106708433e-05, "Regression_cos_phi": 9.203006332769732e-05, "Regression_energy": 0.14391726906196653, "Classification_binary": 1.6929264894818266, "Classification": 0.012657778201532938, "MET": 4.5700068683761295, "Sliced_Wasserstein_Loss": 34.64072835905826, "Total": 1.999001140283804}, "valid": {"Regression_pt": 0.15848333185369318, "Regression_eta": 0.0001358835362899624, "Regression_sin_phi": 9.43980643599653e-05, "Regression_cos_phi": 9.493836024048691e-05, "Regression_energy": 0.15265614488823923, "Classification_binary": 1.7365356914212549, "Classification": 0.012726887545420784, "MET": 4.549923475512164, "Sliced_Wasserstein_Loss": 34.462140384923174, "Total": 2.060725838518326}, "epoch_train_time": 8398.742284297943, "epoch_valid_time": 310.03809547424316, "epoch_total_time": 8713.863807439804}
clic/clusters/v1.9.0/history/epoch_16.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.148085172726323, "Regression_eta": 0.00012874122499970217, "Regression_sin_phi": 9.063202724813593e-05, "Regression_cos_phi": 9.1065128613217e-05, "Regression_energy": 0.1428479751488148, "Classification_binary": 1.683895908537236, "Classification": 0.012543750255271867, "MET": 4.546421831655879, "Sliced_Wasserstein_Loss": 34.525993136069424, "Total": 1.9876841569457286}, "valid": {"Regression_pt": 0.1575530830021857, "Regression_eta": 0.0001352393520321034, "Regression_sin_phi": 9.345126823640205e-05, "Regression_cos_phi": 9.434995516924791e-05, "Regression_energy": 0.15180414373224432, "Classification_binary": 1.7306445875180059, "Classification": 0.012630682443381882, "MET": 4.536211275108035, "Sliced_Wasserstein_Loss": 34.33787311939821, "Total": 2.052956321022727}, "epoch_train_time": 8399.565154075623, "epoch_valid_time": 310.80304169654846, "epoch_total_time": 8715.425117254257}
clic/clusters/v1.9.0/history/epoch_17.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.1470916615635447, "Regression_eta": 0.00012770938927557994, "Regression_sin_phi": 8.983731278315642e-05, "Regression_cos_phi": 9.02625963116228e-05, "Regression_energy": 0.1418828477865602, "Classification_binary": 1.6756351025144036, "Classification": 0.012440865490410947, "MET": 4.533450825538801, "Sliced_Wasserstein_Loss": 34.42968249875525, "Total": 1.9773576979159257}, "valid": {"Regression_pt": 0.15684377399205246, "Regression_eta": 0.00013390421943688973, "Regression_sin_phi": 9.29273044864591e-05, "Regression_cos_phi": 9.352003108524024e-05, "Regression_energy": 0.1511566220721881, "Classification_binary": 1.7246843990076826, "Classification": 0.012530567398755101, "MET": 4.5268632837508, "Sliced_Wasserstein_Loss": 34.247754281370035, "Total": 2.045535977762884}, "epoch_train_time": 8403.311542510986, "epoch_valid_time": 310.82231187820435, "epoch_total_time": 8718.01251411438}
clic/clusters/v1.9.0/history/epoch_18.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.14617070318056938, "Regression_eta": 0.00012670809568539055, "Regression_sin_phi": 8.914968232968105e-05, "Regression_cos_phi": 8.95520041427976e-05, "Regression_energy": 0.14099400677224108, "Classification_binary": 1.6680998937513336, "Classification": 0.012347367827520404, "MET": 4.5119476958709726, "Sliced_Wasserstein_Loss": 34.332651682196456, "Total": 1.9679167370545558}, "valid": {"Regression_pt": 0.15617713488628862, "Regression_eta": 0.00013323828444438158, "Regression_sin_phi": 9.209367896164273e-05, "Regression_cos_phi": 9.254811667907559e-05, "Regression_energy": 0.15052260082601432, "Classification_binary": 1.719816591309219, "Classification": 0.012460399681413678, "MET": 4.495553577144686, "Sliced_Wasserstein_Loss": 34.24505341709347, "Total": 2.0392947618237836}, "epoch_train_time": 8404.115213871002, "epoch_valid_time": 310.52317452430725, "epoch_total_time": 8719.601351261139}
clic/clusters/v1.9.0/history/epoch_19.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.14533011202232557, "Regression_eta": 0.00012578892329651698, "Regression_sin_phi": 8.847567797538117e-05, "Regression_cos_phi": 8.8866035615544e-05, "Regression_energy": 0.14017893411804716, "Classification_binary": 1.6610934276975602, "Classification": 0.012264539052636435, "MET": 4.497060935166086, "Sliced_Wasserstein_Loss": 34.2476349669251, "Total": 1.959171782754819}, "valid": {"Regression_pt": 0.15551067161804277, "Regression_eta": 0.00013254870983763633, "Regression_sin_phi": 9.167926068800512e-05, "Regression_cos_phi": 9.221706668179716e-05, "Regression_energy": 0.14994580492320042, "Classification_binary": 1.7138725017005443, "Classification": 0.012356766665332907, "MET": 4.476917301036331, "Sliced_Wasserstein_Loss": 34.143033270646605, "Total": 2.0320022719770328}, "epoch_train_time": 8394.753654956818, "epoch_valid_time": 312.4476134777069, "epoch_total_time": 8711.73755145073}
clic/clusters/v1.9.0/history/epoch_2.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.21122850992913791, "Regression_eta": 0.0001842088012185203, "Regression_sin_phi": 0.00014138987413874832, "Regression_cos_phi": 0.00014098804412012597, "Regression_energy": 0.2054923079466445, "Classification_binary": 2.2563553317714633, "Classification": 0.022674102533575023, "MET": 5.596587372857244, "Sliced_Wasserstein_Loss": 40.58198040401167, "Total": 2.6962171696066575}, "valid": {"Regression_pt": 0.20593208967494597, "Regression_eta": 0.00018043831353303566, "Regression_sin_phi": 0.00013638632169301013, "Regression_cos_phi": 0.00013634379328289348, "Regression_energy": 0.19939833871838988, "Classification_binary": 2.175941770116437, "Classification": 0.02032262506619306, "MET": 5.254159456025929, "Sliced_Wasserstein_Loss": 38.99773177416773, "Total": 2.6020479678497117}, "epoch_train_time": 8162.723821640015, "epoch_valid_time": 303.2779562473297, "epoch_total_time": 8471.801720142365}
clic/clusters/v1.9.0/history/epoch_20.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.14455884367331426, "Regression_eta": 0.00012502070637971632, "Regression_sin_phi": 8.792358778035015e-05, "Regression_cos_phi": 8.832373043510448e-05, "Regression_energy": 0.13943227258784407, "Classification_binary": 1.6546821097695426, "Classification": 0.012190093440617443, "MET": 4.480659352994523, "Sliced_Wasserstein_Loss": 34.170849189131516, "Total": 1.9511649288267303}, "valid": {"Regression_pt": 0.15504011088235736, "Regression_eta": 0.00013170978972609613, "Regression_sin_phi": 9.126108373478341e-05, "Regression_cos_phi": 9.165213904826498e-05, "Regression_energy": 0.14941190360602893, "Classification_binary": 1.7090343910051216, "Classification": 0.012279103873786487, "MET": 4.454801786571703, "Sliced_Wasserstein_Loss": 34.054997599231754, "Total": 2.0260800643705985}, "epoch_train_time": 8412.36587715149, "epoch_valid_time": 313.4527451992035, "epoch_total_time": 8730.999478816986}
clic/clusters/v1.9.0/history/epoch_21.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.14383612039274218, "Regression_eta": 0.00012429848920655407, "Regression_sin_phi": 8.74421040120984e-05, "Regression_cos_phi": 8.781489271791674e-05, "Regression_energy": 0.13873544985085176, "Classification_binary": 1.6487438262411978, "Classification": 0.012124398921578815, "MET": 4.46524718374351, "Sliced_Wasserstein_Loss": 34.09480804111246, "Total": 1.9437390528309269}, "valid": {"Regression_pt": 0.15456123693957066, "Regression_eta": 0.0001309682518510904, "Regression_sin_phi": 9.067468202068314e-05, "Regression_cos_phi": 9.10771755494473e-05, "Regression_energy": 0.1489220788933709, "Classification_binary": 1.7048164787732074, "Classification": 0.012230134346115758, "MET": 4.448659195942701, "Sliced_Wasserstein_Loss": 34.01774067701665, "Total": 2.020843701234395}, "epoch_train_time": 8404.843801498413, "epoch_valid_time": 310.6338655948639, "epoch_total_time": 8720.130534887314}
clic/clusters/v1.9.0/history/epoch_22.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.14318005432741038, "Regression_eta": 0.00012371700157167155, "Regression_sin_phi": 8.694718068741033e-05, "Regression_cos_phi": 8.729484547794743e-05, "Regression_energy": 0.1381092175997137, "Classification_binary": 1.6433301177181876, "Classification": 0.012064071406730676, "MET": 4.453553439967281, "Sliced_Wasserstein_Loss": 34.028144782701474, "Total": 1.9369783423874387}, "valid": {"Regression_pt": 0.15424380137581525, "Regression_eta": 0.00013039475748084114, "Regression_sin_phi": 9.027373393885458e-05, "Regression_cos_phi": 9.057365558211538e-05, "Regression_energy": 0.14856489428179517, "Classification_binary": 1.702117490096831, "Classification": 0.012184517942225886, "MET": 4.444493487916133, "Sliced_Wasserstein_Loss": 33.989771726952625, "Total": 2.0174222938840427}, "epoch_train_time": 8386.136945724487, "epoch_valid_time": 311.406387090683, "epoch_total_time": 8702.459220170975}
clic/clusters/v1.9.0/history/epoch_23.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.14256503146338645, "Regression_eta": 0.0001231370232371333, "Regression_sin_phi": 8.663441634753052e-05, "Regression_cos_phi": 8.690536119697257e-05, "Regression_energy": 0.1375177856766573, "Classification_binary": 1.6383480444021623, "Classification": 0.012010222283796189, "MET": 4.442663229514902, "Sliced_Wasserstein_Loss": 33.96384211181449, "Total": 1.930738317092254}, "valid": {"Regression_pt": 0.1538299287021847, "Regression_eta": 0.00012993425245321668, "Regression_sin_phi": 8.985695933562998e-05, "Regression_cos_phi": 9.030088419798242e-05, "Regression_energy": 0.14816165000925297, "Classification_binary": 1.6988944274667894, "Classification": 0.012136613208414193, "MET": 4.433752863416293, "Sliced_Wasserstein_Loss": 33.88206726152369, "Total": 2.013331453565141}, "epoch_train_time": 8396.534974336624, "epoch_valid_time": 311.26527166366577, "epoch_total_time": 8712.70936512947}
clic/clusters/v1.9.0/history/epoch_24.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.14201513765648338, "Regression_eta": 0.0001226024595803697, "Regression_sin_phi": 8.626847036855389e-05, "Regression_cos_phi": 8.655391467635938e-05, "Regression_energy": 0.13699767358764847, "Classification_binary": 1.6339384746870331, "Classification": 0.011962751701534604, "MET": 4.434276419908955, "Sliced_Wasserstein_Loss": 33.904125026673306, "Total": 1.9252086074756385}, "valid": {"Regression_pt": 0.1534394033434449, "Regression_eta": 0.0001294050608951212, "Regression_sin_phi": 8.954611462606511e-05, "Regression_cos_phi": 8.983096339180589e-05, "Regression_energy": 0.1477533162174396, "Classification_binary": 1.695526005821863, "Classification": 0.01208756553073546, "MET": 4.399779179337388, "Sliced_Wasserstein_Loss": 33.920249479833544, "Total": 2.0091141665332906}, "epoch_train_time": 8406.046130180359, "epoch_valid_time": 312.55001044273376, "epoch_total_time": 8723.192549228668}
clic/clusters/v1.9.0/history/epoch_25.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.1415397241924657, "Regression_eta": 0.00012215687681019446, "Regression_sin_phi": 8.594374185572037e-05, "Regression_cos_phi": 8.620610573916018e-05, "Regression_energy": 0.13654248217056245, "Classification_binary": 1.6300269455953482, "Classification": 0.011920963957279367, "MET": 4.424101109609503, "Sliced_Wasserstein_Loss": 33.860142968916705, "Total": 1.920324058432321}, "valid": {"Regression_pt": 0.1532194208396687, "Regression_eta": 0.00012923831480260694, "Regression_sin_phi": 8.962532355141242e-05, "Regression_cos_phi": 8.985207496012983e-05, "Regression_energy": 0.1475435029834547, "Classification_binary": 1.6944469105113635, "Classification": 0.012067438240393176, "MET": 4.404075866777369, "Sliced_Wasserstein_Loss": 33.846523387484, "Total": 2.0075852397767284}, "epoch_train_time": 8410.550054311752, "epoch_valid_time": 309.2099447250366, "epoch_total_time": 8724.720307826996}
clic/clusters/v1.9.0/history/epoch_26.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.14111298603763603, "Regression_eta": 0.00012179393247347551, "Regression_sin_phi": 8.570928307021866e-05, "Regression_cos_phi": 8.596142101986529e-05, "Regression_energy": 0.1361414709049808, "Classification_binary": 1.6266087337292838, "Classification": 0.011885722981330909, "MET": 4.416712881872821, "Sliced_Wasserstein_Loss": 33.819134984707304, "Total": 1.9160441043104062}, "valid": {"Regression_pt": 0.15294641355546273, "Regression_eta": 0.00012900540068573897, "Regression_sin_phi": 8.90286570169251e-05, "Regression_cos_phi": 8.930639863472132e-05, "Regression_energy": 0.1472566338415793, "Classification_binary": 1.691983778559139, "Classification": 0.012031799394556258, "MET": 4.3860394701304415, "Sliced_Wasserstein_Loss": 33.848111395646605, "Total": 2.0045267610635404}, "epoch_train_time": 8530.488191127777, "epoch_valid_time": 311.8221266269684, "epoch_total_time": 8846.962290525436}
clic/clusters/v1.9.0/history/epoch_3.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.18933616880756987, "Regression_eta": 0.00016692739706882602, "Regression_sin_phi": 0.00012729356007230375, "Regression_cos_phi": 0.0001269753605553415, "Regression_energy": 0.18358308762403086, "Classification_binary": 2.0838011349491428, "Classification": 0.01928422454241389, "MET": 5.262679266839747, "Sliced_Wasserstein_Loss": 38.709367664841025, "Total": 2.4764230208407425}, "valid": {"Regression_pt": 0.1919191289650188, "Regression_eta": 0.00016935879457622728, "Regression_sin_phi": 0.0001260681929264728, "Regression_cos_phi": 0.00012567015448239336, "Regression_energy": 0.18555543242587627, "Classification_binary": 2.0702720182958547, "Classification": 0.018398880805920396, "MET": 5.070550701224392, "Sliced_Wasserstein_Loss": 37.86263604353393, "Total": 2.4665668013764406}, "epoch_train_time": 8230.182272672653, "epoch_valid_time": 308.0108172893524, "epoch_total_time": 8543.054384231567}
clic/clusters/v1.9.0/history/epoch_4.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.1786143162298172, "Regression_eta": 0.00015857730314397346, "Regression_sin_phi": 0.00011952422456980786, "Regression_cos_phi": 0.00011932215638719995, "Regression_energy": 0.1728371491904652, "Classification_binary": 2.0002632599669252, "Classification": 0.017894842282273588, "MET": 5.108573689451597, "Sliced_Wasserstein_Loss": 37.76678640017071, "Total": 2.3700090355821892}, "valid": {"Regression_pt": 0.18352234043018767, "Regression_eta": 0.00016246749405366358, "Regression_sin_phi": 0.0001194455425962138, "Regression_cos_phi": 0.00011919402587734325, "Regression_energy": 0.1771486016150168, "Classification_binary": 1.9996420729633482, "Classification": 0.017335538827502925, "MET": 4.964968477412772, "Sliced_Wasserstein_Loss": 37.12014594670295, "Total": 2.378049569612276}, "epoch_train_time": 8214.041770219803, "epoch_valid_time": 306.90172290802, "epoch_total_time": 8526.108189105988}
clic/clusters/v1.9.0/history/epoch_5.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.17193268796900563, "Regression_eta": 0.00015361577503473782, "Regression_sin_phi": 0.00011403074440215711, "Regression_cos_phi": 0.00011392641613566356, "Regression_energy": 0.16614855739095064, "Classification_binary": 1.931910831140195, "Classification": 0.016684882933710725, "MET": 4.990084754427769, "Sliced_Wasserstein_Loss": 37.07969983640373, "Total": 2.287059918246319}, "valid": {"Regression_pt": 0.1783841141665333, "Regression_eta": 0.00015913582527378, "Regression_sin_phi": 0.00011479531184659267, "Regression_cos_phi": 0.00011529963315678009, "Regression_energy": 0.1722075282840509, "Classification_binary": 1.9372783665773048, "Classification": 0.016110067330920894, "MET": 4.881973794114116, "Sliced_Wasserstein_Loss": 36.527201204385406, "Total": 2.3043697420674616}, "epoch_train_time": 8175.288253068924, "epoch_valid_time": 308.4102942943573, "epoch_total_time": 8489.31802892685}
clic/clusters/v1.9.0/history/epoch_6.json ADDED
@@ -0,0 +1 @@
1
+ {"train": {"Regression_pt": 0.16743585695217475, "Regression_eta": 0.00015021810254176282, "Regression_sin_phi": 0.00011021218061430186, "Regression_cos_phi": 0.00011030857233516487, "Regression_energy": 0.16165704019667118, "Classification_binary": 1.8742821824542997, "Classification": 0.015562554987644146, "MET": 4.906483391421865, "Sliced_Wasserstein_Loss": 36.557209794437725, "Total": 2.2193079166370295}, "valid": {"Regression_pt": 0.17461406009297975, "Regression_eta": 0.000155569210400502, "Regression_sin_phi": 0.00011178953203440629, "Regression_cos_phi": 0.00011252947676349693, "Regression_energy": 0.1683838633050076, "Classification_binary": 1.8830095943201826, "Classification": 0.015142072025548176, "MET": 4.854947958346671, "Sliced_Wasserstein_Loss": 36.03524377800896, "Total": 2.2415282265324903}, "epoch_train_time": 8179.8211143016815, "epoch_valid_time": 309.2634208202362, "epoch_total_time": 8493.771128416061}
clic/clusters/v1.9.0/history/epoch_7.json ADDED
@@ -0,0 +1 @@
+ {"train": {"Regression_pt": 0.16412609221628494, "Regression_eta": 0.00014746108564885177, "Regression_sin_phi": 0.00010745920830714481, "Regression_cos_phi": 0.00010773636842148768, "Regression_energy": 0.15834741438424674, "Classification_binary": 1.8304277953624013, "Classification": 0.014763990348361703, "MET": 4.840092423003059, "Sliced_Wasserstein_Loss": 36.152082296038124, "Total": 2.168029042783982}, "valid": {"Regression_pt": 0.17170873585797455, "Regression_eta": 0.0001519108513100657, "Regression_sin_phi": 0.00010872216329989756, "Regression_cos_phi": 0.00010975258527430926, "Regression_energy": 0.1653870644856354, "Classification_binary": 1.8506965979113317, "Classification": 0.014529290486236846, "MET": 4.790179732514405, "Sliced_Wasserstein_Loss": 35.666950924295776, "Total": 2.202691423755602}, "epoch_train_time": 8176.256416082382, "epoch_valid_time": 307.4676456451416, "epoch_total_time": 8488.562541007996}
clic/clusters/v1.9.0/history/epoch_8.json ADDED
@@ -0,0 +1 @@
+ {"train": {"Regression_pt": 0.16152325689949498, "Regression_eta": 0.00014520841248895732, "Regression_sin_phi": 0.0001053959608196712, "Regression_cos_phi": 0.00010573406105521767, "Regression_energy": 0.15574801742322533, "Classification_binary": 1.800247089275553, "Classification": 0.014262616647087719, "MET": 4.7838134157834835, "Sliced_Wasserstein_Loss": 35.859967369656445, "Total": 2.1321396936570878}, "valid": {"Regression_pt": 0.16928602699738918, "Regression_eta": 0.00015235471893364276, "Regression_sin_phi": 0.00010825148884984503, "Regression_cos_phi": 0.00010889581620464252, "Regression_energy": 0.16294416361673236, "Classification_binary": 1.825221883502721, "Classification": 0.014123256014190165, "MET": 4.73461757762484, "Sliced_Wasserstein_Loss": 35.51880101632523, "Total": 2.1719445535071222}, "epoch_train_time": 8153.650901317596, "epoch_valid_time": 304.9373595714569, "epoch_total_time": 8463.484191894531}
clic/clusters/v1.9.0/history/epoch_9.json ADDED
@@ -0,0 +1 @@
+ {"train": {"Regression_pt": 0.15929947425803756, "Regression_eta": 0.00014303083126079782, "Regression_sin_phi": 0.00010358894674289073, "Regression_cos_phi": 0.000104017520206939, "Regression_energy": 0.15354411820275268, "Classification_binary": 1.7768532112436872, "Classification": 0.013880704101979271, "MET": 4.743960719112312, "Sliced_Wasserstein_Loss": 35.629550021338645, "Total": 2.103929755850345}, "valid": {"Regression_pt": 0.1672485195262484, "Regression_eta": 0.00014830634436747788, "Regression_sin_phi": 0.00010625061205804119, "Regression_cos_phi": 0.0001067290137427717, "Regression_energy": 0.16099440883582747, "Classification_binary": 1.8077555130141645, "Classification": 0.013808118549107589, "MET": 4.711372014044494, "Sliced_Wasserstein_Loss": 35.31911211587708, "Total": 2.1501678974771927}, "epoch_train_time": 8167.765163898468, "epoch_valid_time": 308.7130310535431, "epoch_total_time": 8481.429405450821}
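The per-epoch history files above share one layout: a `train` and a `valid` dictionary of loss terms (each with a `Total` entry) plus timing fields. A minimal sketch of how they could be collected into a loss curve follows. The helper names `epoch_number`, `load_history`, and `generalization_gap` are hypothetical and are not part of the repository; the only assumptions taken from the files are the `epoch_<N>.json` naming and the `train`/`valid`/`Total` keys.

```python
import json
import re
from pathlib import Path


def epoch_number(filename):
    """Return the integer N from 'epoch_N.json', or None for any other name."""
    match = re.fullmatch(r"epoch_(\d+)\.json", filename)
    return int(match.group(1)) if match else None


def load_history(history_dir):
    """Return a list of (epoch, train_total, valid_total), sorted by epoch.

    Assumes each file holds one JSON object with 'train' and 'valid'
    dictionaries that both contain a 'Total' key, as in the diff above.
    """
    rows = []
    for path in Path(history_dir).glob("epoch_*.json"):
        epoch = epoch_number(path.name)
        if epoch is None:
            continue
        with path.open() as f:
            record = json.load(f)
        rows.append((epoch, record["train"]["Total"], record["valid"]["Total"]))
    # Sort numerically: a plain string sort would place epoch_10 before epoch_2,
    # which is the order the file listing in this commit uses.
    return sorted(rows)


def generalization_gap(rows):
    """Return (epoch, valid_total - train_total) for each row."""
    return [(epoch, valid - train) for epoch, train, valid in rows]
```

Called as `load_history("clic/clusters/v1.9.0/history")`, this would yield one row per epoch file; the gap helper gives a quick view of how far the validation total sits above the training total.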
clic/clusters/v1.9.0/hyperparameters.json ADDED
@@ -0,0 +1 @@
+ {"Num of mlpf parameters": 64181266, "config": "parameters/pytorch/pyg-clic.yaml", "prefix": null, "data_dir": "/scratch/project_465000301/tensorflow_datasets", "gpus": 8, "gpu_batch_multiplier": 128, "dataset": "clic", "num_workers": 8, "prefetch_factor": 100, "resume_training": null, "load": "experiments/pyg-clic_20241001_215132_345408/checkpoints/checkpoint-19-2.032002.pth", "train": true, "test": null, "num_epochs": 30, "patience": null, "lr": 0.0001, "conv_type": "attention", "num_convs": null, "make_plots": null, "export_onnx": null, "ntrain": null, "ntest": null, "nvalid": null, "val_freq": null, "checkpoint_freq": 1, "hpo": null, "ray_train": false, "local": null, "ray_cpus": null, "ray_gpus": null, "raytune_num_samples": null, "comet": false, "comet_offline": false, "comet_step_freq": null, "experiments_dir": null, "pipeline": null, "dtype": "bfloat16", "attention_type": null, "test_datasets": {"clic_edm_qq_pf": {"version": "2.2.0"}, "clic_edm_ttbar_pf": {"version": "2.2.0"}, "clic_edm_ww_fullhad_pf": {"version": "2.2.0"}}}
clic/clusters/v1.9.0/mlpf_losses.pkl ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f611636a6eac781983b242839d9f59e501eba3a88bae9add3b8853b542e96ee1
+ size 1272
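The three added lines above are a Git LFS pointer rather than the pickle itself: a spec version URL, an object ID with its hash algorithm, and the size in bytes of the real file. A minimal sketch of reading such a pointer is shown below. `parse_lfs_pointer` is a hypothetical helper written for illustration, not a Git LFS API, and it assumes the simple `key value` line layout seen in this diff.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a small dictionary.

    Each non-empty line is treated as 'key value', split on the first space.
    """
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form '<hash algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real object in bytes
    }


# Pointer contents copied from the mlpf_losses.pkl entry in this diff.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:f611636a6eac781983b242839d9f59e501eba3a88bae9add3b8853b542e96ee1
size 1272
"""

info = parse_lfs_pointer(POINTER)
print(info["hash_algo"], info["size"])
```

The digest and size can be compared against a downloaded copy of the object to confirm that the file was fetched intact.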
clic/clusters/v1.9.0/model_kwargs.pkl ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:cf15ce6c3bae954fd8cf8048452e8e35e1b6d05ab133828cc0a4c60baa6c0dd5
+ size 552
clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_eta.pdf ADDED
Binary file (25.9 kB).
 
clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_eta.png ADDED
clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_pt.pdf ADDED
Binary file (24 kB).
 
clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_pt.png ADDED
clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_pt_log.pdf ADDED
Binary file (25.4 kB).
 
clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_pt_log.png ADDED
clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_res.pdf ADDED
Binary file (33 kB).
 
clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_res.png ADDED
clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_res_bins_0p5_1p5.pdf ADDED
Binary file (25.1 kB).
 
clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_res_bins_0p5_1p5.png ADDED
clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_res_bins_0p5_1p5_vstarget.pdf ADDED
Binary file (27 kB).
 
clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_res_bins_0p5_1p5_vstarget.png ADDED
clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_res_vstarget.pdf ADDED
Binary file (23.6 kB).
 
clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_res_vstarget.png ADDED
clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_response_binned.pdf ADDED
Binary file (28.7 kB).
 
clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_response_binned.png ADDED
clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_response_binned_eta.pdf ADDED
Binary file (26.2 kB).
 
clic/clusters/v1.9.0/plots_checkpoint-26-2.004527/clic_edm_qq_pf/jet_response_binned_eta.png ADDED