Joosep Pata committed
Commit 1ae7f28 · 1 parent: 7f0f139
added model trained on 4M events per dataset
This view is limited to 50 files because it contains too many changes.
- clic/clusters/v2.3.0/README.md +1 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/.gitattributes +4 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/checkpoints/checkpoint-10-2.022640.pth +3 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_1.json +1 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_10.json +1 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_2.json +1 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_3.json +1 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_4.json +1 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_5.json +1 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_6.json +1 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_7.json +1 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_8.json +1 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_9.json +1 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/hyperparameters.json +1 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/model_kwargs.pkl +3 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/overridden_config.yaml +225 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_eta.pdf +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_eta.png +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_pt.pdf +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_pt.png +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_pt_log.pdf +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_pt_log.png +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_res.pdf +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_res.png +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_res_bins_0p5_1p5.pdf +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_res_bins_0p5_1p5.png +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_res_bins_0p5_1p5_vstarget.pdf +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_res_bins_0p5_1p5_vstarget.png +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_res_vstarget.pdf +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_res_vstarget.png +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_10.pdf +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_10.png +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_100.pdf +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_100.png +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_20.pdf +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_20.png +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_200.pdf +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_200.png +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_40.pdf +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_40.png +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_400.pdf +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_400.png +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_60.pdf +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_60.png +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_80.pdf +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_80.png +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_eta.pdf +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_eta.png +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_iqr_over_med_eta.pdf +0 -0
- clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_iqr_over_med_eta.png +0 -0
clic/clusters/v2.3.0/README.md
CHANGED
@@ -3,4 +3,5 @@ pyg-clic_20250204_081614_352844 - legacy GNNLSH model, full dataset, 2 epochs /
```
 pyg-clic_20250211_145811_219129 - transformer, full dataset, 2 epochs / ~45 hours
 pyg-clic_20250130_214007_333962 - transformer + flash attention, full dataset, 10 epochs / ~80 hours
 pyg-clic_20250208_095515_010468 - transformer + flash attention, 1M events from each dataset, 10 epochs / ~12 hours
+pyg-clic_20250209_100514_187330 - transformer + flash attention, 4M events from each dataset, 10 epochs / ~40 hours
```
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/.gitattributes
ADDED
@@ -0,0 +1,4 @@
+plots_checkpoint*/** filter=lfs diff=lfs merge=lfs -text
+preds_checkpoint*/** filter=lfs diff=lfs merge=lfs -text
+runs/** filter=lfs diff=lfs merge=lfs -text
+checkpoints/** filter=lfs diff=lfs merge=lfs -text
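The four patterns in this `.gitattributes` file route matching paths through Git LFS. The following is a rough sketch of which files in this commit they would cover. It assumes Python's `fnmatch` as a stand-in for Git's own pattern matcher, which it is not, and `is_lfs_tracked` is a hypothetical helper name. For an authoritative answer, `git check-attr filter -- <path>` inside the repository is the better tool.

```python
from fnmatch import fnmatch

# Patterns copied from the .gitattributes hunk (paths relative to the run directory).
LFS_PATTERNS = [
    "plots_checkpoint*/**",
    "preds_checkpoint*/**",
    "runs/**",
    "checkpoints/**",
]


def is_lfs_tracked(relative_path: str) -> bool:
    """Approximate check: fnmatch only mimics Git's attribute matching."""
    return any(fnmatch(relative_path, pattern) for pattern in LFS_PATTERNS)


checkpoint_tracked = is_lfs_tracked("checkpoints/checkpoint-10-2.022640.pth")
plot_tracked = is_lfs_tracked("plots_epoch_10/clic_edm_qq_pf/jet_eta.pdf")
print(checkpoint_tracked, plot_tracked)
```

Under this approximation the checkpoint matches `checkpoints/**`, while the `plots_epoch_10/` files match none of the four patterns, since the plot pattern expects a `plots_checkpoint` prefix.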
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/checkpoints/checkpoint-10-2.022640.pth
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:f7b4a648f3b8a5c469daf46d1f04a4e1981132f0c830a18e0330ae5e9ea9bf0f
+size 631766830
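The checkpoint is committed as a Git LFS pointer: three `key value` lines that stand in for the real file. The following is a minimal sketch of reading such a pointer. `parse_lfs_pointer` is a hypothetical helper, not part of any Git LFS tooling, and it handles only the three keys shown in this diff.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algorithm": algorithm,
        "digest": digest,
        "size_bytes": int(fields["size"]),
    }


# Pointer content copied from the checkpoint hunk.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:f7b4a648f3b8a5c469daf46d1f04a4e1981132f0c830a18e0330ae5e9ea9bf0f
size 631766830
"""

info = parse_lfs_pointer(POINTER)
print(f"{info['size_bytes'] / 1e6:.1f} MB")  # prints "631.8 MB"
```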
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_1.json
ADDED
@@ -0,0 +1 @@
+{"train": {"Regression_pt": 0.14345213541666665, "Regression_eta": 0.00018138920084635418, "Regression_sin_phi": 0.00012380649820963543, "Regression_cos_phi": 0.0001234324747721354, "Regression_energy": 0.14386429166666667, "Classification_binary": 2.556795166666667, "Classification": 0.021014436197916667, "ispu": 1.9530105590820312e-05, "MET": 5.339154, "Sliced_Wasserstein_Loss": 46.48268266666667, "Total": 2.8655763333333333}, "valid": {"Regression_pt": 0.12151891304947443, "Regression_eta": 0.00016556585027634096, "Regression_sin_phi": 0.00010927752452673626, "Regression_cos_phi": 0.00010814136957323869, "Regression_energy": 0.12153660913093774, "Classification_binary": 2.2896245954584358, "Classification": 0.01716714167041397, "ispu": 3.655637660323791e-14, "MET": 4.209416707900931, "Sliced_Wasserstein_Loss": 42.2115217449449, "Total": 2.5502346173906068}, "epoch_train_time": 14776.063064575195, "epoch_valid_time": 1052.652112007141, "epoch_total_time": 15828.715177297592}
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_10.json
ADDED
@@ -0,0 +1 @@
+{"train": {"Regression_pt": 0.08632264583333334, "Regression_eta": 0.00012097669474283854, "Regression_sin_phi": 7.552708943684896e-05, "Regression_cos_phi": 7.548623657226563e-05, "Regression_energy": 0.08628893229166666, "Classification_binary": 1.7798005, "Classification": 0.011805893229166666, "ispu": 1.4743171128372978e-15, "MET": 3.1923203333333334, "Sliced_Wasserstein_Loss": 35.96093333333334, "Total": 1.9644896666666667}, "valid": {"Regression_pt": 0.0918824169436049, "Regression_eta": 0.00012829951121741545, "Regression_sin_phi": 8.007459644584129e-05, "Regression_cos_phi": 8.001636282905635e-05, "Regression_energy": 0.09222343545221728, "Classification_binary": 1.8263311005536536, "Classification": 0.011915614825079655, "ispu": 3.528617966985692e-15, "MET": 3.189476110316144, "Sliced_Wasserstein_Loss": 35.713200090938265, "Total": 2.022640118487215}, "epoch_train_time": 14746.338783025742, "epoch_valid_time": 1048.431301832199, "epoch_total_time": 15794.770089626312}
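The checkpoint file name `checkpoint-10-2.022640.pth` appears to encode the epoch and the validation `Total` loss from `history/epoch_10.json`. This naming convention is inferred from this single file and is not documented in the diff, so the sketch below is an illustrative check under that assumption, not a description of the training code.

```python
import re

CHECKPOINT_NAME = "checkpoint-10-2.022640.pth"
# "valid" -> "Total" copied from history/epoch_10.json.
EPOCH_10_VALID_TOTAL = 2.022640118487215

# Assumed pattern: checkpoint-<epoch>-<validation loss>.pth
match = re.fullmatch(r"checkpoint-(\d+)-(\d+\.\d+)\.pth", CHECKPOINT_NAME)
epoch = int(match.group(1))
loss_in_name = float(match.group(2))

# The name carries six decimals, so allow half a unit in the last place.
matches_history = abs(loss_in_name - EPOCH_10_VALID_TOTAL) < 5e-7
print(epoch, loss_in_name, matches_history)
```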
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_2.json
ADDED
@@ -0,0 +1 @@
+{"train": {"Regression_pt": 0.11259471875, "Regression_eta": 0.0001522861124674479, "Regression_sin_phi": 9.933065795898437e-05, "Regression_cos_phi": 9.910172526041666e-05, "Regression_energy": 0.11210575, "Classification_binary": 2.1718713333333333, "Classification": 0.016470404947916666, "ispu": 1.6110830832379482e-13, "MET": 3.9014626666666667, "Sliced_Wasserstein_Loss": 41.21674933333333, "Total": 2.4133845}, "valid": {"Regression_pt": 0.11062613254921365, "Regression_eta": 0.00015335334511737818, "Regression_sin_phi": 9.96017399938193e-05, "Regression_cos_phi": 9.946891607747967e-05, "Regression_energy": 0.11057723651305232, "Classification_binary": 2.1408815997111375, "Classification": 0.015836920370402102, "ispu": 9.730737191254473e-14, "MET": 3.7601486439499303, "Sliced_Wasserstein_Loss": 40.27225981598374, "Total": 2.3782706015299024}, "epoch_train_time": 14780.387635231018, "epoch_valid_time": 1050.3413667678833, "epoch_total_time": 15830.729005336761}
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_3.json
ADDED
@@ -0,0 +1 @@
+{"train": {"Regression_pt": 0.10544566666666666, "Regression_eta": 0.00014501154581705728, "Regression_sin_phi": 9.309078979492188e-05, "Regression_cos_phi": 9.300138346354167e-05, "Regression_energy": 0.1050540625, "Classification_binary": 2.0617886666666667, "Classification": 0.015343759114583333, "ispu": 2.68790264120374e-13, "MET": 3.6933973333333334, "Sliced_Wasserstein_Loss": 39.756704, "Total": 2.2879665}, "valid": {"Regression_pt": 0.10576827087108498, "Regression_eta": 0.00014764572370877505, "Regression_sin_phi": 9.513243106618799e-05, "Regression_cos_phi": 9.496072734678902e-05, "Regression_energy": 0.1057881544755336, "Classification_binary": 2.0440354776265113, "Classification": 0.014765050432395588, "ispu": 1.2218228283550015e-13, "MET": 3.6542022172889697, "Sliced_Wasserstein_Loss": 38.955299828822085, "Total": 2.270691590550444}, "epoch_train_time": 14783.080834150314, "epoch_valid_time": 1051.713621377945, "epoch_total_time": 15834.79445886612}
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_4.json
ADDED
@@ -0,0 +1 @@
+{"train": {"Regression_pt": 0.10056632291666667, "Regression_eta": 0.0001380366007486979, "Regression_sin_phi": 8.740170288085938e-05, "Regression_cos_phi": 8.738942464192709e-05, "Regression_energy": 0.10040232291666666, "Classification_binary": 1.9751141666666667, "Classification": 0.014366264322916667, "ispu": 1.1744162028056356e-13, "MET": 3.5545613333333335, "Sliced_Wasserstein_Loss": 38.72884533333333, "Total": 2.1907466666666666}, "valid": {"Regression_pt": 0.10218279662968399, "Regression_eta": 0.00014112312500140395, "Regression_sin_phi": 8.978832975815895e-05, "Regression_cos_phi": 8.988734348656673e-05, "Regression_energy": 0.10243130577073527, "Classification_binary": 1.979122228388788, "Classification": 0.0139767471399998, "ispu": 6.654092656737552e-14, "MET": 3.607495636969081, "Sliced_Wasserstein_Loss": 37.96207004921365, "Total": 2.1980322897721196}, "epoch_train_time": 14766.17946434021, "epoch_valid_time": 1046.7795479297638, "epoch_total_time": 15812.959015369415}
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_5.json
ADDED
@@ -0,0 +1 @@
+{"train": {"Regression_pt": 0.09685227083333334, "Regression_eta": 0.00013351305135091146, "Regression_sin_phi": 8.390215047200521e-05, "Regression_cos_phi": 8.392649332682291e-05, "Regression_energy": 0.09670648958333333, "Classification_binary": 1.925606, "Classification": 0.013763524739583333, "ispu": 4.683713920409597e-14, "MET": 3.4562666666666666, "Sliced_Wasserstein_Loss": 38.02410666666667, "Total": 2.1332263333333334}, "valid": {"Regression_pt": 0.09853944118308347, "Regression_eta": 0.00013799951511920551, "Regression_sin_phi": 8.69795270331903e-05, "Regression_cos_phi": 8.684615470565448e-05, "Regression_energy": 0.09891279585236239, "Classification_binary": 1.9436084522440356, "Classification": 0.013558235629632586, "ispu": 5.780081176356482e-15, "MET": 3.348571692454798, "Sliced_Wasserstein_Loss": 37.41128838129881, "Total": 2.154925627875254}, "epoch_train_time": 14815.98518037796, "epoch_valid_time": 1051.0379424095154, "epoch_total_time": 15867.023137331009}
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_6.json
ADDED
@@ -0,0 +1 @@
+{"train": {"Regression_pt": 0.09374578125, "Regression_eta": 0.00012966026814778645, "Regression_sin_phi": 8.1197021484375e-05, "Regression_cos_phi": 8.118624369303386e-05, "Regression_energy": 0.09360591666666666, "Classification_binary": 1.8861036666666666, "Classification": 0.013247291666666666, "ispu": 2.0679910524753118e-14, "MET": 3.374214333333333, "Sliced_Wasserstein_Loss": 37.44024266666667, "Total": 2.087004}, "valid": {"Regression_pt": 0.09613845513391663, "Regression_eta": 0.00013378030720452924, "Regression_sin_phi": 8.459722242011506e-05, "Regression_cos_phi": 8.435377886400664e-05, "Regression_energy": 0.09640644302441626, "Classification_binary": 1.9046849089279982, "Classification": 0.013058696798657407, "ispu": 5.780081176356482e-15, "MET": 3.427107293784102, "Sliced_Wasserstein_Loss": 36.837447844228095, "Total": 2.110593848627902}, "epoch_train_time": 14761.964252233505, "epoch_valid_time": 1050.0273101329803, "epoch_total_time": 15811.99156332016}
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_7.json
ADDED
@@ -0,0 +1 @@
+{"train": {"Regression_pt": 0.09120145833333333, "Regression_eta": 0.00012652694702148439, "Regression_sin_phi": 7.902403767903645e-05, "Regression_cos_phi": 7.899856567382812e-05, "Regression_energy": 0.091082625, "Classification_binary": 1.8449146666666667, "Classification": 0.01261265234375, "ispu": 2.9779337514431366e-15, "MET": 3.3050646666666665, "Sliced_Wasserstein_Loss": 36.90396533333333, "Total": 2.0401078333333333}, "valid": {"Regression_pt": 0.09425217691839628, "Regression_eta": 0.0001316729542838452, "Regression_sin_phi": 8.258784379731579e-05, "Regression_cos_phi": 8.238991837049022e-05, "Regression_energy": 0.09451814706381392, "Classification_binary": 1.8677452073526266, "Classification": 0.012429491726487694, "ispu": 0.0, "MET": 3.2817853489087407, "Sliced_Wasserstein_Loss": 36.440167968332084, "Total": 2.069240757395421}, "epoch_train_time": 14761.517078876495, "epoch_valid_time": 1050.0858120918274, "epoch_total_time": 15811.602896690369}
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_8.json
ADDED
@@ -0,0 +1 @@
+{"train": {"Regression_pt": 0.08908064583333333, "Regression_eta": 0.00012395712280273438, "Regression_sin_phi": 7.736606852213541e-05, "Regression_cos_phi": 7.733473714192708e-05, "Regression_energy": 0.08898997916666666, "Classification_binary": 1.8095385, "Classification": 0.012140442708333333, "ispu": 3.6476338986328e-15, "MET": 3.245734666666667, "Sliced_Wasserstein_Loss": 36.46280266666667, "Total": 2.000024}, "valid": {"Regression_pt": 0.09259154006633144, "Regression_eta": 0.00012971040742644987, "Regression_sin_phi": 8.117154637568645e-05, "Regression_cos_phi": 8.10459602117615e-05, "Regression_energy": 0.09286328919038193, "Classification_binary": 1.8419527572349417, "Classification": 0.012070082516240171, "ispu": 7.06212367094189e-15, "MET": 3.2130101486439497, "Sliced_Wasserstein_Loss": 36.224229699368784, "Total": 2.039768358163047}, "epoch_train_time": 14774.942404031754, "epoch_valid_time": 1049.9855282306671, "epoch_total_time": 15824.927933216095}
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/history/epoch_9.json
ADDED
@@ -0,0 +1 @@
+{"train": {"Regression_pt": 0.08741290625, "Regression_eta": 0.00012209530639648437, "Regression_sin_phi": 7.62167714436849e-05, "Regression_cos_phi": 7.61659444173177e-05, "Regression_energy": 0.08735395833333333, "Classification_binary": 1.7886371666666667, "Classification": 0.0119045390625, "ispu": 1.445688309805367e-15, "MET": 3.210315666666667, "Sliced_Wasserstein_Loss": 36.153306666666666, "Total": 1.9755803333333333}, "valid": {"Regression_pt": 0.09195151002034409, "Regression_eta": 0.00012858418764322726, "Regression_sin_phi": 8.051994459838476e-05, "Regression_cos_phi": 8.036921441880243e-05, "Regression_energy": 0.09229617875387824, "Classification_binary": 1.8273226035091472, "Classification": 0.011916637573699282, "ispu": 0.0, "MET": 3.1647733731411147, "Sliced_Wasserstein_Loss": 35.92370881031347, "Total": 2.0237791453808707}, "epoch_train_time": 14748.775985002518, "epoch_valid_time": 1047.9388043880463, "epoch_total_time": 15796.71479010582}
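The ten history files give per-epoch wall time and validation loss, which allows a quick cross-check of the "10 epochs / ~40 hours" figure in the README entry. The sketch below hardcodes the `epoch_total_time` and validation `Total` values from the `history/epoch_N.json` files in this commit. The dictionary names are illustrative only.

```python
# epoch -> epoch_total_time in seconds, copied from history/epoch_N.json.
EPOCH_TOTAL_TIME_S = {
    1: 15828.715177297592,
    2: 15830.729005336761,
    3: 15834.79445886612,
    4: 15812.959015369415,
    5: 15867.023137331009,
    6: 15811.99156332016,
    7: 15811.602896690369,
    8: 15824.927933216095,
    9: 15796.71479010582,
    10: 15794.770089626312,
}

# epoch -> validation "Total" loss, copied from the same files.
VALID_TOTAL = {
    1: 2.5502346173906068,
    2: 2.3782706015299024,
    3: 2.270691590550444,
    4: 2.1980322897721196,
    5: 2.154925627875254,
    6: 2.110593848627902,
    7: 2.069240757395421,
    8: 2.039768358163047,
    9: 2.0237791453808707,
    10: 2.022640118487215,
}

total_hours = sum(EPOCH_TOTAL_TIME_S.values()) / 3600.0
losses = [VALID_TOTAL[epoch] for epoch in sorted(VALID_TOTAL)]
# True if the validation loss drops at every epoch.
monotonic_decrease = all(later < earlier for earlier, later in zip(losses, losses[1:]))

print(f"total wall time: {total_hours:.1f} h")
print(f"validation loss decreases every epoch: {monotonic_decrease}")
```

Summed this way, the run takes roughly 44 hours including validation, in the same range as the README's "~40 hours", and the validation loss falls at every epoch.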
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/hyperparameters.json
ADDED
@@ -0,0 +1 @@
+{"num_mlpf_params": 52630547, "train": true, "test": null, "make_plots": null, "comet": true, "save_attention": true, "dataset": "clic", "sort_data": false, "data_dir": "/scratch/persistent/joosep/tensorflow_datasets", "gpus": 1, "gpu_batch_multiplier": 256, "load": null, "finetune": null, "num_epochs": 10, "patience": 20, "lr": 0.0001, "lr_schedule": "cosinedecay", "conv_type": "attention", "ntrain": 4000000, "ntest": 2000, "nvalid": null, "num_workers": 8, "prefetch_factor": 100, "checkpoint_freq": 1, "comet_name": "particleflow-pt", "comet_offline": false, "comet_step_freq": 100, "dtype": "bfloat16", "val_freq": null, "model": {"trainable": "all", "learned_representation_mode": "last", "input_encoding": "split", "pt_mode": "direct-elemtype-split", "eta_mode": "linear", "sin_phi_mode": "linear", "cos_phi_mode": "linear", "energy_mode": "direct-elemtype-split", "gnn_lsh": {"conv_type": "gnn_lsh", "embedding_dim": 1024, "width": 1024, "num_convs": 3, "activation": "elu", "bin_size": 32, "max_num_bins": 200, "distance_dim": 128, "layernorm": true, "num_node_messages": 2, "ffn_dist_hidden_dim": 128, "ffn_dist_num_layers": 2}, "attention": {"conv_type": "attention", "num_convs": 3, "dropout_ff": 0.0, "dropout_conv_id_mha": 0.0, "dropout_conv_id_ff": 0.0, "dropout_conv_reg_mha": 0.0, "dropout_conv_reg_ff": 0.0, "activation": "relu", "head_dim": 32, "num_heads": 32, "attention_type": "flash", "use_pre_layernorm": true}, "mamba": {"conv_type": "mamba", "embedding_dim": 128, "width": 128, "num_convs": 2, "dropout": 0.0, "activation": "elu", "num_heads": 2, "d_state": 16, "d_conv": 4, "expand": 2}}, "lr_schedule_config": {"onecycle": {"pct_start": 0.3}}, "raytune": {"local_dir": null, "sched": null, "search_alg": null, "default_metric": "val_loss", "default_mode": "min", "asha": {"max_t": 200, "reduction_factor": 4, "brackets": 1, "grace_period": 10}, "hyperband": {"max_t": 200, "reduction_factor": 4}, "hyperopt": {"n_random_steps": 10}, "nevergrad": {"n_random_steps": 10}}, "train_dataset": {"clic": {"physical": {"batch_size": 1, "samples": {"clic_edm_qq_pf": {"version": "2.5.0", "splits": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]}, "clic_edm_ttbar_pf": {"version": "2.5.0", "splits": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]}, "clic_edm_ww_fullhad_pf": {"version": "2.5.0", "splits": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]}}}}}, "valid_dataset": {"clic": {"physical": {"batch_size": 1, "samples": {"clic_edm_qq_pf": {"version": "2.5.0", "splits": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]}, "clic_edm_ttbar_pf": {"version": "2.5.0", "splits": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]}, "clic_edm_ww_fullhad_pf": {"version": "2.5.0", "splits": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]}}}}}, "test_dataset": {"clic_edm_qq_pf": {"version": "2.5.0", "splits": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]}, "clic_edm_ttbar_pf": {"version": "2.5.0", "splits": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]}, "clic_edm_ww_fullhad_pf": {"version": "2.5.0", "splits": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]}}, "enabled_test_datasets": ["clic_edm_qq_pf"]}
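A few derived quantities help put these hyperparameters in context. The sketch below rests on assumptions the diff does not confirm: that the attention width equals `num_heads * head_dim`, that the effective batch is the physical `batch_size` times `gpu_batch_multiplier`, and that `ntrain` applies to each of the three training samples separately (the README entry says "4M events from each dataset"). The variable names are illustrative.

```python
# Values copied from hyperparameters.json.
NUM_HEADS = 32
HEAD_DIM = 32
PHYSICAL_BATCH_SIZE = 1
GPU_BATCH_MULTIPLIER = 256
NTRAIN = 4_000_000
TRAIN_SAMPLES = ["clic_edm_qq_pf", "clic_edm_ttbar_pf", "clic_edm_ww_fullhad_pf"]

# Assumption: the attention layers operate at width num_heads * head_dim.
attention_width = NUM_HEADS * HEAD_DIM

# Assumption: the multiplier scales the physical batch size on each GPU.
events_per_step = PHYSICAL_BATCH_SIZE * GPU_BATCH_MULTIPLIER

# Assumption: ntrain caps each sample separately, per the README wording.
events_per_epoch = NTRAIN * len(TRAIN_SAMPLES)
steps_per_epoch = events_per_epoch // events_per_step

print(attention_width, events_per_step, events_per_epoch, steps_per_epoch)
```

If these assumptions hold, one epoch covers 12 million events in 46,875 optimizer steps, and the attention width of 1024 matches the `embedding_dim` listed for the `gnn_lsh` configuration.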
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/model_kwargs.pkl
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b2784c409dcad68c684a808b0e9c311e9eb8ee97cd7c92d4d4f1771878dd87f5
+size 553
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/overridden_config.yaml
ADDED
@@ -0,0 +1,225 @@
+checkpoint_freq: 1
+comet: true
+comet_name: particleflow-pt
+comet_offline: false
+comet_step_freq: 100
+conv_type: attention
+data_dir: /scratch/persistent/joosep/tensorflow_datasets
+dataset: clic
+dtype: bfloat16
+enabled_test_datasets:
+- clic_edm_qq_pf
+finetune: null
+gpu_batch_multiplier: 256
+gpus: 1
+load: null
+lr: 0.0001
+lr_schedule: cosinedecay
+lr_schedule_config:
+  onecycle:
+    pct_start: 0.3
+make_plots: null
+model:
+  attention:
+    activation: relu
+    attention_type: flash
+    conv_type: attention
+    dropout_conv_id_ff: 0.0
+    dropout_conv_id_mha: 0.0
+    dropout_conv_reg_ff: 0.0
+    dropout_conv_reg_mha: 0.0
+    dropout_ff: 0.0
+    head_dim: 32
+    num_convs: 3
+    num_heads: 32
+    use_pre_layernorm: true
+  cos_phi_mode: linear
+  energy_mode: direct-elemtype-split
+  eta_mode: linear
+  gnn_lsh:
+    activation: elu
+    bin_size: 32
+    conv_type: gnn_lsh
+    distance_dim: 128
+    embedding_dim: 1024
+    ffn_dist_hidden_dim: 128
+    ffn_dist_num_layers: 2
+    layernorm: true
+    max_num_bins: 200
+    num_convs: 3
+    num_node_messages: 2
+    width: 1024
+  input_encoding: split
+  learned_representation_mode: last
+  mamba:
+    activation: elu
+    conv_type: mamba
+    d_conv: 4
+    d_state: 16
+    dropout: 0.0
+    embedding_dim: 128
+    expand: 2
+    num_convs: 2
+    num_heads: 2
+    width: 128
+  pt_mode: direct-elemtype-split
+  sin_phi_mode: linear
+  trainable: all
+ntest: 2000
+ntrain: 4000000
+num_epochs: 10
+num_workers: 8
+nvalid: null
+patience: 20
+prefetch_factor: 100
+raytune:
+  asha:
+    brackets: 1
+    grace_period: 10
+    max_t: 200
+    reduction_factor: 4
+  default_metric: val_loss
+  default_mode: min
+  hyperband:
+    max_t: 200
+    reduction_factor: 4
+  hyperopt:
+    n_random_steps: 10
+  local_dir: null
+  nevergrad:
+    n_random_steps: 10
+  sched: null
+  search_alg: null
+save_attention: true
+sort_data: false
+test: null
+test_dataset:
+  clic_edm_qq_pf:
+    splits:
+    - 1
+    - 2
+    - 3
+    - 4
+    - 5
+    - 6
+    - 7
+    - 8
+    - 9
+    - 10
+    version: 2.5.0
+  clic_edm_ttbar_pf:
+    splits:
+    - 1
+    - 2
+    - 3
+    - 4
+    - 5
+    - 6
+    - 7
+    - 8
+    - 9
+    - 10
+    version: 2.5.0
+  clic_edm_ww_fullhad_pf:
+    splits:
+    - 1
+    - 2
+    - 3
+    - 4
+    - 5
+    - 6
+    - 7
+    - 8
+    - 9
+    - 10
+    version: 2.5.0
+train: true
+train_dataset:
+  clic:
+    physical:
+      batch_size: 1
+      samples:
+        clic_edm_qq_pf:
+          splits:
+          - 1
+          - 2
+          - 3
+          - 4
+          - 5
+          - 6
+          - 7
+          - 8
+          - 9
+          - 10
+          version: 2.5.0
+        clic_edm_ttbar_pf:
+          splits:
+          - 1
+          - 2
+          - 3
+          - 4
+          - 5
+          - 6
+          - 7
+          - 8
+          - 9
+          - 10
+          version: 2.5.0
+        clic_edm_ww_fullhad_pf:
+          splits:
+          - 1
+          - 2
+          - 3
+          - 4
+          - 5
+          - 6
+          - 7
+          - 8
+          - 9
+          - 10
+          version: 2.5.0
+val_freq: null
+valid_dataset:
+  clic:
+    physical:
+      batch_size: 1
+      samples:
+        clic_edm_qq_pf:
+          splits:
+          - 1
+          - 2
+          - 3
+          - 4
+          - 5
+          - 6
+          - 7
+          - 8
+          - 9
+          - 10
+          version: 2.5.0
+        clic_edm_ttbar_pf:
+          splits:
+          - 1
+          - 2
+          - 3
+          - 4
+          - 5
+          - 6
+          - 7
+          - 8
+          - 9
+          - 10
+          version: 2.5.0
+        clic_edm_ww_fullhad_pf:
+          splits:
+          - 1
+          - 2
+          - 3
+          - 4
+          - 5
+          - 6
+          - 7
+          - 8
+          - 9
+          - 10
+          version: 2.5.0
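The configuration sets `lr: 0.0001` with `lr_schedule: cosinedecay`. The `lr_schedule_config` block only carries `onecycle` settings, which presumably go unused here. The scheduler implementation itself is not part of this diff, so the following is only the textbook cosine decay formula, assuming no warmup and a floor of zero. `cosine_decay_lr` and its parameters are hypothetical names.

```python
import math


def cosine_decay_lr(step: int, total_steps: int,
                    base_lr: float = 1e-4, min_lr: float = 0.0) -> float:
    """Textbook cosine decay from base_lr at step 0 to min_lr at total_steps."""
    progress = min(step, total_steps) / total_steps
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


# Learning rate at the start, middle and end of a 1000-step schedule.
total_steps = 1000
for step in (0, total_steps // 2, total_steps):
    print(step, cosine_decay_lr(step, total_steps))
```

With these assumptions the learning rate starts at 1e-4, passes through half that value at the midpoint, and reaches the floor at the final step.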
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_eta.pdf
ADDED
Binary file (24.3 kB)
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_eta.png
ADDED
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_pt.pdf
ADDED
Binary file (22.7 kB)
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_pt.png
ADDED
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_pt_log.pdf
ADDED
Binary file (23.4 kB)
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_pt_log.png
ADDED
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_res.pdf
ADDED
Binary file (25 kB)
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_res.png
ADDED
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_res_bins_0p5_1p5.pdf
ADDED
Binary file (30.5 kB)
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_res_bins_0p5_1p5.png
ADDED
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_res_bins_0p5_1p5_vstarget.pdf
ADDED
Binary file (28.2 kB)
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_res_bins_0p5_1p5_vstarget.png
ADDED
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_res_vstarget.pdf
ADDED
Binary file (25.6 kB)
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_res_vstarget.png
ADDED
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_10.pdf
ADDED
Binary file (18.2 kB)
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_10.png
ADDED
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_100.pdf
ADDED
Binary file (17.7 kB)
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_100.png
ADDED
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_20.pdf
ADDED
Binary file (18.9 kB)
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_20.png
ADDED
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_200.pdf
ADDED
Binary file (12.7 kB)
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_200.png
ADDED
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_40.pdf
ADDED
Binary file (17.6 kB)
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_40.png
ADDED
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_400.pdf
ADDED
Binary file (12.7 kB)
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_400.png
ADDED
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_60.pdf
ADDED
Binary file (15.9 kB)
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_60.png
ADDED
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_80.pdf
ADDED
Binary file (15.8 kB)
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_80.png
ADDED
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_eta.pdf
ADDED
Binary file (23.1 kB)
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_binned_eta.png
ADDED
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_iqr_over_med_eta.pdf
ADDED
Binary file (25.2 kB)
clic/clusters/v2.3.0/pyg-clic_20250209_100514_187330/plots_epoch_10/clic_edm_qq_pf/jet_response_iqr_over_med_eta.png
ADDED
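The file name `jet_response_iqr_over_med_eta` suggests a resolution metric defined as the interquartile range of the jet response divided by its median. The plotting code is not part of this diff, so the sketch below is only an illustration of that metric. It assumes the response is predicted pT over target pT, and `iqr_over_median` and the toy numbers are made up for the example.

```python
from statistics import median, quantiles


def iqr_over_median(responses: list[float]) -> float:
    """Interquartile range divided by the median of the response values."""
    q1, _, q3 = quantiles(responses, n=4, method="inclusive")
    return (q3 - q1) / median(responses)


# Toy jets: predicted and target transverse momenta (arbitrary units).
pred_pt = [8.0, 18.0, 30.0, 44.0, 60.0]
target_pt = [10.0, 20.0, 30.0, 40.0, 50.0]

# Assumed definition of the response: predicted over target pT.
responses = [p / t for p, t in zip(pred_pt, target_pt)]
resolution = iqr_over_median(responses)
print(f"IQR/median = {resolution:.3f}")
```

A narrower response distribution gives a smaller value, so lower is better for this metric.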