MilesCranmer commited on
Commit
ca7756a
1 Parent(s): 53830ff

Add more paper examples

Browse files
Files changed (1) hide show
  1. docs/papers.yml +56 -3
docs/papers.yml CHANGED
@@ -26,10 +26,63 @@ papers:
26
  abstract: While neural networks offer an attractive way to numerically encode functions, actual formulas remain the language of theoretical particle physics. We show how symbolic regression trained on matrix-element information provides, for instance, optimal LHC observables in an easily interpretable form. We introduce the method using the effect of a dimension-6 coefficient on associated ZH production. We then validate it for the known case of CP-violation in weak-boson-fusion Higgs production, including detector effects.
27
  image: back_to_formula.png
28
  date: 2021-09-21
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
29
 
30
  # To add:
31
- # https://arxiv.org/abs/2109.10414v2 - particle physics paper, where they use PySR to discover new analytic formulae from one of the inference pipelines built for analyzing LHC (Large Hadron Collider) data.
32
- # https://doi.org/10.1007/JHEP06(2021)040 - high-energy physics paper, where they discover a formula for the "hyperbolic volume" of a "knot". This is a really interesting paper, although I note I'm not versed in high-energy physics so am unable to explain the specific problem.
33
- # https://arxiv.org/abs/2111.02422v1 - astrophysics paper, where they discover a better way for predicting galaxy properties using properties of the surrounding dark matter halo using PySR.
34
  # https://arxiv.org/abs/2109.04484v1 - astrophysics paper, where they use PySR to discover a more accurate model for the properties of dark matter subhalos in an interpretable way.
35
  # https://arxiv.org/abs/2012.00111 - astrophysics paper, where they use PySR to model assembly bias, and recover a new interpretable model for doing so.
 
26
  abstract: While neural networks offer an attractive way to numerically encode functions, actual formulas remain the language of theoretical particle physics. We show how symbolic regression trained on matrix-element information provides, for instance, optimal LHC observables in an easily interpretable form. We introduce the method using the effect of a dimension-6 coefficient on associated ZH production. We then validate it for the known case of CP-violation in weak-boson-fusion Higgs production, including detector effects.
27
  image: back_to_formula.png
28
  date: 2021-09-21
29
+ - title: Disentangling a deep learned volume formula
30
+ authors:
31
+ - Jessica Craven (1)
32
+ - Vishnu Jejjala (1)
33
+ - Arjun Kar (2)
34
+ affiliations:
35
+ 1: University of the Witwatersrand
36
+ 2: University of British Columbia
37
+ link: https://link.springer.com/article/10.1007/JHEP06(2021)040
38
+ abstract: We present a simple phenomenological formula which approximates the hyperbolic volume of a knot using only a single evaluation of its Jones polynomial at a root of unity. The average error is just 2.86% on the first 1.7 million knots, which represents a large improvement over previous formulas of this kind. To find the approximation formula, we use layer-wise relevance propagation to reverse engineer a black box neural network which achieves a similar average error for the same approximation task when trained on 10% of the total dataset. The particular roots of unity which appear in our analysis cannot be written as e2πi/(k+2) with integer k; therefore, the relevant Jones polynomial evaluations are not given by unknot-normalized expectation values of Wilson loop operators in conventional SU(2) Chern-Simons theory with level k. Instead, they correspond to an analytic continuation of such expectation values to fractional level. We briefly review the continuation procedure and comment on the presence of certain Lefschetz thimbles, to which our approximation formula is sensitive, in the analytically continued Chern-Simons integration cycle.
39
+ image: hyperbolic_volume.png
40
+ date: 2021-06-07
41
+
42
+ # Modeling the galaxy-halo connection with machine learning
43
+ # Ana Maria Delgado, 1
44
+ # Digvijay Wadekar, 2 3
45
+ # Boryana Hadzhiyska,1
46
+ # Sownak Bose,1 7
47
+ # Lars Hernquist,1
48
+ # Shirley Ho 2 4 5 6
49
+ # 1Center for Astrophysics | Harvard & Smithsonian, 60 Garden Street, Cambridge, MA 02138, USA
50
+ # 2Center for Cosmology and Particle Physics, Department of Physics, New York University, New York, NY 10003, USA
51
+ # 3School of Natural Sciences, Institute for Advanced Study, Princeton, NJ 08540, USA
52
+ # 4Center for Computational Astrophysics, Flatiron Institute, 162 5th Ave, New York, NY 10010, USA
53
+ # 5Department of Astrophysical Sciences, Princeton University, Peyton Hall, Princeton NJ 08544-0010, USA
54
+ # 6Department of Physics, Carnegie Mellon University, Pittsburgh, PA 15217, USA
55
+ # 7Institute for Computational Cosmology, Department of Physics, Durham University, Durham DH1 3LE, UK
56
+ # https://arxiv.org/abs/2111.02422v1
57
+ # To extract information from the clustering of galaxies on non-linear scales, we need to model the connection between galaxies and halos accurately and in a flexible manner. Standard halo occupation distribution (HOD) models make the assumption that the galaxy occupation in a halo is a function of only its mass, however, in reality, the occupation can depend on various other parameters including halo concentration, assembly history, environment, spin, etc. Using the IllustrisTNG hydrodynamic simulation as our target, we show that machine learning tools can be used to capture this high-dimensional dependence and provide more accurate galaxy occupation models. Specifically, we use a random forest regressor to identify which secondary halo parameters best model the galaxy-halo connection and symbolic regression to augment the standard HOD model with simple equations capturing the dependence on those parameters, namely the local environmental overdensity and shear, at the location of a halo. This not only provides insights into the galaxy-formation relationship but, more importantly, improves the clustering statistics of the modeled galaxies significantly. Our approach demonstrates that machine learning tools can help us better understand and model the galaxy-halo connection, and are therefore useful for galaxy formation and cosmology studies from upcoming galaxy surveys.
58
+ # hod_importances.png
59
+ # 3 Nov 2021
60
+ - title: Modeling the galaxy-halo connection with machine learning
61
+ authors:
62
+ - Ana Maria Delgado (1)
63
+ - Digvijay Wadekar (2,3)
64
+ - Boryana Hadzhiyska (1)
65
+ - Sownak Bose (1,7)
66
+ - Lars Hernquist (1)
67
+ - Shirley Ho (2,4,5,6)
68
+ affiliations:
69
+ 1: Center for Astrophysics | Harvard & Smithsonian
70
+ 2: New York University
71
+ 3: Institute for Advanced Study
72
+ 4: Flatiron Institute
73
+ 5: Princeton University
74
+ 6: Carnegie Mellon University
75
+ 7: Durham University
76
+ link: https://arxiv.org/abs/2111.02422v1
77
+ abstract: To extract information from the clustering of galaxies on non-linear scales, we need to model the connection between galaxies and halos accurately and in a flexible manner. Standard halo occupation distribution (HOD) models make the assumption that the galaxy occupation in a halo is a function of only its mass, however, in reality, the occupation can depend on various other parameters including halo concentration, assembly history, environment, spin, etc. Using the IllustrisTNG hydrodynamic simulation as our target, we show that machine learning tools can be used to capture this high-dimensional dependence and provide more accurate galaxy occupation models. Specifically, we use a random forest regressor to identify which secondary halo parameters best model the galaxy-halo connection and symbolic regression to augment the standard HOD model with simple equations capturing the dependence on those parameters, namely the local environmental overdensity and shear, at the location of a halo. This not only provides insights into the galaxy-formation relationship but, more importantly, improves the clustering statistics of the modeled galaxies significantly. Our approach demonstrates that machine learning tools can help us better understand and model the galaxy-halo connection, and are therefore useful for galaxy formation and cosmology studies from upcoming galaxy surveys.
78
+ image: hod_importances.png
79
+ date: 2021-11-03
80
+
81
+
82
+
83
+
84
+
85
 
86
  # To add:
 
 
 
87
  # https://arxiv.org/abs/2109.04484v1 - astrophysics paper, where they use PySR to discover a more accurate model for the properties of dark matter subhalos in an interpretable way.
88
  # https://arxiv.org/abs/2012.00111 - astrophysics paper, where they use PySR to model assembly bias, and recover a new interpretable model for doing so.