# This file lists papers that have used PySR, along with the
# information needed to generate the "Research Showcase".
# Entries are ordered by date, newest first. New papers should be added at the top.
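#
# A minimal sketch of the shape of one entry, inferred from the entries below.
# The upper-case values are hypothetical placeholders, not real data, and the
# field list is an assumption based only on the fields the existing entries use:
#
#   - title: PAPER_TITLE
#     authors:
#       - AUTHOR_NAME (1)
#     affiliations:
#       1: INSTITUTION_NAME
#     link: PAPER_URL
#     abstract: ABSTRACT_TEXT
#     image: IMAGE_FILENAME
#     date: YYYY-MM-DD
#
# The number in parentheses after an author refers to a key under "affiliations".
# A value that contains a colon followed by a space needs to be quoted to stay valid YAML.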
papers:
  - title: Machine Learning the Gravity Equation for International Trade
    authors:
      - Sergiy Verstyuk (1)
      - Michael R. Douglas (1)
    affiliations:
      1: Harvard University
    link: https://papers.ssrn.com/abstract=4053795
    abstract: Machine learning (ML) is becoming more and more important throughout the mathematical and theoretical sciences. In this work we apply modern ML methods to gravity models of pairwise interactions in international economics. We explain the formulation of graphical neural networks (GNNs), models for graph-structured data that respect the properties of exchangeability and locality. GNNs are a natural and theoretically appealing class of models for international trade, which we demonstrate empirically by fitting them to a large panel of annual-frequency country-level data. We then use a symbolic regression algorithm to turn our fits into interpretable models with performance comparable to state of the art hand-crafted models motivated by economic theory. The resulting symbolic models contain objects resembling market access functions, which were developed in modern structural literature, but in our analysis arise ab initio without being explicitly postulated. Along the way, we also produce several model-consistent and model-agnostic ML-based measures of bilateral trade accessibility.
    image: economic_theory_gravity.png
    date: 2022-03-15
  - title: Back to the Formula -- LHC Edition
    authors:
      - Anja Butter (1)
      - Tilman Plehn (1)
      - Nathalie Soybelman (1)
      - Johann Brehmer (2)
    affiliations:
      1: Institut für Theoretische Physik, Universität Heidelberg
      2: Center for Data Science, New York University
    link: https://arxiv.org/abs/2109.10414
    abstract: While neural networks offer an attractive way to numerically encode functions, actual formulas remain the language of theoretical particle physics. We show how symbolic regression trained on matrix-element information provides, for instance, optimal LHC observables in an easily interpretable form. We introduce the method using the effect of a dimension-6 coefficient on associated ZH production. We then validate it for the known case of CP-violation in weak-boson-fusion Higgs production, including detector effects.
    image: back_to_formula.png
    date: 2021-09-21
# To add:
# https://arxiv.org/abs/2109.10414v2 - particle physics paper, where they use PySR to discover new analytic formulae from one of the inference pipelines built for analyzing LHC (Large Hadron Collider) data.
# https://doi.org/10.1007/JHEP06(2021)040 - high-energy physics paper, where they discover a formula for the "hyperbolic volume" of a "knot". This is a really interesting paper, although I note I'm not versed in high-energy physics so am unable to explain the specific problem.
# https://arxiv.org/abs/2111.02422v1 - astrophysics paper, where they use PySR to discover a better way to predict galaxy properties from the properties of the surrounding dark matter halo.
# https://arxiv.org/abs/2109.04484v1 - astrophysics paper, where they use PySR to discover a more accurate model for the properties of dark matter subhalos in an interpretable way.
# https://arxiv.org/abs/2012.00111 - astrophysics paper, where they use PySR to model assembly bias, and recover a new interpretable model for doing so.