Talks and Posters

2023

Aug 1, 2023 Lawrence Berkeley National Lab

Fast jet simulations and how to evaluate them

Fast simulations which can accurately model jet substructure are will be of utmost importance for boosted jet analyses at the HL-LHC. There has been significant development recently in generative models for accelerating LHC simulations, but less explored are methods for validating these simulations. We present a rigorous study on evaluation metrics, and discuss the novel Frechet and kernel physics distances as highly sensitive, quantitative metrics for validating not only ML, but potentially also traditional GEANT-based, simulations. We finally introduce our graph network and novel attention-based generative models, which have excellent qualitative and quantitative performance in generating LHC jets, as a case study for the use of these metrics.

Jul 31, 2023 Lawrence Berkeley National Lab

Boosted multi-Higgs with jets measurements in CMS

Characterising double-Higgs production has been a major part of the LHC physics program in Run 2 and beyond. We discuss new techniques and results in boosted, hadronic final states in CMS, with a focus on wide-radius jet taggers and data-driven multi-jet background estimation, as well as measurements of gluon-gluon- and vector-boson-fusion HH production in the 4 beauty quark final state in 138fb^-1 of data at √s = 13 TeV, which observed (expected) a cross section of 9.9 (5.1) relative to the SM prediction and excluded the quartic VVHH coupling κ2V = 0 for the first time. Finally, we look ahead to possible new final states and improvements to triggers and techniques in Run 3.

Jul 24, 2023 Princeton

Machine learning for particle physics simulations

Accurate detector simulations are key components of any measurement or search for new physics. Due to their stochastic nature, ML-based generative models are natural opportunities for fast, differentiable simulations. We present two such graph- and attention-based models for generating LHC-like data using sparse and efficient point cloud representations, with state-of-the-art results. We measure a three-orders-of-magnitude improvement in latency compared to LHC full simulations, and also discuss recent work on evaluation metrics for validating such ML-based fast simulations.

Jun 27, 2023 UC Irvine

Generative transformers and how to evaluate them (+ Lorentz-equivariant networks)

With the increase in luminosity and detector granularity, simulation will be a significant computational challenge in the HL-LHC. To tackle this, I present developments in machine learning graph- and attention-based models for generating jets at the LHC using sparse and efficient point cloud representations of our data, which offer a three-orders-of-magnitude improvement in latency compared to full (Geant4) simulation. I also present studies on metrics for validating ML-based simulations, including the novel Frechet and kernel physics distances, which are found to be highly sensitive to typical mismodelling by ML generative models, and perspectives for future work in this area.

Jun 1, 2023 Virtual

Applications of two-sample goodness-of-fit tests to deep generative models

May 31, 2023 Carnegie Mellon

Multivariate goodness of fit testing for evaluating HEP generative models

May 31, 2023 CERN

Evaluation Metrics for FastSim

May 22, 2023 CERN

Multivariate goodness of fit testing for evaluating HEP generative models

Feb 14, 2023 CERN

Generative transformers and how to evaluate them

With the increase in luminosity and detector granularity, simulation will be a significant computational challenge in the HL-LHC. To tackle this, we present developments in machine learning (ML) graph- and attention-based models for generating jets at the LHC using sparse and efficient point cloud representations of our data, which offer a three-orders-of-magnitude improvement in latency compared to full (Geant4) simulation. We also present studies on metrics for validating ML-based simulations, including the novel Frechet and kernel physics distances, which are found to be highly sensitive to typical mismodelling by ML generative models.

2022

Dec 13, 2022 CERN

FastSim on GPUs

Nov 21, 2022 CERN

Generative transformers and how to evaluate them

Nov 1, 2022 Rutgers (Virtual)

On the Evaluation of Generative Models in HEP

Sep 16, 2022 Virtual

JetNet library for machine learning in high energy physics

Sep 14, 2022 Galileo Galilei Institute, Florence

Discussion on Generative Models

Sep 9, 2022 Galileo Galilei Institute, Florence

Particle Cloud Generation with Message Passing GANs

Jul 21, 2022 CERN

Overview and Outlook: Machine Learning for Simulation

Jul 17, 2022 UW Seattle

Particle Cloud Generation with Message Passing GANs

Jul 14, 2022 Fermilab

Machine Learning for LHC Simulations

Apr 10, 2022 New York

Particle Cloud Generation with Message Passing GANs

There has been significant development recently in generative models for accelerating LHC simulations. Work on simulating jets has primarily used image-based representations, which tend to be sparse and of limited resolution. We advocate for the more natural ‘particle cloud’ representation of jets, i.e. as a set of particles in momentum space, and discuss four physics- and computer-vision-inspired metrics: (1) the 1-Wasserstein distance between high- and low-level feature distributions; (2) a new Fréchet ParticleNet Distance; (3) the coverage; and (4) the minimum matching distance as means of quantitatively and holistically evaluating generated particle clouds. We then present our new message-passing generative adversarial network (MPGAN), which has excellent performance on gluon, top quark, and lighter quark jets on all metrics, validated against real samples via bootstrapping as well as existing point cloud generative models, and shows promise for use in high energy physics.

Apr 9, 2022 New York

Search for boosted Higgs boson pair production in the bbVV all-hadronic final state in CMS

We present developments in a search for boosted (pT > 250 GeV) Higgs boson pair production, where one Higgs decay to bb quarks and the other to two vector bosons in the all-hadronic final state. Using data collected by the CMS experiment in 2016—2018, corresponding to 137 inverse femtobarns, we show an expected upper limit on HH pair production using a cut-based analysis and a newly developed H(WW) graph neural network tagger. Such an analysis can provide insight into the trilinear Higgs self-coupling as well as the vector-boson-Higgs couplings.

2021

Dec 3, 2021 NeurIPS 21 (Virtual)

Particle Cloud Generation with Message Passing GANs

In high energy physics (HEP), jets are collections of correlated particles produced ubiquitously in particle collisions such as those at the CERN Large Hadron Collider (LHC). Machine learning (ML)-based generative models, such as generative adversarial networks (GANs), have the potential to significantly accelerate LHC jet simulations. However, despite jets having a natural representation as a set of particles in momentum-space, a.k.a. a particle cloud, there exist no generative models applied to such a dataset. In this work, we introduce a new particle cloud dataset (JetNet), and apply to it existing point cloud GANs. Results are evaluated using (1) 1-Wasserstein distances between high- and low-level feature distributions, (2) a newly developed Fréchet ParticleNet Distance, and (3) the coverage and (4) minimum matching distance metrics. Existing GANs are found to be inadequate for physics applications, hence we develop a new message passing GAN (MPGAN), which outperforms existing point cloud GANs on virtually every metric and shows promise for use in HEP. We propose JetNet as a novel point-cloud-style dataset for the ML community to experiment with, and set MPGAN as a benchmark to improve upon for future generative models. Additionally, to facilitate research and improve accessibility and reproducibility in this area, we release the open-source JetNet Python package with interfaces for particle cloud datasets, implementations for evaluation and loss metrics, and more tools for ML in HEP development.

Nov 28, 2021 South Korea (Virtual)

Particle Cloud Generation with Message Passing GANs

Nov 23, 2021 CERN (Virtual)

Validation Techniques for Machine-Learned FastSim

Nov 8, 2021 CERN (Virtual)

Particle Cloud Generation with Message Passing GANs

Oct 11, 2021 University of Washington (Virtual)

Particle Cloud Generation with Message Passing GANs

Jul 7, 2021 University of Heidelberg (Virtual)

Particle Cloud Generation with Message Passing GANs

There has been significant development recently in generative models for accelerating LHC simulations. Work on simulating jets has primarily used image-based representations, which tend to be sparse and of limited resolution. We advocate for the more natural ‘particle cloud’ representation of jets, i.e. as a set of particles in momentum space, and discuss four physics- and computer-vision-inspired metrics: (1) the 1-Wasserstein distance between high- and low-level feature distributions; (2) a new Fréchet ParticleNet Distance; (3) the coverage; and (4) the minimum matching distance as means of quantitatively and holistically evaluating generated particle clouds. We then present our new message-passing generative adversarial network (MPGAN), which has excellent performance on gluon, top quark, and lighter quark jets on all metrics, evaluated against real samples via bootstrapping as well as existing point cloud GANs, and shows promise for use in HEP.

Jun 23, 2021 Mainz Institute for Theoretical Physics (Virtual)

Particle Cloud Generation with Message Passing GANs

In high energy physics (HEP), jets are collections of correlated particles produced ubiquitously in particle collisions such as those at the CERN Large Hadron Collider (LHC). Machine-learning-based generative models, such as generative adversarial networks (GANs), have the potential to significantly accelerate LHC jet simulations. However, despite jets having a natural representation as a set of particles in momentum-space, a.k.a. a particle cloud, to our knowledge there exist no generative models applied to such a dataset. We introduce a new particle cloud dataset (JetNet), and, due to similarities between particle and point clouds, apply to it existing point cloud GANs. Results are evaluated using (1) the 1-Wasserstein distance between high- and low-level feature distributions, (2) a newly developed Fréchet ParticleNet Distance, and (3) the coverage and (4) minimum matching distance metrics. Existing GANs are found to be inadequate for physics applications, hence we develop a new message passing GAN (MPGAN), which outperforms existing point cloud GANs on virtually every metric and shows promise for use in HEP. We propose JetNet as a novel point-cloud-style dataset for the machine learning community to experiment with, and set MPGAN as a benchmark to improve upon for future generative models.

May 5, 2021 CERN (Virtual)

Sparse Data Generation

Mar 18, 2021 James Madison University (Virtual)

Graph GANs for High Energy Physics Data Generation

James Madison University (Virtual)

Mar 17, 2021 Berkeley Institute for Data Science (Virtual)

Graph GANs for High Energy Physics Data Generation

Feb 2, 2021 Imperial College London (Virtual)

Graph GANs for High Energy Physics Data Generation

Graph-based networks, with their ability to handle sparse, permutation invariant data with complex geometries, have recently proven useful in a variety of disciplines. One of these is high energy physics, where they have been successfully applied to important classification and reconstruction tasks, however have yet to be explored much for generation. We discuss some generative models for simulating datasets like those produced at the CERN Large Hadron Collider (LHC), and focus on a new message-passing graph based generative adversarial network. This approach is demonstrated by training on and generating sparse representations of MNIST images and jets of particles in proton-proton collisions like those at the LHC.