Machine Learning & AI

Connecting simulations to observations with machine learning and simulation-based inference

Modern cosmology and galaxy formation share a problem: the models are too complex for analytic likelihoods, and the simulations are too expensive to run everywhere we need them. Machine learning offers a way through — emulators that learn the mapping from physical parameters to observables, neural networks that extract cosmological information directly from field-level data, and simulation-based (implicit likelihood) inference that turns suites of simulations into rigorous posteriors on the parameters of cosmology and galaxy formation.

Much of this work builds on the CAMELS project — thousands of cosmological simulations with varied cosmology and feedback physics, designed from the start as a machine-learning training set (Villaescusa-Navarro et al., 2021; Villaescusa-Navarro et al., 2023). The image above shows gas surface-density maps from a sample of these simulations. With suites like these we can perform robust field-level inference of cosmological parameters (Shao et al., 2023) and, conversely, use observations to calibrate the subgrid physics of the simulations themselves (Jo et al., 2023).

On the methods side, we develop tools and generative models for the community: the LtU-ILI framework packages the full simulation-based inference pipeline for astrophysical applications (Ho et al., 2024), autoregressive networks generate halo catalogs directly (Pandey et al., 2025), and hybrid physics-informed approaches like sapphire aim to combine the interpretability of analytic models with the flexibility of machine learning (Pandya et al., 2026).

References

2026

  1. arXiv
    Introducing sapphire: Towards Hybrid Physics-Informed, Data-Driven Modeling of Galaxy Formation
    Viraj Pandya, Greg L. Bryan, T. Lucas Makinen, and 18 more authors
    arXiv e-prints, Apr 2026

2025

  1. PRD
    Creating halos with autoregressive multistage networks
    Shivam Pandey, Chirag Modi, Benjamin D. Wandelt, and 7 more authors
    Physical Review D, Nov 2025

2024

  1. OJAp
    LtU-ILI: An All-in-One Framework for Implicit Inference in Astrophysics and Cosmology
    Matthew Ho, Deaglan J. Bartlett, Nicolas Chartier, and 12 more authors
    The Open Journal of Astrophysics, Jul 2024

2023

  1. ApJS
    The CAMELS Project: Public Data Release
    Francisco Villaescusa-Navarro, Shy Genel, Daniel Anglés-Alcázar, and 45 more authors
    The Astrophysical Journal Supplement Series, Apr 2023
  2. ApJ
    Robust Field-level Inference of Cosmological Parameters with Dark Matter Halos
    Helen Shao, Francisco Villaescusa-Navarro, Pablo Villanueva-Domingo, and 13 more authors
    The Astrophysical Journal, Feb 2023
  3. ApJ
    Calibrating Cosmological Simulations with Implicit Likelihood Inference Using Galaxy Growth Observables
    Yongseok Jo, Shy Genel, Benjamin Wandelt, and 7 more authors
    The Astrophysical Journal, Feb 2023

2021

  1. ApJ
    The CAMELS Project: Cosmology and Astrophysics with Machine-learning Simulations
    Francisco Villaescusa-Navarro, Daniel Anglés-Alcázar, Shy Genel, and 19 more authors
    The Astrophysical Journal, Jul 2021