The scatter in the galaxy-halo connection: a machine learning analysis

We apply machine learning (ML), a powerful method for uncovering complex correlations in high-dimensional data, to the galaxy-halo connection of cosmological hydrodynamical simulations. The mapping between galaxy and halo variables is stochastic in the absence of perfect information, but conventiona...

Celý popis

Podrobná bibliografie
Hlavní autoři: Stiskalek, R, Bartlett, DJ, Desmond, H, Anbajagane, D
Médium: Journal article
Jazyk:English
Vydáno: Oxford University Press 2022
_version_ 1826309400113971200
author Stiskalek, R
Bartlett, DJ
Desmond, H
Anbajagane, D
author_facet Stiskalek, R
Bartlett, DJ
Desmond, H
Anbajagane, D
author_sort Stiskalek, R
collection OXFORD
description We apply machine learning (ML), a powerful method for uncovering complex correlations in high-dimensional data, to the galaxy-halo connection of cosmological hydrodynamical simulations. The mapping between galaxy and halo variables is stochastic in the absence of perfect information, but conventional ML models are deterministic and hence cannot capture its intrinsic scatter. To overcome this limitation, we design an ensemble of neural networks with a Gaussian loss function that predict probability distributions, allowing us to model statistical uncertainties in the galaxy-halo connection as well as its best-fitting trends. We extract a number of galaxy and halo variables from the Horizon-AGN and IllustrisTNG100-1 simulations and quantify the extent to which knowledge of some subset of one enables prediction of the other. This allows us to identify the key features of the galaxy-halo connection and investigate the origin of its scatter in various projections. We find that while halo properties beyond mass account for up to 50 per cent of the scatter in the halo-To-stellar mass relation, the prediction of stellar half-mass radius or total gas mass is not substantially improved by adding further halo properties. We also use these results to investigate semi-Analytic models for galaxy size in the two simulations, finding that assumptions relating galaxy size to halo size or spin are not successful.
first_indexed 2024-03-07T07:33:38Z
format Journal article
id oxford-uuid:9895d184-6f23-4dd8-ae2a-bcc89c98eaee
institution University of Oxford
language English
last_indexed 2024-03-07T07:33:38Z
publishDate 2022
publisher Oxford University Press
record_format dspace
spelling oxford-uuid:9895d184-6f23-4dd8-ae2a-bcc89c98eaee2023-02-20T11:47:20ZThe scatter in the galaxy-halo connection: a machine learning analysisJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:9895d184-6f23-4dd8-ae2a-bcc89c98eaeeEnglishSymplectic ElementsOxford University Press2022Stiskalek, RBartlett, DJDesmond, HAnbajagane, DWe apply machine learning (ML), a powerful method for uncovering complex correlations in high-dimensional data, to the galaxy-halo connection of cosmological hydrodynamical simulations. The mapping between galaxy and halo variables is stochastic in the absence of perfect information, but conventional ML models are deterministic and hence cannot capture its intrinsic scatter. To overcome this limitation, we design an ensemble of neural networks with a Gaussian loss function that predict probability distributions, allowing us to model statistical uncertainties in the galaxy-halo connection as well as its best-fitting trends. We extract a number of galaxy and halo variables from the Horizon-AGN and IllustrisTNG100-1 simulations and quantify the extent to which knowledge of some subset of one enables prediction of the other. This allows us to identify the key features of the galaxy-halo connection and investigate the origin of its scatter in various projections. We find that while halo properties beyond mass account for up to 50 per cent of the scatter in the halo-To-stellar mass relation, the prediction of stellar half-mass radius or total gas mass is not substantially improved by adding further halo properties. We also use these results to investigate semi-Analytic models for galaxy size in the two simulations, finding that assumptions relating galaxy size to halo size or spin are not successful.
spellingShingle Stiskalek, R
Bartlett, DJ
Desmond, H
Anbajagane, D
The scatter in the galaxy-halo connection: a machine learning analysis
title The scatter in the galaxy-halo connection: a machine learning analysis
title_full The scatter in the galaxy-halo connection: a machine learning analysis
title_fullStr The scatter in the galaxy-halo connection: a machine learning analysis
title_full_unstemmed The scatter in the galaxy-halo connection: a machine learning analysis
title_short The scatter in the galaxy-halo connection: a machine learning analysis
title_sort scatter in the galaxy halo connection a machine learning analysis
work_keys_str_mv AT stiskalekr thescatterinthegalaxyhaloconnectionamachinelearninganalysis
AT bartlettdj thescatterinthegalaxyhaloconnectionamachinelearninganalysis
AT desmondh thescatterinthegalaxyhaloconnectionamachinelearninganalysis
AT anbajaganed thescatterinthegalaxyhaloconnectionamachinelearninganalysis
AT stiskalekr scatterinthegalaxyhaloconnectionamachinelearninganalysis
AT bartlettdj scatterinthegalaxyhaloconnectionamachinelearninganalysis
AT desmondh scatterinthegalaxyhaloconnectionamachinelearninganalysis
AT anbajaganed scatterinthegalaxyhaloconnectionamachinelearninganalysis