The sensitivity of GPz estimates of photo-z posterior PDFs to realistically complex training set imperfections

The accurate estimation of photometric redshifts is crucial to many upcoming galaxy surveys, for example, the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST). Almost all Rubin extragalactic and cosmological science requires accurate and precise calculation of photometric redshifts;...

Full description

Bibliographic Details
Main Authors: Stylianou, N, Malz, AI, Hatfield, P, Crenshaw, JF, Gschwend, J
Format: Journal article
Language:English
Published: IOP Publishing 2022
_version_ 1797106934298443776
author Stylianou, N
Malz, AI
Hatfield, P
Crenshaw, JF
Gschwend, J
author_facet Stylianou, N
Malz, AI
Hatfield, P
Crenshaw, JF
Gschwend, J
author_sort Stylianou, N
collection OXFORD
description The accurate estimation of photometric redshifts is crucial to many upcoming galaxy surveys, for example, the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST). Almost all Rubin extragalactic and cosmological science requires accurate and precise calculation of photometric redshifts; many diverse approaches to this problem are currently in the process of being developed, validated, and tested. In this work, we use the photometric redshift code GPz to examine two realistically complex training set imperfections scenarios for machine learning based photometric redshift calculation: (i) where the spectroscopic training set has a very different distribution in color–magnitude space to the test set, and (ii) where the effect of emission line confusion causes a fraction of the training spectroscopic sample to not have the true redshift. By evaluating the sensitivity of GPz to a range of increasingly severe imperfections, with a range of metrics (both of photo-z point estimates as well as posterior probability distribution functions, PDFs), we quantify the degree to which predictions get worse with higher degrees of degradation. In particular, we find that there is a substantial drop-off in photo-z quality when line-confusion goes above ∼1%, and sample incompleteness below a redshift of 1.5, for an experimental setup using data from the Buzzard Flock synthetic sky catalogs.
first_indexed 2024-03-07T07:09:28Z
format Journal article
id oxford-uuid:9d2874a9-fe2f-4217-b5bc-2a349494bb19
institution University of Oxford
language English
last_indexed 2024-03-07T07:09:28Z
publishDate 2022
publisher IOP Publishing
record_format dspace
spelling oxford-uuid:9d2874a9-fe2f-4217-b5bc-2a349494bb192022-05-30T10:11:46ZThe sensitivity of GPz estimates of photo-z posterior PDFs to realistically complex training set imperfectionsJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:9d2874a9-fe2f-4217-b5bc-2a349494bb19EnglishSymplectic ElementsIOP Publishing2022Stylianou, NMalz, AIHatfield, PCrenshaw, JFGschwend, JThe accurate estimation of photometric redshifts is crucial to many upcoming galaxy surveys, for example, the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST). Almost all Rubin extragalactic and cosmological science requires accurate and precise calculation of photometric redshifts; many diverse approaches to this problem are currently in the process of being developed, validated, and tested. In this work, we use the photometric redshift code GPz to examine two realistically complex training set imperfections scenarios for machine learning based photometric redshift calculation: (i) where the spectroscopic training set has a very different distribution in color–magnitude space to the test set, and (ii) where the effect of emission line confusion causes a fraction of the training spectroscopic sample to not have the true redshift. By evaluating the sensitivity of GPz to a range of increasingly severe imperfections, with a range of metrics (both of photo-z point estimates as well as posterior probability distribution functions, PDFs), we quantify the degree to which predictions get worse with higher degrees of degradation. In particular, we find that there is a substantial drop-off in photo-z quality when line-confusion goes above ∼1%, and sample incompleteness below a redshift of 1.5, for an experimental setup using data from the Buzzard Flock synthetic sky catalogs.
spellingShingle Stylianou, N
Malz, AI
Hatfield, P
Crenshaw, JF
Gschwend, J
The sensitivity of GPz estimates of photo-z posterior PDFs to realistically complex training set imperfections
title The sensitivity of GPz estimates of photo-z posterior PDFs to realistically complex training set imperfections
title_full The sensitivity of GPz estimates of photo-z posterior PDFs to realistically complex training set imperfections
title_fullStr The sensitivity of GPz estimates of photo-z posterior PDFs to realistically complex training set imperfections
title_full_unstemmed The sensitivity of GPz estimates of photo-z posterior PDFs to realistically complex training set imperfections
title_short The sensitivity of GPz estimates of photo-z posterior PDFs to realistically complex training set imperfections
title_sort sensitivity of gpz estimates of photo z posterior pdfs to realistically complex training set imperfections
work_keys_str_mv AT stylianoun thesensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections
AT malzai thesensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections
AT hatfieldp thesensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections
AT crenshawjf thesensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections
AT gschwendj thesensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections
AT stylianoun sensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections
AT malzai sensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections
AT hatfieldp sensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections
AT crenshawjf sensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections
AT gschwendj sensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections