The sensitivity of GPz estimates of photo-z posterior PDFs to realistically complex training set imperfections
The accurate estimation of photometric redshifts is crucial to many upcoming galaxy surveys, for example, the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST). Almost all Rubin extragalactic and cosmological science requires accurate and precise calculation of photometric redshifts;...
Main Authors: | , , , , |
---|---|
Format: | Journal article |
Language: | English |
Published: |
IOP Publishing
2022
|
_version_ | 1797106934298443776 |
---|---|
author | Stylianou, N Malz, AI Hatfield, P Crenshaw, JF Gschwend, J |
author_facet | Stylianou, N Malz, AI Hatfield, P Crenshaw, JF Gschwend, J |
author_sort | Stylianou, N |
collection | OXFORD |
description | The accurate estimation of photometric redshifts is crucial to many upcoming galaxy surveys, for example, the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST). Almost all Rubin extragalactic and cosmological science requires accurate and precise calculation of photometric redshifts; many diverse approaches to this problem are currently in the process of being developed, validated, and tested. In this work, we use the photometric redshift code GPz to examine two realistically complex training set imperfections scenarios for machine learning based photometric redshift calculation: (i) where the spectroscopic training set has a very different distribution in color–magnitude space to the test set, and (ii) where the effect of emission line confusion causes a fraction of the training spectroscopic sample to not have the true redshift. By evaluating the sensitivity of GPz to a range of increasingly severe imperfections, with a range of metrics (both of photo-z point estimates as well as posterior probability distribution functions, PDFs), we quantify the degree to which predictions get worse with higher degrees of degradation. In particular, we find that there is a substantial drop-off in photo-z quality when line-confusion goes above ∼1%, and sample incompleteness below a redshift of 1.5, for an experimental setup using data from the Buzzard Flock synthetic sky catalogs. |
first_indexed | 2024-03-07T07:09:28Z |
format | Journal article |
id | oxford-uuid:9d2874a9-fe2f-4217-b5bc-2a349494bb19 |
institution | University of Oxford |
language | English |
last_indexed | 2024-03-07T07:09:28Z |
publishDate | 2022 |
publisher | IOP Publishing |
record_format | dspace |
spelling | oxford-uuid:9d2874a9-fe2f-4217-b5bc-2a349494bb192022-05-30T10:11:46ZThe sensitivity of GPz estimates of photo-z posterior PDFs to realistically complex training set imperfectionsJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:9d2874a9-fe2f-4217-b5bc-2a349494bb19EnglishSymplectic ElementsIOP Publishing2022Stylianou, NMalz, AIHatfield, PCrenshaw, JFGschwend, JThe accurate estimation of photometric redshifts is crucial to many upcoming galaxy surveys, for example, the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST). Almost all Rubin extragalactic and cosmological science requires accurate and precise calculation of photometric redshifts; many diverse approaches to this problem are currently in the process of being developed, validated, and tested. In this work, we use the photometric redshift code GPz to examine two realistically complex training set imperfections scenarios for machine learning based photometric redshift calculation: (i) where the spectroscopic training set has a very different distribution in color–magnitude space to the test set, and (ii) where the effect of emission line confusion causes a fraction of the training spectroscopic sample to not have the true redshift. By evaluating the sensitivity of GPz to a range of increasingly severe imperfections, with a range of metrics (both of photo-z point estimates as well as posterior probability distribution functions, PDFs), we quantify the degree to which predictions get worse with higher degrees of degradation. In particular, we find that there is a substantial drop-off in photo-z quality when line-confusion goes above ∼1%, and sample incompleteness below a redshift of 1.5, for an experimental setup using data from the Buzzard Flock synthetic sky catalogs. |
spellingShingle | Stylianou, N Malz, AI Hatfield, P Crenshaw, JF Gschwend, J The sensitivity of GPz estimates of photo-z posterior PDFs to realistically complex training set imperfections |
title | The sensitivity of GPz estimates of photo-z posterior PDFs to realistically complex training set imperfections |
title_full | The sensitivity of GPz estimates of photo-z posterior PDFs to realistically complex training set imperfections |
title_fullStr | The sensitivity of GPz estimates of photo-z posterior PDFs to realistically complex training set imperfections |
title_full_unstemmed | The sensitivity of GPz estimates of photo-z posterior PDFs to realistically complex training set imperfections |
title_short | The sensitivity of GPz estimates of photo-z posterior PDFs to realistically complex training set imperfections |
title_sort | sensitivity of gpz estimates of photo z posterior pdfs to realistically complex training set imperfections |
work_keys_str_mv | AT stylianoun thesensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections AT malzai thesensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections AT hatfieldp thesensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections AT crenshawjf thesensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections AT gschwendj thesensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections AT stylianoun sensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections AT malzai sensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections AT hatfieldp sensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections AT crenshawjf sensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections AT gschwendj sensitivityofgpzestimatesofphotozposteriorpdfstorealisticallycomplextrainingsetimperfections |