Identification of Galaxy Shreds in Large Photometric Catalogs Using Convolutional Neural Networks

Contamination from galaxy fragments, identified as sources, is a major issue in large photometric galaxy catalogs. In this paper, we prove that this problem can be easily addressed with computer vision techniques. We use image cutouts to train a convolutional neural network (CNN) to identify catalog...

Full description

Bibliographic Details
Main Authors: Enrico M. Di Teodoro, J. E. G. Peek, John F. Wu
Format: Article
Language:English
Published: IOP Publishing 2023-01-01
Series:The Astronomical Journal
Subjects:
Online Access:https://doi.org/10.3847/1538-3881/acb53a
Description
Summary:Contamination from galaxy fragments, identified as sources, is a major issue in large photometric galaxy catalogs. In this paper, we prove that this problem can be easily addressed with computer vision techniques. We use image cutouts to train a convolutional neural network (CNN) to identify cataloged sources that are in reality just star-formation regions and/or shreds of larger galaxies. The CNN reaches an accuracy ∼98% on our testing data sets. We apply this CNN to galaxy catalogs from three among the largest surveys available today: the Sloan Digital Sky Survey, the DESI Legacy Imaging Surveys, and the Panoramic Survey Telescope and Rapid Response System Survey. We find that, even when strict selection criteria are used, all catalogs still show a ∼5% level of contamination from galaxy shreds. Our CNN gives a simple yet effective solution to clean galaxy catalogs from these contaminants.
ISSN:1538-3881