Data challenges in pulsar searches


Bibliographic Details
Main Author: Van Heerden, E
Other Authors: Roberts, S
Format: Thesis
Published: 2017
_version_ 1826286648034328576
author Van Heerden, E
author2 Roberts, S
author_facet Roberts, S
Van Heerden, E
author_sort Van Heerden, E
collection OXFORD
description <p>Technological advances coupled with a decline in digital storage costs have resulted in a profusion of data being created, collected and consumed. These data give rise to new challenges and opportunities in many disciplines ranging from science and engineering to biology and finance.</p> <p>An example of a future project in radio astronomy that promises both Big Data and Big Discoveries is the Square Kilometre Array (SKA) radio telescope project. Astrophysicists are confident that the Big Data amassed by the SKA will not only answer fundamental questions regarding the Universe but also contain big discoveries not yet postulated. The transformational potential of the SKA and its ensuing data and algorithmic challenges, in particular for the discovery and study of pulsars, drive the research of this thesis.</p> <p>Discovering all pulsars beaming towards Earth is one of the key science goals of the SKA. However, searching for pulsars is extremely difficult because of the intrinsic weakness of their signals, propagation effects and the presence of anthropogenic interference. Numerous techniques have been developed to overcome some of these difficulties and to assist in the quest to find more pulsars. Despite the success of these techniques, the number of pulsars discovered in recent surveys (Swiggum et al. 2014, Lazarus et al. 2015) has fallen well short of the number predicted by pulsar population synthesis models (Lorimer 2011). This shortfall in pulsar detections can be attributed to radio frequency interference (RFI), red noise and scintillation (Lazarus et al. 2015).</p> <p>To investigate and quantify these claims, I first developed a new technique to simulate pulsar search data that contain different types of RFI and varying noise baselines (i.e. red noise). This surrogate modelling technique was then used in a framework that I developed to inexpensively explore the sensitivity of pulsar search pipelines for different noise and RFI settings. The results from this framework highlight the need for algorithms that identify and remove non-stationary variations from the data before RFI excision and searching are performed, in order to limit false positive detections.</p> <p>To address the shortcomings that this framework identified in existing pulsar search pipelines, I developed a new real-time algorithm for excising RFI while simultaneously normalising the variability in time and frequency inherent to pulsar observations. Processing synthetic data with the algorithm expanded the region of the noise/pulsar spin period parameter space in which pulsars can be successfully detected. Furthermore, the algorithm is shown to reduce the number of false positive detections.</p> <p>In conclusion, the insights gained from the work presented in this thesis and the improvements achieved will contribute to the development of a new real-time pulsar search pipeline adept at dealing with the challenges posed by the SKA.</p>
first_indexed 2024-03-07T01:46:51Z
format Thesis
id oxford-uuid:98b329d6-4dbf-4956-9277-4b52fa2971bd
institution University of Oxford
last_indexed 2024-03-07T01:46:51Z
publishDate 2017
record_format dspace
spelling oxford-uuid:98b329d6-4dbf-4956-9277-4b52fa2971bd 2022-03-27T00:08:53Z Data challenges in pulsar searches Thesis http://purl.org/coar/resource_type/c_db06 uuid:98b329d6-4dbf-4956-9277-4b52fa2971bd ORA Deposit 2017 Van Heerden, E Roberts, S Karastergiou, A
spellingShingle Van Heerden, E
Data challenges in pulsar searches
title Data challenges in pulsar searches
title_full Data challenges in pulsar searches
title_fullStr Data challenges in pulsar searches
title_full_unstemmed Data challenges in pulsar searches
title_short Data challenges in pulsar searches
title_sort data challenges in pulsar searches
work_keys_str_mv AT vanheerdene datachallengesinpulsarsearches