An Automated Bioinformatics Pipeline Informing Near-Real-Time Public Health Responses to New HIV Diagnoses in a Statewide HIV Epidemic
Molecular HIV cluster data can guide public health responses towards ending the HIV epidemic. Currently, real-time data integration, analysis, and interpretation are challenging, leading to a delayed public health response. We present a comprehensive methodology for addressing these challenges throu...
Main Authors: | , , , , , , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-03-01
|
Series: | Viruses |
Subjects: | |
Online Access: | https://www.mdpi.com/1999-4915/15/3/737 |
_version_ | 1797608639186337792 |
---|---|
author | Mark Howison Fizza S. Gillani Vlad Novitsky Jon A. Steingrimsson John Fulton Thomas Bertrand Katharine Howe Anna Civitarese Lila Bhattarai Meghan MacAskill Guillermo Ronquillo Joel Hague Casey W. Dunn Utpala Bandy Joseph W. Hogan Rami Kantor |
author_facet | Mark Howison Fizza S. Gillani Vlad Novitsky Jon A. Steingrimsson John Fulton Thomas Bertrand Katharine Howe Anna Civitarese Lila Bhattarai Meghan MacAskill Guillermo Ronquillo Joel Hague Casey W. Dunn Utpala Bandy Joseph W. Hogan Rami Kantor |
author_sort | Mark Howison |
collection | DOAJ |
description | Molecular HIV cluster data can guide public health responses towards ending the HIV epidemic. Currently, real-time data integration, analysis, and interpretation are challenging, leading to a delayed public health response. We present a comprehensive methodology for addressing these challenges through data integration, analysis, and reporting. We integrated heterogeneous data sources across systems and developed an open-source, automatic bioinformatics pipeline that provides molecular HIV cluster data to inform public health responses to new statewide HIV-1 diagnoses, overcoming data management, computational, and analytical challenges. We demonstrate implementation of this pipeline in a statewide HIV epidemic and use it to compare the impact of specific phylogenetic and distance-only methods and datasets on molecular HIV cluster analyses. The pipeline was applied to 18 monthly datasets generated between January 2020 and June 2022 in Rhode Island, USA, that provide statewide molecular HIV data to support routine public health case management by a multi-disciplinary team. The resulting cluster analyses and near-real-time reporting guided public health actions in 37 phylogenetically clustered cases out of 57 new HIV-1 diagnoses. Of the 37, only 21 (57%) clustered by distance-only methods. Through a unique academic-public health partnership, an automated open-source pipeline was developed and applied to prospective, routine analysis of statewide molecular HIV data in near-real-time. This collaboration informed public health actions to optimize disruption of HIV transmission. |
first_indexed | 2024-03-11T05:46:02Z |
format | Article |
id | doaj.art-a74cd0322ca94271af5ec0666b8eff41 |
institution | Directory Open Access Journal |
issn | 1999-4915 |
language | English |
last_indexed | 2024-03-11T05:46:02Z |
publishDate | 2023-03-01 |
publisher | MDPI AG |
record_format | Article |
series | Viruses |
spelling | doaj.art-a74cd0322ca94271af5ec0666b8eff412023-11-17T14:23:38ZengMDPI AGViruses1999-49152023-03-0115373710.3390/v15030737An Automated Bioinformatics Pipeline Informing Near-Real-Time Public Health Responses to New HIV Diagnoses in a Statewide HIV EpidemicMark Howison0Fizza S. Gillani1Vlad Novitsky2Jon A. Steingrimsson3John Fulton4Thomas Bertrand5Katharine Howe6Anna Civitarese7Lila Bhattarai8Meghan MacAskill9Guillermo Ronquillo10Joel Hague11Casey W. Dunn12Utpala Bandy13Joseph W. Hogan14Rami Kantor15Research Improving People’s Lives, Providence, RI 02903, USADepartment of Medicine, Brown University, Providence, RI 02906, USADepartment of Medicine, Brown University, Providence, RI 02906, USADepartment of Biostatistics, Brown University School of Public Health, Providence, RI 02903, USADepartment of Behavioral and Social Sciences, Brown University, Providence, RI 02903, USARhode Island Department of Health, Providence, RI 02908, USARhode Island Department of Health, Providence, RI 02908, USARhode Island Department of Health, Providence, RI 02908, USARhode Island Department of Health, Providence, RI 02908, USARhode Island Department of Health, Providence, RI 02908, USARhode Island Department of Health, Providence, RI 02908, USADepartment of Medicine, Brown University, Providence, RI 02906, USADepartment of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06511, USARhode Island Department of Health, Providence, RI 02908, USADepartment of Biostatistics, Brown University School of Public Health, Providence, RI 02903, USADepartment of Medicine, Brown University, Providence, RI 02906, USAMolecular HIV cluster data can guide public health responses towards ending the HIV epidemic. Currently, real-time data integration, analysis, and interpretation are challenging, leading to a delayed public health response. We present a comprehensive methodology for addressing these challenges through data integration, analysis, and reporting. We integrated heterogeneous data sources across systems and developed an open-source, automatic bioinformatics pipeline that provides molecular HIV cluster data to inform public health responses to new statewide HIV-1 diagnoses, overcoming data management, computational, and analytical challenges. We demonstrate implementation of this pipeline in a statewide HIV epidemic and use it to compare the impact of specific phylogenetic and distance-only methods and datasets on molecular HIV cluster analyses. The pipeline was applied to 18 monthly datasets generated between January 2020 and June 2022 in Rhode Island, USA, that provide statewide molecular HIV data to support routine public health case management by a multi-disciplinary team. The resulting cluster analyses and near-real-time reporting guided public health actions in 37 phylogenetically clustered cases out of 57 new HIV-1 diagnoses. Of the 37, only 21 (57%) clustered by distance-only methods. Through a unique academic-public health partnership, an automated open-source pipeline was developed and applied to prospective, routine analysis of statewide molecular HIV data in near-real-time. This collaboration informed public health actions to optimize disruption of HIV transmission.https://www.mdpi.com/1999-4915/15/3/737molecular HIV clustersphylogeneticsmolecular epidemiologyHIV transmission networkscontact tracingnear-real-time data integration |
spellingShingle | Mark Howison Fizza S. Gillani Vlad Novitsky Jon A. Steingrimsson John Fulton Thomas Bertrand Katharine Howe Anna Civitarese Lila Bhattarai Meghan MacAskill Guillermo Ronquillo Joel Hague Casey W. Dunn Utpala Bandy Joseph W. Hogan Rami Kantor An Automated Bioinformatics Pipeline Informing Near-Real-Time Public Health Responses to New HIV Diagnoses in a Statewide HIV Epidemic Viruses molecular HIV clusters phylogenetics molecular epidemiology HIV transmission networks contact tracing near-real-time data integration |
title | An Automated Bioinformatics Pipeline Informing Near-Real-Time Public Health Responses to New HIV Diagnoses in a Statewide HIV Epidemic |
title_full | An Automated Bioinformatics Pipeline Informing Near-Real-Time Public Health Responses to New HIV Diagnoses in a Statewide HIV Epidemic |
title_fullStr | An Automated Bioinformatics Pipeline Informing Near-Real-Time Public Health Responses to New HIV Diagnoses in a Statewide HIV Epidemic |
title_full_unstemmed | An Automated Bioinformatics Pipeline Informing Near-Real-Time Public Health Responses to New HIV Diagnoses in a Statewide HIV Epidemic |
title_short | An Automated Bioinformatics Pipeline Informing Near-Real-Time Public Health Responses to New HIV Diagnoses in a Statewide HIV Epidemic |
title_sort | automated bioinformatics pipeline informing near real time public health responses to new hiv diagnoses in a statewide hiv epidemic |
topic | molecular HIV clusters phylogenetics molecular epidemiology HIV transmission networks contact tracing near-real-time data integration |
url | https://www.mdpi.com/1999-4915/15/3/737 |
work_keys_str_mv | AT markhowison anautomatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT fizzasgillani anautomatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT vladnovitsky anautomatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT jonasteingrimsson anautomatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT johnfulton anautomatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT thomasbertrand anautomatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT katharinehowe anautomatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT annacivitarese anautomatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT lilabhattarai anautomatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT meghanmacaskill anautomatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT guillermoronquillo anautomatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT joelhague anautomatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT caseywdunn anautomatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT utpalabandy anautomatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT josephwhogan anautomatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT ramikantor anautomatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT markhowison automatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT fizzasgillani automatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT vladnovitsky automatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT jonasteingrimsson automatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT johnfulton automatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT thomasbertrand automatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT katharinehowe automatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT annacivitarese automatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT lilabhattarai automatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT meghanmacaskill automatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT guillermoronquillo automatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT joelhague automatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT caseywdunn automatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT utpalabandy automatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT josephwhogan automatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic AT ramikantor automatedbioinformaticspipelineinformingnearrealtimepublichealthresponsestonewhivdiagnosesinastatewidehivepidemic |