FORENSIC: an Online Platform for Fecal Source Identification

ABSTRACT Sewage overflows, agricultural runoff, and stormwater discharges introduce fecal pollution into surface waters. Distinguishing these sources is critical for evaluating water quality and formulating remediation strategies. With the falling costs of sequencing, microbial community-based water...

Full description

Bibliographic Details
Main Authors: Adélaïde Roguet, Özcan C. Esen, A. Murat Eren, Ryan J. Newton, Sandra L. McLellan
Format: Article
Language:English
Published: American Society for Microbiology 2020-04-01
Series:mSystems
Subjects:
Online Access:https://journals.asm.org/doi/10.1128/mSystems.00869-19
_version_ 1818344790536749056
author Adélaïde Roguet
Özcan C. Esen
A. Murat Eren
Ryan J. Newton
Sandra L. McLellan
author_facet Adélaïde Roguet
Özcan C. Esen
A. Murat Eren
Ryan J. Newton
Sandra L. McLellan
author_sort Adélaïde Roguet
collection DOAJ
description ABSTRACT Sewage overflows, agricultural runoff, and stormwater discharges introduce fecal pollution into surface waters. Distinguishing these sources is critical for evaluating water quality and formulating remediation strategies. With the falling costs of sequencing, microbial community-based water quality assessment tools are under development. However, their application is limited by the need to build reference libraries, which requires extensive sampling of sources and bioinformatic expertise. Here, we introduce FORest Enteric Source IdentifiCation (FORENSIC; https://forensic.sfs.uwm.edu/), an online, library-independent source tracking platform based on random forest classification and 16S rRNA gene amplicon sequences to identify in environmental samples common fecal contamination sources, including humans, domestic pets, and agricultural animals. FORENSIC relies on a broad reference signature database of Bacteroidales and Clostridiales, two predominant bacterial groups that have coevolved with their hosts. As a result, these groups demonstrate cohesive and reliable assemblage patterns within mammalian species or among species sharing the same diet/physiology. We created a scalable and extensible platform that we tested for global applicability using samples collected in distant geographic locations. This Web application offers a fast and intuitive approach for fecal source identification, particularly in sewage-contaminated waters. IMPORTANCE FORENSIC is an online platform to identify sources of fecal pollution without the need to create reference libraries. FORENSIC is based on the ability of random forest classification to extract cohesive source microbial signatures to create classifiers despite individual variability and to detect the signatures in environmental samples. We primarily focused on defining sewage signals, which are associated with a high human health risk in polluted waters. To test for fecal contamination sources, the platform only requires paired-end reads targeting the V4 or V6 regions of the 16S rRNA gene. We demonstrated that we could use V4V5 reads trimmed to the V4 positions to generate the reference signature. The systematic workflow we describe to create and validate the signatures could be applied to many disciplines. With the increasing gap between advancing technology and practical applications, this platform makes sequence-based water quality assessments accessible to the public health and water resource communities.
first_indexed 2024-12-13T16:52:05Z
format Article
id doaj.art-9a6d8a770d1f45e0a5d41fa02b998b1c
institution Directory Open Access Journal
issn 2379-5077
language English
last_indexed 2024-12-13T16:52:05Z
publishDate 2020-04-01
publisher American Society for Microbiology
record_format Article
series mSystems
spelling doaj.art-9a6d8a770d1f45e0a5d41fa02b998b1c2022-12-21T23:38:00ZengAmerican Society for MicrobiologymSystems2379-50772020-04-015210.1128/mSystems.00869-19FORENSIC: an Online Platform for Fecal Source IdentificationAdélaïde Roguet0Özcan C. Esen1A. Murat Eren2Ryan J. Newton3Sandra L. McLellan4School of Freshwater Sciences, University of Wisconsin–Milwaukee, Milwaukee, Wisconsin, USADepartment of Medicine, University of Chicago, Chicago, Illinois, USADepartment of Medicine, University of Chicago, Chicago, Illinois, USASchool of Freshwater Sciences, University of Wisconsin–Milwaukee, Milwaukee, Wisconsin, USASchool of Freshwater Sciences, University of Wisconsin–Milwaukee, Milwaukee, Wisconsin, USAABSTRACT Sewage overflows, agricultural runoff, and stormwater discharges introduce fecal pollution into surface waters. Distinguishing these sources is critical for evaluating water quality and formulating remediation strategies. With the falling costs of sequencing, microbial community-based water quality assessment tools are under development. However, their application is limited by the need to build reference libraries, which requires extensive sampling of sources and bioinformatic expertise. Here, we introduce FORest Enteric Source IdentifiCation (FORENSIC; https://forensic.sfs.uwm.edu/), an online, library-independent source tracking platform based on random forest classification and 16S rRNA gene amplicon sequences to identify in environmental samples common fecal contamination sources, including humans, domestic pets, and agricultural animals. FORENSIC relies on a broad reference signature database of Bacteroidales and Clostridiales, two predominant bacterial groups that have coevolved with their hosts. As a result, these groups demonstrate cohesive and reliable assemblage patterns within mammalian species or among species sharing the same diet/physiology. We created a scalable and extensible platform that we tested for global applicability using samples collected in distant geographic locations. This Web application offers a fast and intuitive approach for fecal source identification, particularly in sewage-contaminated waters. IMPORTANCE FORENSIC is an online platform to identify sources of fecal pollution without the need to create reference libraries. FORENSIC is based on the ability of random forest classification to extract cohesive source microbial signatures to create classifiers despite individual variability and to detect the signatures in environmental samples. We primarily focused on defining sewage signals, which are associated with a high human health risk in polluted waters. To test for fecal contamination sources, the platform only requires paired-end reads targeting the V4 or V6 regions of the 16S rRNA gene. We demonstrated that we could use V4V5 reads trimmed to the V4 positions to generate the reference signature. The systematic workflow we describe to create and validate the signatures could be applied to many disciplines. With the increasing gap between advancing technology and practical applications, this platform makes sequence-based water quality assessments accessible to the public health and water resource communities.https://journals.asm.org/doi/10.1128/mSystems.00869-19microbial source tracking16S rRNA genehigh-throughput sequencingBacteroidalesClostridialesrandom forest classification
spellingShingle Adélaïde Roguet
Özcan C. Esen
A. Murat Eren
Ryan J. Newton
Sandra L. McLellan
FORENSIC: an Online Platform for Fecal Source Identification
mSystems
microbial source tracking
16S rRNA gene
high-throughput sequencing
Bacteroidales
Clostridiales
random forest classification
title FORENSIC: an Online Platform for Fecal Source Identification
title_full FORENSIC: an Online Platform for Fecal Source Identification
title_fullStr FORENSIC: an Online Platform for Fecal Source Identification
title_full_unstemmed FORENSIC: an Online Platform for Fecal Source Identification
title_short FORENSIC: an Online Platform for Fecal Source Identification
title_sort forensic an online platform for fecal source identification
topic microbial source tracking
16S rRNA gene
high-throughput sequencing
Bacteroidales
Clostridiales
random forest classification
url https://journals.asm.org/doi/10.1128/mSystems.00869-19
work_keys_str_mv AT adelaideroguet forensicanonlineplatformforfecalsourceidentification
AT ozcancesen forensicanonlineplatformforfecalsourceidentification
AT amurateren forensicanonlineplatformforfecalsourceidentification
AT ryanjnewton forensicanonlineplatformforfecalsourceidentification
AT sandralmclellan forensicanonlineplatformforfecalsourceidentification