Genome analysis of Legionella pneumophila strains using a mixed-genome microarray.

BACKGROUND: Legionella, the causative agent for Legionnaires' disease, is ubiquitous in both natural and man-made aquatic environments. The distribution of Legionella genotypes within clinical strains is significantly different from that found in environmental strains. Developing novel genotypi...

Full description

Bibliographic Details
Main Authors: Sjoerd M Euser, Nico J Nagelkerke, Frank Schuren, Ruud Jansen, Jeroen W Den Boer
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2012-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3475688?pdf=render
_version_ 1831599662224637952
author Sjoerd M Euser
Nico J Nagelkerke
Frank Schuren
Ruud Jansen
Jeroen W Den Boer
author_facet Sjoerd M Euser
Nico J Nagelkerke
Frank Schuren
Ruud Jansen
Jeroen W Den Boer
author_sort Sjoerd M Euser
collection DOAJ
description BACKGROUND: Legionella, the causative agent for Legionnaires' disease, is ubiquitous in both natural and man-made aquatic environments. The distribution of Legionella genotypes within clinical strains is significantly different from that found in environmental strains. Developing novel genotypic methods that offer the ability to distinguish clinical from environmental strains could help to focus on more relevant (virulent) Legionella species in control efforts. Mixed-genome microarray data can be used to perform a comparative-genome analysis of strain collections, and advanced statistical approaches, such as the Random Forest algorithm are available to process these data. METHODS: Microarray analysis was performed on a collection of 222 Legionella pneumophila strains, which included patient-derived strains from notified cases in The Netherlands in the period 2002-2006 and the environmental strains that were collected during the source investigation for those patients within the Dutch National Legionella Outbreak Detection Programme. The Random Forest algorithm combined with a logistic regression model was used to select predictive markers and to construct a predictive model that could discriminate between strains from different origin: clinical or environmental. RESULTS: Four genetic markers were selected that correctly predicted 96% of the clinical strains and 66% of the environmental strains collected within the Dutch National Legionella Outbreak Detection Programme. CONCLUSIONS: The Random Forest algorithm is well suited for the development of prediction models that use mixed-genome microarray data to discriminate between Legionella strains from different origin. The identification of these predictive genetic markers could offer the possibility to identify virulence factors within the Legionella genome, which in the future may be implemented in the daily practice of controlling Legionella in the public health environment.
first_indexed 2024-12-18T14:28:44Z
format Article
id doaj.art-8ddfb9972d134ca18a61558f1bbb7228
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-18T14:28:44Z
publishDate 2012-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-8ddfb9972d134ca18a61558f1bbb72282022-12-21T21:04:39ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-01710e4743710.1371/journal.pone.0047437Genome analysis of Legionella pneumophila strains using a mixed-genome microarray.Sjoerd M EuserNico J NagelkerkeFrank SchurenRuud JansenJeroen W Den BoerBACKGROUND: Legionella, the causative agent for Legionnaires' disease, is ubiquitous in both natural and man-made aquatic environments. The distribution of Legionella genotypes within clinical strains is significantly different from that found in environmental strains. Developing novel genotypic methods that offer the ability to distinguish clinical from environmental strains could help to focus on more relevant (virulent) Legionella species in control efforts. Mixed-genome microarray data can be used to perform a comparative-genome analysis of strain collections, and advanced statistical approaches, such as the Random Forest algorithm are available to process these data. METHODS: Microarray analysis was performed on a collection of 222 Legionella pneumophila strains, which included patient-derived strains from notified cases in The Netherlands in the period 2002-2006 and the environmental strains that were collected during the source investigation for those patients within the Dutch National Legionella Outbreak Detection Programme. The Random Forest algorithm combined with a logistic regression model was used to select predictive markers and to construct a predictive model that could discriminate between strains from different origin: clinical or environmental. RESULTS: Four genetic markers were selected that correctly predicted 96% of the clinical strains and 66% of the environmental strains collected within the Dutch National Legionella Outbreak Detection Programme. CONCLUSIONS: The Random Forest algorithm is well suited for the development of prediction models that use mixed-genome microarray data to discriminate between Legionella strains from different origin. The identification of these predictive genetic markers could offer the possibility to identify virulence factors within the Legionella genome, which in the future may be implemented in the daily practice of controlling Legionella in the public health environment.http://europepmc.org/articles/PMC3475688?pdf=render
spellingShingle Sjoerd M Euser
Nico J Nagelkerke
Frank Schuren
Ruud Jansen
Jeroen W Den Boer
Genome analysis of Legionella pneumophila strains using a mixed-genome microarray.
PLoS ONE
title Genome analysis of Legionella pneumophila strains using a mixed-genome microarray.
title_full Genome analysis of Legionella pneumophila strains using a mixed-genome microarray.
title_fullStr Genome analysis of Legionella pneumophila strains using a mixed-genome microarray.
title_full_unstemmed Genome analysis of Legionella pneumophila strains using a mixed-genome microarray.
title_short Genome analysis of Legionella pneumophila strains using a mixed-genome microarray.
title_sort genome analysis of legionella pneumophila strains using a mixed genome microarray
url http://europepmc.org/articles/PMC3475688?pdf=render
work_keys_str_mv AT sjoerdmeuser genomeanalysisoflegionellapneumophilastrainsusingamixedgenomemicroarray
AT nicojnagelkerke genomeanalysisoflegionellapneumophilastrainsusingamixedgenomemicroarray
AT frankschuren genomeanalysisoflegionellapneumophilastrainsusingamixedgenomemicroarray
AT ruudjansen genomeanalysisoflegionellapneumophilastrainsusingamixedgenomemicroarray
AT jeroenwdenboer genomeanalysisoflegionellapneumophilastrainsusingamixedgenomemicroarray