Comparison of Source Attribution Methodologies for Human Campylobacteriosis

<i>Campylobacter</i> spp. are the most common cause of bacterial gastrointestinal infection in humans both in Denmark and worldwide. Studies have found microbial subtyping to be a powerful tool for source attribution, but comparisons of different methodologies are limited. In this study,...

Full description

Bibliographic Details
Main Authors: Maja Lykke Brinch, Tine Hald, Lynda Wainaina, Alessandra Merlotti, Daniel Remondini, Clementine Henri, Patrick Murigu Kamau Njage
Format: Article
Language:English
Published: MDPI AG 2023-05-01
Series:Pathogens
Subjects:
Online Access:https://www.mdpi.com/2076-0817/12/6/786
Description
Summary:<i>Campylobacter</i> spp. are the most common cause of bacterial gastrointestinal infection in humans both in Denmark and worldwide. Studies have found microbial subtyping to be a powerful tool for source attribution, but comparisons of different methodologies are limited. In this study, we compare three source attribution approaches (Machine Learning, Network Analysis, and Bayesian modeling) using three types of whole genome sequences (WGS) data inputs (cgMLST, 5-Mers and 7-Mers). We predicted and compared the sources of human campylobacteriosis cases in Denmark. Using 7mer as an input feature provided the best model performance. The network analysis algorithm had a CSC value of <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>78.99</mn><mo>%</mo></mrow></semantics></math></inline-formula> and an F1-score value of <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>67</mn><mo>%</mo></mrow></semantics></math></inline-formula>, while the machine-learning algorithm showed the highest accuracy (<inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>98</mn><mo>%</mo></mrow></semantics></math></inline-formula>). The models attributed between 965 and all of the 1224 human cases to a source (network applying 5mer and machine learning applying 7mer, respectively). Chicken from Denmark was the primary source of human campylobacteriosis with an average percentage probability of attribution of <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>45.8</mn><mo>%</mo></mrow></semantics></math></inline-formula> to <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>65.4</mn><mo>%</mo></mrow></semantics></math></inline-formula>, representing Bayesian with 7mer and machine learning with cgMLST, respectively. Our results indicate that the different source attribution methodologies based on WGS have great potential for the surveillance and source tracking of Campylobacter. The results of such models may support decision makers to prioritize and target interventions.
ISSN:2076-0817