Influenza-like illness surveillance on Twitter through automated learning of naïve language.

Twitter has the potential to be a timely and cost-effective source of data for syndromic surveillance. When speaking of an illness, Twitter users often report a combination of symptoms, rather than a suspected or final diagnosis, using naïve, everyday language. We developed a minimally trained algor...

Full description

Bibliographic Details
Main Authors: Francesco Gesualdo, Giovanni Stilo, Eleonora Agricola, Michaela V Gonfiantini, Elisabetta Pandolfi, Paola Velardi, Alberto E Tozzi
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3853203?pdf=render
_version_ 1818059800223678464
author Francesco Gesualdo
Giovanni Stilo
Eleonora Agricola
Michaela V Gonfiantini
Elisabetta Pandolfi
Paola Velardi
Alberto E Tozzi
author_facet Francesco Gesualdo
Giovanni Stilo
Eleonora Agricola
Michaela V Gonfiantini
Elisabetta Pandolfi
Paola Velardi
Alberto E Tozzi
author_sort Francesco Gesualdo
collection DOAJ
description Twitter has the potential to be a timely and cost-effective source of data for syndromic surveillance. When speaking of an illness, Twitter users often report a combination of symptoms, rather than a suspected or final diagnosis, using naïve, everyday language. We developed a minimally trained algorithm that exploits the abundance of health-related web pages to identify all jargon expressions related to a specific technical term. We then translated an influenza case definition into a Boolean query, each symptom being described by a technical term and all related jargon expressions, as identified by the algorithm. Subsequently, we monitored all tweets that reported a combination of symptoms satisfying the case definition query. In order to geolocalize messages, we defined 3 localization strategies based on codes associated with each tweet. We found a high correlation coefficient between the trend of our influenza-positive tweets and ILI trends identified by US traditional surveillance systems.
first_indexed 2024-12-10T13:22:17Z
format Article
id doaj.art-b99ddb38682d408bbcaf5c8d9351a2c9
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-10T13:22:17Z
publishDate 2013-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-b99ddb38682d408bbcaf5c8d9351a2c92022-12-22T01:47:18ZengPublic Library of Science (PLoS)PLoS ONE1932-62032013-01-01812e8248910.1371/journal.pone.0082489Influenza-like illness surveillance on Twitter through automated learning of naïve language.Francesco GesualdoGiovanni StiloEleonora AgricolaMichaela V GonfiantiniElisabetta PandolfiPaola VelardiAlberto E TozziTwitter has the potential to be a timely and cost-effective source of data for syndromic surveillance. When speaking of an illness, Twitter users often report a combination of symptoms, rather than a suspected or final diagnosis, using naïve, everyday language. We developed a minimally trained algorithm that exploits the abundance of health-related web pages to identify all jargon expressions related to a specific technical term. We then translated an influenza case definition into a Boolean query, each symptom being described by a technical term and all related jargon expressions, as identified by the algorithm. Subsequently, we monitored all tweets that reported a combination of symptoms satisfying the case definition query. In order to geolocalize messages, we defined 3 localization strategies based on codes associated with each tweet. We found a high correlation coefficient between the trend of our influenza-positive tweets and ILI trends identified by US traditional surveillance systems.http://europepmc.org/articles/PMC3853203?pdf=render
spellingShingle Francesco Gesualdo
Giovanni Stilo
Eleonora Agricola
Michaela V Gonfiantini
Elisabetta Pandolfi
Paola Velardi
Alberto E Tozzi
Influenza-like illness surveillance on Twitter through automated learning of naïve language.
PLoS ONE
title Influenza-like illness surveillance on Twitter through automated learning of naïve language.
title_full Influenza-like illness surveillance on Twitter through automated learning of naïve language.
title_fullStr Influenza-like illness surveillance on Twitter through automated learning of naïve language.
title_full_unstemmed Influenza-like illness surveillance on Twitter through automated learning of naïve language.
title_short Influenza-like illness surveillance on Twitter through automated learning of naïve language.
title_sort influenza like illness surveillance on twitter through automated learning of naive language
url http://europepmc.org/articles/PMC3853203?pdf=render
work_keys_str_mv AT francescogesualdo influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage
AT giovannistilo influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage
AT eleonoraagricola influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage
AT michaelavgonfiantini influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage
AT elisabettapandolfi influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage
AT paolavelardi influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage
AT albertoetozzi influenzalikeillnesssurveillanceontwitterthroughautomatedlearningofnaivelanguage