Bot Datasets on Twitter: Analysis and Challenges

The reach and influence of social networks over modern society and its functioning have created new challenges and opportunities to prevent the misuse or tampering of such powerful tools of social interaction. Twitter, a social networking service that specializes in online news and information excha...

Full description

Bibliographic Details
Main Authors: Luis Daniel Samper-Escalante, Octavio Loyola-González, Raúl Monroy, Miguel Angel Medina-Pérez
Format: Article
Language:English
Published: MDPI AG 2021-04-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/11/9/4105
_version_ 1797535620654956544
author Luis Daniel Samper-Escalante
Octavio Loyola-González
Raúl Monroy
Miguel Angel Medina-Pérez
author_facet Luis Daniel Samper-Escalante
Octavio Loyola-González
Raúl Monroy
Miguel Angel Medina-Pérez
author_sort Luis Daniel Samper-Escalante
collection DOAJ
description The reach and influence of social networks over modern society and its functioning have created new challenges and opportunities to prevent the misuse or tampering of such powerful tools of social interaction. Twitter, a social networking service that specializes in online news and information exchange involving billions of users world-wide, has been infested by bots for several years. In this paper, we analyze both public and private databases from the literature of bot detection on Twitter. We summarize their advantages, disadvantages, and differences, recommending which is more suitable to work with depending on the necessities of the researcher. From this analysis, we present five distinct behaviors in automated accounts exhibited across all the bot datasets analyzed from these databases. We measure their level of presence in each dataset using a radar chart for visual comparison. Finally, we identify four challenges that researchers of bot detection on Twitter have to face when using these databases from the literature.
first_indexed 2024-03-10T11:48:00Z
format Article
id doaj.art-3e9d05d397a44c34ae013c71ee9aa633
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-10T11:48:00Z
publishDate 2021-04-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-3e9d05d397a44c34ae013c71ee9aa6332023-11-21T17:55:04ZengMDPI AGApplied Sciences2076-34172021-04-01119410510.3390/app11094105Bot Datasets on Twitter: Analysis and ChallengesLuis Daniel Samper-Escalante0Octavio Loyola-González1Raúl Monroy2Miguel Angel Medina-Pérez3School of Engineering and Sciences, Tecnologico de Monterrey, Puebla 72453, MexicoAltair Management Consultants Corp., 303 Wyman St., Suite 300, Waltham, MA 02451, USASchool of Engineering and Sciences, Tecnologico de Monterrey, Estado de Mexico 52926, MexicoSchool of Engineering and Sciences, Tecnologico de Monterrey, Estado de Mexico 52926, MexicoThe reach and influence of social networks over modern society and its functioning have created new challenges and opportunities to prevent the misuse or tampering of such powerful tools of social interaction. Twitter, a social networking service that specializes in online news and information exchange involving billions of users world-wide, has been infested by bots for several years. In this paper, we analyze both public and private databases from the literature of bot detection on Twitter. We summarize their advantages, disadvantages, and differences, recommending which is more suitable to work with depending on the necessities of the researcher. From this analysis, we present five distinct behaviors in automated accounts exhibited across all the bot datasets analyzed from these databases. We measure their level of presence in each dataset using a radar chart for visual comparison. Finally, we identify four challenges that researchers of bot detection on Twitter have to face when using these databases from the literature.https://www.mdpi.com/2076-3417/11/9/4105bot behaviorbot datasetstwitterdatabase analysisdatabase challenges
spellingShingle Luis Daniel Samper-Escalante
Octavio Loyola-González
Raúl Monroy
Miguel Angel Medina-Pérez
Bot Datasets on Twitter: Analysis and Challenges
Applied Sciences
bot behavior
bot datasets
twitter
database analysis
database challenges
title Bot Datasets on Twitter: Analysis and Challenges
title_full Bot Datasets on Twitter: Analysis and Challenges
title_fullStr Bot Datasets on Twitter: Analysis and Challenges
title_full_unstemmed Bot Datasets on Twitter: Analysis and Challenges
title_short Bot Datasets on Twitter: Analysis and Challenges
title_sort bot datasets on twitter analysis and challenges
topic bot behavior
bot datasets
twitter
database analysis
database challenges
url https://www.mdpi.com/2076-3417/11/9/4105
work_keys_str_mv AT luisdanielsamperescalante botdatasetsontwitteranalysisandchallenges
AT octavioloyolagonzalez botdatasetsontwitteranalysisandchallenges
AT raulmonroy botdatasetsontwitteranalysisandchallenges
AT miguelangelmedinaperez botdatasetsontwitteranalysisandchallenges