Feature extractions and selection of bot detection on Twitter A systematic literature review

Abstract Automated or semiautomated computer programs that imitate humans and/or human behavior in online social networks are known as social bots. Users can be attacked by social bots to achieve several hidden aims, such as spreading information or influencing targets. While researchers develop a...

Full description

Bibliographic Details
Main Authors: Raad Al-azawi, Safaa O. AL-mamory
Format: Article
Language:English
Published: Asociación Española para la Inteligencia Artificial 2022-04-01
Series:Inteligencia Artificial
Subjects:
Online Access:https://journal.iberamia.org/index.php/intartif/article/view/758
_version_ 1811294339886743552
author Raad Al-azawi
Safaa O. AL-mamory
author_facet Raad Al-azawi
Safaa O. AL-mamory
author_sort Raad Al-azawi
collection DOAJ
description Abstract Automated or semiautomated computer programs that imitate humans and/or human behavior in online social networks are known as social bots. Users can be attacked by social bots to achieve several hidden aims, such as spreading information or influencing targets. While researchers develop a variety of methods to detect social media bot accounts, attackers adapt their bots to avoid detection. This field necessitates ongoing growth, particularly in the areas of feature selection and extraction. The study's purpose is to provide an overview of bot attacks on Twitter, shedding light on issues in feature extraction and selection that have a significant impact on the accuracy of bot detection algorithms, and highlighting the weaknesses in training time and dimensionality reduction. To the best of our knowledge, this study is the first systematic literature review based on a preset search-strategy that encompasses literature published between 2018 and 2021 which are concerned with Twitter features (attributes). The key findings of this research are threefold. First, the paper provides an improved taxonomy of feature extraction and selection approaches. Second, it includes a comprehensive overview of approaches for detecting bots in the Twitter platform, particularly machine learning techniques. The percentage was calculated using the proposed taxonomy, with metadata, tweet text, and merging (meta and tweet text) accounting for 37%, 31%, and 32%, respectively. Third, some gaps are also highlighted for further research. The first is that public datasets are not precise or suitable in size. Second, the use of integrated systems and real-time detection is uncommon. Third, detecting each bots category identified separately is needed, rather than detecting all categories of bots using one generic model and the same features' values. Finally, extracting influential features that assist machine learning algorithms in detecting Twitter bots with high accuracy is critical, especially if the type of bot is pre-determined.
first_indexed 2024-04-13T05:14:59Z
format Article
id doaj.art-b41bb42c5ae6440da1447e62cee6488c
institution Directory Open Access Journal
issn 1137-3601
1988-3064
language English
last_indexed 2024-04-13T05:14:59Z
publishDate 2022-04-01
publisher Asociación Española para la Inteligencia Artificial
record_format Article
series Inteligencia Artificial
spelling doaj.art-b41bb42c5ae6440da1447e62cee6488c2022-12-22T03:00:55ZengAsociación Española para la Inteligencia ArtificialInteligencia Artificial1137-36011988-30642022-04-01256910.4114/intartif.vol25iss69pp57-86Feature extractions and selection of bot detection on Twitter A systematic literature reviewRaad Al-azawi0Safaa O. AL-mamory1University of Babylon, IraqCollege of Business Informatics, Bagdad, Iraq Abstract Automated or semiautomated computer programs that imitate humans and/or human behavior in online social networks are known as social bots. Users can be attacked by social bots to achieve several hidden aims, such as spreading information or influencing targets. While researchers develop a variety of methods to detect social media bot accounts, attackers adapt their bots to avoid detection. This field necessitates ongoing growth, particularly in the areas of feature selection and extraction. The study's purpose is to provide an overview of bot attacks on Twitter, shedding light on issues in feature extraction and selection that have a significant impact on the accuracy of bot detection algorithms, and highlighting the weaknesses in training time and dimensionality reduction. To the best of our knowledge, this study is the first systematic literature review based on a preset search-strategy that encompasses literature published between 2018 and 2021 which are concerned with Twitter features (attributes). The key findings of this research are threefold. First, the paper provides an improved taxonomy of feature extraction and selection approaches. Second, it includes a comprehensive overview of approaches for detecting bots in the Twitter platform, particularly machine learning techniques. The percentage was calculated using the proposed taxonomy, with metadata, tweet text, and merging (meta and tweet text) accounting for 37%, 31%, and 32%, respectively. Third, some gaps are also highlighted for further research. The first is that public datasets are not precise or suitable in size. Second, the use of integrated systems and real-time detection is uncommon. Third, detecting each bots category identified separately is needed, rather than detecting all categories of bots using one generic model and the same features' values. Finally, extracting influential features that assist machine learning algorithms in detecting Twitter bots with high accuracy is critical, especially if the type of bot is pre-determined. https://journal.iberamia.org/index.php/intartif/article/view/758Feature selectionFeature extractionSocial MediaMachine LearningDataminingAI
spellingShingle Raad Al-azawi
Safaa O. AL-mamory
Feature extractions and selection of bot detection on Twitter A systematic literature review
Inteligencia Artificial
Feature selection
Feature extraction
Social Media
Machine Learning
Datamining
AI
title Feature extractions and selection of bot detection on Twitter A systematic literature review
title_full Feature extractions and selection of bot detection on Twitter A systematic literature review
title_fullStr Feature extractions and selection of bot detection on Twitter A systematic literature review
title_full_unstemmed Feature extractions and selection of bot detection on Twitter A systematic literature review
title_short Feature extractions and selection of bot detection on Twitter A systematic literature review
title_sort feature extractions and selection of bot detection on twitter a systematic literature review
topic Feature selection
Feature extraction
Social Media
Machine Learning
Datamining
AI
url https://journal.iberamia.org/index.php/intartif/article/view/758
work_keys_str_mv AT raadalazawi featureextractionsandselectionofbotdetectionontwitterasystematicliteraturereview
AT safaaoalmamory featureextractionsandselectionofbotdetectionontwitterasystematicliteraturereview