Feature extractions and selection of bot detection on Twitter A systematic literature review

Abstract Automated or semiautomated computer programs that imitate humans and/or human behavior in online social networks are known as social bots. Users can be attacked by social bots to achieve several hidden aims, such as spreading information or influencing targets. While researchers develop a...

Bibliographic Details
Main Authors:	Raad Al-azawi, Safaa O. AL-mamory
Format:	Article
Language:	English
Published:	Asociación Española para la Inteligencia Artificial 2022-04-01
Series:	Inteligencia Artificial
Subjects:	Feature selection, Feature extraction, Social Media, Machine Learning, Datamining, AI
Online Access:	https://journal.iberamia.org/index.php/intartif/article/view/758

author	Raad Al-azawi Safaa O. AL-mamory
collection	DOAJ
description	Automated or semiautomated computer programs that imitate humans and/or human behavior in online social networks are known as social bots. Social bots can attack users to achieve several hidden aims, such as spreading information or influencing targets. While researchers develop a variety of methods to detect social media bot accounts, attackers adapt their bots to avoid detection. This field requires ongoing development, particularly in the areas of feature selection and extraction. The purpose of the study is to provide an overview of bot attacks on Twitter, shedding light on issues in feature extraction and selection that have a significant impact on the accuracy of bot detection algorithms, and highlighting the weaknesses in training time and dimensionality reduction. To the best of our knowledge, this study is the first systematic literature review based on a preset search strategy that covers literature published between 2018 and 2021 and concerned with Twitter features (attributes). The key findings of this research are threefold. First, the paper provides an improved taxonomy of feature extraction and selection approaches. Second, it includes a comprehensive overview of approaches for detecting bots on the Twitter platform, particularly machine learning techniques. Percentages were calculated using the proposed taxonomy, with metadata, tweet text, and merged (metadata and tweet text) features accounting for 37%, 31%, and 32%, respectively. Third, some gaps are highlighted for further research. The first gap is that public datasets are not precise or suitable in size. Second, the use of integrated systems and real-time detection is uncommon. Third, each identified bot category needs to be detected separately, rather than detecting all categories of bots with one generic model and the same feature values. Finally, extracting influential features that help machine learning algorithms detect Twitter bots with high accuracy is critical, especially if the type of bot is predetermined.
first_indexed	2024-04-13T05:14:59Z
format	Article
id	doaj.art-b41bb42c5ae6440da1447e62cee6488c
institution	Directory Open Access Journal
issn	1137-3601 1988-3064
language	English
last_indexed	2024-04-13T05:14:59Z
publishDate	2022-04-01
publisher	Asociación Española para la Inteligencia Artificial
record_format	Article
series	Inteligencia Artificial
spelling	doaj.art-b41bb42c5ae6440da1447e62cee6488c | 2022-12-22T03:00:55Z | eng | Asociación Española para la Inteligencia Artificial | Inteligencia Artificial | 1137-3601, 1988-3064 | 2022-04-01 | vol. 25, no. 69 | 10.4114/intartif.vol25iss69pp57-86 | Feature extractions and selection of bot detection on Twitter A systematic literature review | Raad Al-azawi (University of Babylon, Iraq), Safaa O. AL-mamory (College of Business Informatics, Bagdad, Iraq) | https://journal.iberamia.org/index.php/intartif/article/view/758 | Feature selection, Feature extraction, Social Media, Machine Learning, Datamining, AI
title	Feature extractions and selection of bot detection on Twitter A systematic literature review
topic	Feature selection, Feature extraction, Social Media, Machine Learning, Datamining, AI
url	https://journal.iberamia.org/index.php/intartif/article/view/758

Similar Items