Identifying Hot Topic Trends in Streaming Text Data Using News Sequential Evolution Model Based on Distributed Representations

Hot topic trends have become increasingly important in the era of social media, as these trends can spread rapidly through online platforms and significantly impact public discourse and behavior. As a result, the scope of distributed representations has expanded in machine learning and natural langu...

Full description

Bibliographic Details
Main Authors: Zohaib Ahmad Khan, Yuanqing Xia, Shahzad Ali, Javed Ali Khan, S. S. Askar, Mohamed Abouhawwash, Nora El-Rashidy
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10243000/
_version_ 1797685356440584192
author Zohaib Ahmad Khan
Yuanqing Xia
Shahzad Ali
Javed Ali Khan
S. S. Askar
Mohamed Abouhawwash
Nora El-Rashidy
author_facet Zohaib Ahmad Khan
Yuanqing Xia
Shahzad Ali
Javed Ali Khan
S. S. Askar
Mohamed Abouhawwash
Nora El-Rashidy
author_sort Zohaib Ahmad Khan
collection DOAJ
description Hot topic trends have become increasingly important in the era of social media, as these trends can spread rapidly through online platforms and significantly impact public discourse and behavior. As a result, the scope of distributed representations has expanded in machine learning and natural language processing. As these approaches can be used to effectively identify and analyze hot topic trends in large datasets. However, previous research has shown that analyzing sequential periods in data streams to detect hot topic trends can be challenging, particularly when dealing with large datasets. Moreover, existing methods often fail to accurately capture the semantic relationships between words over different time periods, limiting their effectiveness in trend prediction and relationship analysis. This paper aims to utilize a distributed representations approach to detect hot topic trends in streaming text data. For this purpose, we build a sequential evolution model for a streaming news website to identify hot topic trends in streaming text data. Additionally, we create a visual display model and knowledge graph to further enhance our proposed approach. To achieve this, we begin by collecting streaming news data from the web and dividing it chronologically into several datasets. In addition, word2vec models are built in different periods for each dataset. Finally, we compare the relationship of any target word in sequential word2vec models and analyze its evolutionary process. Experimental results show that the proposed method can detect hot topic trends and provide a graphical representation of any raw data that cannot be easily designed using traditional methods.
first_indexed 2024-03-12T00:43:58Z
format Article
id doaj.art-3a30d3516fb64c76aae6f7767ef2f70c
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-12T00:43:58Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-3a30d3516fb64c76aae6f7767ef2f70c2023-09-14T23:00:47ZengIEEEIEEE Access2169-35362023-01-0111987879880410.1109/ACCESS.2023.331276410243000Identifying Hot Topic Trends in Streaming Text Data Using News Sequential Evolution Model Based on Distributed RepresentationsZohaib Ahmad Khan0https://orcid.org/0000-0002-9979-2348Yuanqing Xia1https://orcid.org/0000-0002-5977-4911Shahzad Ali2Javed Ali Khan3https://orcid.org/0000-0003-3306-1195S. S. Askar4Mohamed Abouhawwash5https://orcid.org/0000-0003-2846-4707Nora El-Rashidy6School of Automation, Beijing Institute of Technology, Beijing, ChinaSchool of Automation, Beijing Institute of Technology, Beijing, ChinaSchool of Automation, Beijing Institute of Technology, Beijing, ChinaDepartment of Computer Science, Faculty of Physics, Engineering, and Computer Science, University of Hertfordshire, Hatfield, U.K.Department of Statistics and Operation Research, College of Science, King Saud University, Riyadh, Saudi ArabiaDepartment of Computational Mathematics, Science, and Engineering (CMSE), Michigan State University, East Lansing, MI, USAMachine Learning and Information Retrieval Department, Faculty of Artificial Intelligence, Kafrelsheikh University, Kafr El-Sheikh, EgyptHot topic trends have become increasingly important in the era of social media, as these trends can spread rapidly through online platforms and significantly impact public discourse and behavior. As a result, the scope of distributed representations has expanded in machine learning and natural language processing. As these approaches can be used to effectively identify and analyze hot topic trends in large datasets. However, previous research has shown that analyzing sequential periods in data streams to detect hot topic trends can be challenging, particularly when dealing with large datasets. Moreover, existing methods often fail to accurately capture the semantic relationships between words over different time periods, limiting their effectiveness in trend prediction and relationship analysis. This paper aims to utilize a distributed representations approach to detect hot topic trends in streaming text data. For this purpose, we build a sequential evolution model for a streaming news website to identify hot topic trends in streaming text data. Additionally, we create a visual display model and knowledge graph to further enhance our proposed approach. To achieve this, we begin by collecting streaming news data from the web and dividing it chronologically into several datasets. In addition, word2vec models are built in different periods for each dataset. Finally, we compare the relationship of any target word in sequential word2vec models and analyze its evolutionary process. Experimental results show that the proposed method can detect hot topic trends and provide a graphical representation of any raw data that cannot be easily designed using traditional methods.https://ieeexplore.ieee.org/document/10243000/Topic trendsnews sequential evolution modelstream text analysisvisual display modelknowledge graphdistributed representations
spellingShingle Zohaib Ahmad Khan
Yuanqing Xia
Shahzad Ali
Javed Ali Khan
S. S. Askar
Mohamed Abouhawwash
Nora El-Rashidy
Identifying Hot Topic Trends in Streaming Text Data Using News Sequential Evolution Model Based on Distributed Representations
IEEE Access
Topic trends
news sequential evolution model
stream text analysis
visual display model
knowledge graph
distributed representations
title Identifying Hot Topic Trends in Streaming Text Data Using News Sequential Evolution Model Based on Distributed Representations
title_full Identifying Hot Topic Trends in Streaming Text Data Using News Sequential Evolution Model Based on Distributed Representations
title_fullStr Identifying Hot Topic Trends in Streaming Text Data Using News Sequential Evolution Model Based on Distributed Representations
title_full_unstemmed Identifying Hot Topic Trends in Streaming Text Data Using News Sequential Evolution Model Based on Distributed Representations
title_short Identifying Hot Topic Trends in Streaming Text Data Using News Sequential Evolution Model Based on Distributed Representations
title_sort identifying hot topic trends in streaming text data using news sequential evolution model based on distributed representations
topic Topic trends
news sequential evolution model
stream text analysis
visual display model
knowledge graph
distributed representations
url https://ieeexplore.ieee.org/document/10243000/
work_keys_str_mv AT zohaibahmadkhan identifyinghottopictrendsinstreamingtextdatausingnewssequentialevolutionmodelbasedondistributedrepresentations
AT yuanqingxia identifyinghottopictrendsinstreamingtextdatausingnewssequentialevolutionmodelbasedondistributedrepresentations
AT shahzadali identifyinghottopictrendsinstreamingtextdatausingnewssequentialevolutionmodelbasedondistributedrepresentations
AT javedalikhan identifyinghottopictrendsinstreamingtextdatausingnewssequentialevolutionmodelbasedondistributedrepresentations
AT ssaskar identifyinghottopictrendsinstreamingtextdatausingnewssequentialevolutionmodelbasedondistributedrepresentations
AT mohamedabouhawwash identifyinghottopictrendsinstreamingtextdatausingnewssequentialevolutionmodelbasedondistributedrepresentations
AT noraelrashidy identifyinghottopictrendsinstreamingtextdatausingnewssequentialevolutionmodelbasedondistributedrepresentations