Text Mining and Sentiment Analysis of Newspaper Headlines

Text analytics are well-known in the modern era for extracting information and patterns from text. However, no study has attempted to illustrate the pattern and priorities of newspaper headlines in Bangladesh using a combination of text analytics techniques. The purpose of this paper is to examine t...

Full description

Bibliographic Details
Main Authors: Arafat Hossain, Md. Karimuzzaman, Md. Moyazzem Hossain, Azizur Rahman
Format: Article
Language:English
Published: MDPI AG 2021-10-01
Series:Information
Subjects:
Online Access:https://www.mdpi.com/2078-2489/12/10/414
_version_ 1797514304908427264
author Arafat Hossain
Md. Karimuzzaman
Md. Moyazzem Hossain
Azizur Rahman
author_facet Arafat Hossain
Md. Karimuzzaman
Md. Moyazzem Hossain
Azizur Rahman
author_sort Arafat Hossain
collection DOAJ
description Text analytics are well-known in the modern era for extracting information and patterns from text. However, no study has attempted to illustrate the pattern and priorities of newspaper headlines in Bangladesh using a combination of text analytics techniques. The purpose of this paper is to examine the pattern of words that appeared on the front page of a well-known daily English newspaper in Bangladesh, <i>The Daily Star</i>, in 2018 and 2019. The elucidation of that era’s possible social and political context was also attempted using word patterns. The study employs three widely used and contemporary text mining techniques: word clouds, sentiment analysis, and cluster analysis. The word cloud reveals that election, kill, cricket, and Rohingya-related terms appeared more than 60 times in 2018, whereas BNP, poll, kill, AL, and Khaleda appeared more than 80 times in 2019. These indicated the country’s passion for cricket, political turmoil, and Rohingya-related issues. Furthermore, sentiment analysis reveals that words of fear and negative emotions appeared more than 600 times, whereas anger, anticipation, sadness, trust, and positive-type emotions came up more than 400 times in both years. Finally, the clustering method demonstrates that election, politics, deaths, digital security act, Rohingya, and cricket-related words exhibit similarity and belong to a similar group in 2019, whereas rape, deaths, road, and fire-related words clustered in 2018 alongside a similar-appearing group. In general, this analysis demonstrates how vividly the text mining approach depicts Bangladesh’s social, political, and law-and-order situation, particularly during election season and the country’s cricket craze, and also validates the significance of the text mining approach to understanding the overall view of a country during a particular time in an efficient manner.
first_indexed 2024-03-10T06:30:01Z
format Article
id doaj.art-adf957c7b8684267aca16a45f811cbdd
institution Directory Open Access Journal
issn 2078-2489
language English
last_indexed 2024-03-10T06:30:01Z
publishDate 2021-10-01
publisher MDPI AG
record_format Article
series Information
spelling doaj.art-adf957c7b8684267aca16a45f811cbdd2023-11-22T18:37:47ZengMDPI AGInformation2078-24892021-10-01121041410.3390/info12100414Text Mining and Sentiment Analysis of Newspaper HeadlinesArafat Hossain0Md. Karimuzzaman1Md. Moyazzem Hossain2Azizur Rahman3Department of Statistics, Jahangirnagar University, Savar, Dhaka 1342, BangladeshDepartment of Statistics, Jahangirnagar University, Savar, Dhaka 1342, BangladeshDepartment of Statistics, Jahangirnagar University, Savar, Dhaka 1342, BangladeshSchool of Computing, Mathematics and Engineering, Charles Sturt University, Wagga Wagga, NSW 2678, AustraliaText analytics are well-known in the modern era for extracting information and patterns from text. However, no study has attempted to illustrate the pattern and priorities of newspaper headlines in Bangladesh using a combination of text analytics techniques. The purpose of this paper is to examine the pattern of words that appeared on the front page of a well-known daily English newspaper in Bangladesh, <i>The Daily Star</i>, in 2018 and 2019. The elucidation of that era’s possible social and political context was also attempted using word patterns. The study employs three widely used and contemporary text mining techniques: word clouds, sentiment analysis, and cluster analysis. The word cloud reveals that election, kill, cricket, and Rohingya-related terms appeared more than 60 times in 2018, whereas BNP, poll, kill, AL, and Khaleda appeared more than 80 times in 2019. These indicated the country’s passion for cricket, political turmoil, and Rohingya-related issues. Furthermore, sentiment analysis reveals that words of fear and negative emotions appeared more than 600 times, whereas anger, anticipation, sadness, trust, and positive-type emotions came up more than 400 times in both years. Finally, the clustering method demonstrates that election, politics, deaths, digital security act, Rohingya, and cricket-related words exhibit similarity and belong to a similar group in 2019, whereas rape, deaths, road, and fire-related words clustered in 2018 alongside a similar-appearing group. In general, this analysis demonstrates how vividly the text mining approach depicts Bangladesh’s social, political, and law-and-order situation, particularly during election season and the country’s cricket craze, and also validates the significance of the text mining approach to understanding the overall view of a country during a particular time in an efficient manner.https://www.mdpi.com/2078-2489/12/10/414newspaperheadlines pattern and contextword cloudcluster analysissentiment analysisBangladesh
spellingShingle Arafat Hossain
Md. Karimuzzaman
Md. Moyazzem Hossain
Azizur Rahman
Text Mining and Sentiment Analysis of Newspaper Headlines
Information
newspaper
headlines pattern and context
word cloud
cluster analysis
sentiment analysis
Bangladesh
title Text Mining and Sentiment Analysis of Newspaper Headlines
title_full Text Mining and Sentiment Analysis of Newspaper Headlines
title_fullStr Text Mining and Sentiment Analysis of Newspaper Headlines
title_full_unstemmed Text Mining and Sentiment Analysis of Newspaper Headlines
title_short Text Mining and Sentiment Analysis of Newspaper Headlines
title_sort text mining and sentiment analysis of newspaper headlines
topic newspaper
headlines pattern and context
word cloud
cluster analysis
sentiment analysis
Bangladesh
url https://www.mdpi.com/2078-2489/12/10/414
work_keys_str_mv AT arafathossain textminingandsentimentanalysisofnewspaperheadlines
AT mdkarimuzzaman textminingandsentimentanalysisofnewspaperheadlines
AT mdmoyazzemhossain textminingandsentimentanalysisofnewspaperheadlines
AT azizurrahman textminingandsentimentanalysisofnewspaperheadlines