A Study of Chinese News Headline Classification Based on Keyword Feature Expansion

Abstract Existing work generally classifies news headlines as a matter of short text classification. However, due to the strong domain nature and limited text length of news headlines, their classification results are usually determined by several specific keywords, which makes the traditional short...

Full description

Bibliographic Details
Main Authors: Kai Miao, Xin He, Junyang Yu, Guanghui Wang, Yongchao Chen
Format: Article
Language:English
Published: Springer 2023-05-01
Series:International Journal of Computational Intelligence Systems
Subjects:
Online Access:https://doi.org/10.1007/s44196-023-00251-4
_version_ 1797831911814463488
author Kai Miao
Xin He
Junyang Yu
Guanghui Wang
Yongchao Chen
author_facet Kai Miao
Xin He
Junyang Yu
Guanghui Wang
Yongchao Chen
author_sort Kai Miao
collection DOAJ
description Abstract Existing work generally classifies news headlines as a matter of short text classification. However, due to the strong domain nature and limited text length of news headlines, their classification results are usually determined by several specific keywords, which makes the traditional short text classification method ineffective. In this paper, we propose a new method to identify keywords in news headlines and expand their features from sentence level and word level respectively, and finally use convolutional neural networks (CNN) to extract and classify their features. The proposed model was tested on the Sogou News Corpus dataset and achieved 93.42 $$\%$$ % accuracy.
first_indexed 2024-04-09T14:00:26Z
format Article
id doaj.art-cedb3c2f1c2847ba99069b6409853acf
institution Directory Open Access Journal
issn 1875-6883
language English
last_indexed 2024-04-09T14:00:26Z
publishDate 2023-05-01
publisher Springer
record_format Article
series International Journal of Computational Intelligence Systems
spelling doaj.art-cedb3c2f1c2847ba99069b6409853acf2023-05-07T11:23:22ZengSpringerInternational Journal of Computational Intelligence Systems1875-68832023-05-011611910.1007/s44196-023-00251-4A Study of Chinese News Headline Classification Based on Keyword Feature ExpansionKai Miao0Xin He1Junyang Yu2Guanghui Wang3Yongchao Chen4Intelligent Data Processing Engineering Research Center Of Henan Province, School of Software, Henan UniversityIntelligent Data Processing Engineering Research Center Of Henan Province, School of Software, Henan UniversityIntelligent Data Processing Engineering Research Center Of Henan Province, School of Software, Henan UniversityHenan International Joint Laboratory of Intelligent Network Theory and Key Technology, School of Software, Henan UniversityIntelligent Data Processing Engineering Research Center Of Henan Province, School of Software, Henan UniversityAbstract Existing work generally classifies news headlines as a matter of short text classification. However, due to the strong domain nature and limited text length of news headlines, their classification results are usually determined by several specific keywords, which makes the traditional short text classification method ineffective. In this paper, we propose a new method to identify keywords in news headlines and expand their features from sentence level and word level respectively, and finally use convolutional neural networks (CNN) to extract and classify their features. The proposed model was tested on the Sogou News Corpus dataset and achieved 93.42 $$\%$$ % accuracy.https://doi.org/10.1007/s44196-023-00251-4News headline classificationTopic modelFeature weightSemantic extensionCNN
spellingShingle Kai Miao
Xin He
Junyang Yu
Guanghui Wang
Yongchao Chen
A Study of Chinese News Headline Classification Based on Keyword Feature Expansion
International Journal of Computational Intelligence Systems
News headline classification
Topic model
Feature weight
Semantic extension
CNN
title A Study of Chinese News Headline Classification Based on Keyword Feature Expansion
title_full A Study of Chinese News Headline Classification Based on Keyword Feature Expansion
title_fullStr A Study of Chinese News Headline Classification Based on Keyword Feature Expansion
title_full_unstemmed A Study of Chinese News Headline Classification Based on Keyword Feature Expansion
title_short A Study of Chinese News Headline Classification Based on Keyword Feature Expansion
title_sort study of chinese news headline classification based on keyword feature expansion
topic News headline classification
Topic model
Feature weight
Semantic extension
CNN
url https://doi.org/10.1007/s44196-023-00251-4
work_keys_str_mv AT kaimiao astudyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion
AT xinhe astudyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion
AT junyangyu astudyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion
AT guanghuiwang astudyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion
AT yongchaochen astudyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion
AT kaimiao studyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion
AT xinhe studyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion
AT junyangyu studyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion
AT guanghuiwang studyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion
AT yongchaochen studyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion