A Study of Chinese News Headline Classification Based on Keyword Feature Expansion
Abstract Existing work generally classifies news headlines as a matter of short text classification. However, due to the strong domain nature and limited text length of news headlines, their classification results are usually determined by several specific keywords, which makes the traditional short...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Springer
2023-05-01
|
Series: | International Journal of Computational Intelligence Systems |
Subjects: | |
Online Access: | https://doi.org/10.1007/s44196-023-00251-4 |
_version_ | 1797831911814463488 |
---|---|
author | Kai Miao Xin He Junyang Yu Guanghui Wang Yongchao Chen |
author_facet | Kai Miao Xin He Junyang Yu Guanghui Wang Yongchao Chen |
author_sort | Kai Miao |
collection | DOAJ |
description | Abstract Existing work generally classifies news headlines as a matter of short text classification. However, due to the strong domain nature and limited text length of news headlines, their classification results are usually determined by several specific keywords, which makes the traditional short text classification method ineffective. In this paper, we propose a new method to identify keywords in news headlines and expand their features from sentence level and word level respectively, and finally use convolutional neural networks (CNN) to extract and classify their features. The proposed model was tested on the Sogou News Corpus dataset and achieved 93.42 $$\%$$ % accuracy. |
first_indexed | 2024-04-09T14:00:26Z |
format | Article |
id | doaj.art-cedb3c2f1c2847ba99069b6409853acf |
institution | Directory Open Access Journal |
issn | 1875-6883 |
language | English |
last_indexed | 2024-04-09T14:00:26Z |
publishDate | 2023-05-01 |
publisher | Springer |
record_format | Article |
series | International Journal of Computational Intelligence Systems |
spelling | doaj.art-cedb3c2f1c2847ba99069b6409853acf2023-05-07T11:23:22ZengSpringerInternational Journal of Computational Intelligence Systems1875-68832023-05-011611910.1007/s44196-023-00251-4A Study of Chinese News Headline Classification Based on Keyword Feature ExpansionKai Miao0Xin He1Junyang Yu2Guanghui Wang3Yongchao Chen4Intelligent Data Processing Engineering Research Center Of Henan Province, School of Software, Henan UniversityIntelligent Data Processing Engineering Research Center Of Henan Province, School of Software, Henan UniversityIntelligent Data Processing Engineering Research Center Of Henan Province, School of Software, Henan UniversityHenan International Joint Laboratory of Intelligent Network Theory and Key Technology, School of Software, Henan UniversityIntelligent Data Processing Engineering Research Center Of Henan Province, School of Software, Henan UniversityAbstract Existing work generally classifies news headlines as a matter of short text classification. However, due to the strong domain nature and limited text length of news headlines, their classification results are usually determined by several specific keywords, which makes the traditional short text classification method ineffective. In this paper, we propose a new method to identify keywords in news headlines and expand their features from sentence level and word level respectively, and finally use convolutional neural networks (CNN) to extract and classify their features. The proposed model was tested on the Sogou News Corpus dataset and achieved 93.42 $$\%$$ % accuracy.https://doi.org/10.1007/s44196-023-00251-4News headline classificationTopic modelFeature weightSemantic extensionCNN |
spellingShingle | Kai Miao Xin He Junyang Yu Guanghui Wang Yongchao Chen A Study of Chinese News Headline Classification Based on Keyword Feature Expansion International Journal of Computational Intelligence Systems News headline classification Topic model Feature weight Semantic extension CNN |
title | A Study of Chinese News Headline Classification Based on Keyword Feature Expansion |
title_full | A Study of Chinese News Headline Classification Based on Keyword Feature Expansion |
title_fullStr | A Study of Chinese News Headline Classification Based on Keyword Feature Expansion |
title_full_unstemmed | A Study of Chinese News Headline Classification Based on Keyword Feature Expansion |
title_short | A Study of Chinese News Headline Classification Based on Keyword Feature Expansion |
title_sort | study of chinese news headline classification based on keyword feature expansion |
topic | News headline classification Topic model Feature weight Semantic extension CNN |
url | https://doi.org/10.1007/s44196-023-00251-4 |
work_keys_str_mv | AT kaimiao astudyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion AT xinhe astudyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion AT junyangyu astudyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion AT guanghuiwang astudyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion AT yongchaochen astudyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion AT kaimiao studyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion AT xinhe studyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion AT junyangyu studyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion AT guanghuiwang studyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion AT yongchaochen studyofchinesenewsheadlineclassificationbasedonkeywordfeatureexpansion |