A survey of the development of computer text analysis algorithms

Abstract: Computer text analysis is an important branch of natural language processing that studies how to extract various types of information from the text data in a given corpus. Computer text analysis has now entered a new stage. On...

Bibliographic Details
Main Authors: Sun Jinghan, Ren Jing
Format: Article
Language: zho
Published: National Computer System Engineering Research Institute of China 2023-03-01
Series: Dianzi Jishu Yingyong
Subjects: text analysis; natural language processing; algorithm
Online Access: http://www.chinaaet.com/article/3000160065
_version_ 1797448186393001984
author Sun Jinghan
Ren Jing
collection DOAJ
description Abstract: Computer text analysis is an important branch of natural language processing that studies how to extract various types of information from the text data in a given corpus. Computer text analysis has now entered a new stage. On the one hand, keyword extraction algorithms have gradually matured. On the other hand, with the emergence of the BERT method, great progress has been made on the problem of computing word vectors. However, open problems remain in both keyword extraction and word vector calculation. In addition, many existing studies that could benefit from text analysis still rely on outdated methods. Therefore, how to reduce model size so as to promote interdisciplinary integration and improve the overall social benefit of text analysis will become an important issue in the future development of text analysis algorithms.
first_indexed 2024-03-09T14:06:49Z
format Article
id doaj.art-5b715f19401148598e4610402116ccfc
institution Directory Open Access Journal
issn 0258-7998
language zho
last_indexed 2024-03-09T14:06:49Z
publishDate 2023-03-01
publisher National Computer System Engineering Research Institute of China
record_format Article
series Dianzi Jishu Yingyong
spelling doaj.art-5b715f19401148598e4610402116ccfc; 2023-11-30T03:43:54Z; zho; National Computer System Engineering Research Institute of China; Dianzi Jishu Yingyong; ISSN 0258-7998; 2023-03-01; vol. 49, no. 3, pp. 42-47; DOI 10.16157/j.issn.0258-7998.223117; article ID 3000160065; A survey of the development of computer text analysis algorithms; Sun Jinghan; Ren Jing; Affiliations: Beijing University of Technology, Beijing 100124, China; The Sixth Research Institute of China Electronics Corporation, Beijing 100083, China; http://www.chinaaet.com/article/3000160065; text analysis; natural language processing; algorithm
title A survey of the development of computer text analysis algorithms
topic text analysis
natural language processing
algorithm
url http://www.chinaaet.com/article/3000160065