Text Matching in Insurance Question-Answering Community Based on an Integrated BiLSTM-TextCNN Model Fusing Multi-Feature

Along with the explosion of ChatGPT, the artificial intelligence question-answering system has been pushed to a climax. Intelligent question-answering enables computers to simulate people’s behavior habits of understanding a corpus through machine learning, so as to answer questions in professional...

Full description

Bibliographic Details
Main Authors:	Zhaohui Li, Xueru Yang, Luli Zhou, Hongyu Jia, Wenli Li
Format:	Article
Language:	English
Published:	MDPI AG 2023-04-01
Series:	Entropy
Subjects:	insurance question answering community text matching ERNIE feature extraction deep learning
Online Access:	https://www.mdpi.com/1099-4300/25/4/639

_version_	1797605574749192192
author	Zhaohui Li Xueru Yang Luli Zhou Hongyu Jia Wenli Li
author_facet	Zhaohui Li Xueru Yang Luli Zhou Hongyu Jia Wenli Li
author_sort	Zhaohui Li
collection	DOAJ
description	Along with the explosion of ChatGPT, the artificial intelligence question-answering system has been pushed to a climax. Intelligent question-answering enables computers to simulate people’s behavior habits of understanding a corpus through machine learning, so as to answer questions in professional fields. How to obtain more accurate answers to personalized questions in professional fields is the core content of intelligent question-answering research. As one of the key technologies of intelligent question-answering, the accuracy of text matching is related to the development of the intelligent question-answering community. Aiming to solve the problem of polysemy of text, the Enhanced Representation through Knowledge Integration (ERNIE) model is used to obtain the word vector representation of text, which makes up for the lack of prior knowledge in the traditional word vector representation model. Additionally, there are also problems of homophones and polyphones in Chinese, so this paper introduces the phonetic character sequence of the text to distinguish them. In addition, aiming at the problem that there are many proper nouns in the insurance field that are difficult to identify, after conventional part-of-speech tagging, proper nouns are distinguished by especially defining their parts of speech. After the above three types of text-based semantic feature extensions, this paper also uses the Bi-directional Long Short-Term Memory (BiLSTM) and TextCNN models to extract the global features and local features of the text, respectively. It can obtain the feature representation of the text more comprehensively. Thus, the text matching model integrating BiLSTM and TextCNN fusing Multi-Feature (namely MFBT) is proposed for the insurance question-answering community. The MFBT model aims to solve the problems that affect the answer selection in the insurance question-answering community, such as proper nouns, nonstandard sentences and sparse features. Taking the question-and-answer data of the insurance library as the sample, the MFBT text-matching model is compared and evaluated with other models. The experimental results show that the MFBT text-matching model has higher evaluation index values, including accuracy, recall and F1, than other models. The model trained by historical search data can better help users in the insurance question-and-answer community obtain the answers they need and improve their satisfaction.
first_indexed	2024-03-11T05:03:01Z
format	Article
id	doaj.art-55e30bb5aee64c9f943799f1a08909b2
institution	Directory Open Access Journal
issn	1099-4300
language	English
last_indexed	2024-03-11T05:03:01Z
publishDate	2023-04-01
publisher	MDPI AG
record_format	Article
series	Entropy
spelling	doaj.art-55e30bb5aee64c9f943799f1a08909b22023-11-17T19:08:57ZengMDPI AGEntropy1099-43002023-04-0125463910.3390/e25040639Text Matching in Insurance Question-Answering Community Based on an Integrated BiLSTM-TextCNN Model Fusing Multi-FeatureZhaohui Li0Xueru Yang1Luli Zhou2Hongyu Jia3Wenli Li4School of Maritime Economics and Management, Dalian Maritime University, Dalian 116026, ChinaSchool of Maritime Economics and Management, Dalian Maritime University, Dalian 116026, ChinaSchool of Maritime Economics and Management, Dalian Maritime University, Dalian 116026, ChinaSchool of Maritime Economics and Management, Dalian Maritime University, Dalian 116026, ChinaSchool of Economics and Management, Dalian University of Technology, Dalian 116024, ChinaAlong with the explosion of ChatGPT, the artificial intelligence question-answering system has been pushed to a climax. Intelligent question-answering enables computers to simulate people’s behavior habits of understanding a corpus through machine learning, so as to answer questions in professional fields. How to obtain more accurate answers to personalized questions in professional fields is the core content of intelligent question-answering research. As one of the key technologies of intelligent question-answering, the accuracy of text matching is related to the development of the intelligent question-answering community. Aiming to solve the problem of polysemy of text, the Enhanced Representation through Knowledge Integration (ERNIE) model is used to obtain the word vector representation of text, which makes up for the lack of prior knowledge in the traditional word vector representation model. Additionally, there are also problems of homophones and polyphones in Chinese, so this paper introduces the phonetic character sequence of the text to distinguish them. In addition, aiming at the problem that there are many proper nouns in the insurance field that are difficult to identify, after conventional part-of-speech tagging, proper nouns are distinguished by especially defining their parts of speech. After the above three types of text-based semantic feature extensions, this paper also uses the Bi-directional Long Short-Term Memory (BiLSTM) and TextCNN models to extract the global features and local features of the text, respectively. It can obtain the feature representation of the text more comprehensively. Thus, the text matching model integrating BiLSTM and TextCNN fusing Multi-Feature (namely MFBT) is proposed for the insurance question-answering community. The MFBT model aims to solve the problems that affect the answer selection in the insurance question-answering community, such as proper nouns, nonstandard sentences and sparse features. Taking the question-and-answer data of the insurance library as the sample, the MFBT text-matching model is compared and evaluated with other models. The experimental results show that the MFBT text-matching model has higher evaluation index values, including accuracy, recall and F1, than other models. The model trained by historical search data can better help users in the insurance question-and-answer community obtain the answers they need and improve their satisfaction.https://www.mdpi.com/1099-4300/25/4/639insurance question answering communitytext matchingERNIEfeature extractiondeep learning
spellingShingle	Zhaohui Li Xueru Yang Luli Zhou Hongyu Jia Wenli Li Text Matching in Insurance Question-Answering Community Based on an Integrated BiLSTM-TextCNN Model Fusing Multi-Feature Entropy insurance question answering community text matching ERNIE feature extraction deep learning
title	Text Matching in Insurance Question-Answering Community Based on an Integrated BiLSTM-TextCNN Model Fusing Multi-Feature
title_full	Text Matching in Insurance Question-Answering Community Based on an Integrated BiLSTM-TextCNN Model Fusing Multi-Feature
title_fullStr	Text Matching in Insurance Question-Answering Community Based on an Integrated BiLSTM-TextCNN Model Fusing Multi-Feature
title_full_unstemmed	Text Matching in Insurance Question-Answering Community Based on an Integrated BiLSTM-TextCNN Model Fusing Multi-Feature
title_short	Text Matching in Insurance Question-Answering Community Based on an Integrated BiLSTM-TextCNN Model Fusing Multi-Feature
title_sort	text matching in insurance question answering community based on an integrated bilstm textcnn model fusing multi feature
topic	insurance question answering community text matching ERNIE feature extraction deep learning
url	https://www.mdpi.com/1099-4300/25/4/639
work_keys_str_mv	AT zhaohuili textmatchingininsurancequestionansweringcommunitybasedonanintegratedbilstmtextcnnmodelfusingmultifeature AT xueruyang textmatchingininsurancequestionansweringcommunitybasedonanintegratedbilstmtextcnnmodelfusingmultifeature AT lulizhou textmatchingininsurancequestionansweringcommunitybasedonanintegratedbilstmtextcnnmodelfusingmultifeature AT hongyujia textmatchingininsurancequestionansweringcommunitybasedonanintegratedbilstmtextcnnmodelfusingmultifeature AT wenlili textmatchingininsurancequestionansweringcommunitybasedonanintegratedbilstmtextcnnmodelfusingmultifeature

Text Matching in Insurance Question-Answering Community Based on an Integrated BiLSTM-TextCNN Model Fusing Multi-Feature

Similar Items