Chinese sentence similarity calculation based on modifiers

Bibliographic Details
Main Authors: Wang, Fangling; Ye, Shaoqiang; Kang, Diwen; Mohd. Zain, Azlan; Zhou, Kaiqing
Format: Book Section
Published: Springer Science and Business Media Deutschland GmbH 2022
Description
Summary: To compute the similarity of Chinese sentences accurately, a revised Chinese sentence similarity approach is proposed that enhances the importance of the modifiers of the sentence stem. After the modifier part of each sentence is extracted with the Language Technology Platform (LTP), the longest common substring is removed from that part of each structure so that the similarity of the modifier parts is captured better. The method has three phases: splitting the sentences into principal (subject) and predicate-object structures with the syntactic analysis tool; generating modifier and sentence-stem vectors with Word2Vec and calculating the similarity between the vectors; and obtaining the similarity between two sentences by weighting each part. Experimental results on 200 sentences of the LCQMC dataset, together with the corresponding analysis, indicate that the proposed method obtains more accurate similarity results by effectively capturing the modifier part of the sentence structure, which strongly affects the meaning of the whole sentence.
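The three phases in the summary can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the sentences have already been split into stem and modifier token lists (standing in for the LTP parse) and that `embeddings` is a plain dictionary of word vectors (standing in for a trained Word2Vec model). Mean pooling of word vectors, the `stem_weight` default of 0.5, the score of 1.0 for two empty parts, and all function names are hypothetical choices; the record does not state them.

```python
import math
from difflib import SequenceMatcher


def strip_longest_common_substring(a, b):
    """Drop the longest contiguous run of tokens shared by lists a and b."""
    match = SequenceMatcher(None, a, b, autojunk=False).find_longest_match(
        0, len(a), 0, len(b)
    )
    if match.size == 0:
        return list(a), list(b)
    return (
        a[: match.a] + a[match.a + match.size:],
        b[: match.b] + b[match.b + match.size:],
    )


def mean_vector(tokens, embeddings):
    """Average the vectors of known tokens; None if no token is known."""
    vecs = [embeddings[t] for t in tokens if t in embeddings]
    if not vecs:
        return None
    return [sum(col) / len(vecs) for col in zip(*vecs)]


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 for a zero vector)."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def part_similarity(tokens_a, tokens_b, embeddings):
    """Similarity of one sentence part (stem or modifiers)."""
    if not tokens_a and not tokens_b:
        return 1.0  # assumption: two empty parts count as identical
    vec_a = mean_vector(tokens_a, embeddings)
    vec_b = mean_vector(tokens_b, embeddings)
    if vec_a is None or vec_b is None:
        return 0.0
    return cosine(vec_a, vec_b)


def sentence_similarity(stem_a, mod_a, stem_b, mod_b, embeddings, stem_weight=0.5):
    """Weighted sum of stem similarity and modifier similarity.

    The shared run of modifier tokens is removed first, so the modifier
    score reflects only the tokens that differ. stem_weight is hypothetical.
    """
    mod_a, mod_b = strip_longest_common_substring(mod_a, mod_b)
    stem_sim = part_similarity(stem_a, stem_b, embeddings)
    mod_sim = part_similarity(mod_a, mod_b, embeddings)
    return stem_weight * stem_sim + (1 - stem_weight) * mod_sim
```

In practice the token lists would come from a dependency parse and the vectors from a trained embedding model; raising or lowering `stem_weight` shifts how much the modifiers influence the final score.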