A Semi-Supervised Paraphrase Identification Model Based on Multi-Granularity Interaction Reasoning

Conventional paraphrase identification (PI) models based on deep learning usually focus on text representation and ignore the mining and matching of multi-granular interaction features. In addition, supervised learning relies on a large labeled data. However, labeled training set for PI is small in...

Full description

Bibliographic Details
Main Authors:	Xu Li, Fanxu Zeng, Chunlong Yao
Format:	Article
Language:	English
Published:	IEEE 2020-01-01
Series:	IEEE Access
Subjects:	Natural language processing paraphrase identification deep learning recurrent neural network (RNN) attention mechanism unsupervised pre-training
Online Access:	https://ieeexplore.ieee.org/document/9050515/

_version_	1819276400290430976
author	Xu Li Fanxu Zeng Chunlong Yao
author_facet	Xu Li Fanxu Zeng Chunlong Yao
author_sort	Xu Li
collection	DOAJ
description	Conventional paraphrase identification (PI) models based on deep learning usually focus on text representation and ignore the mining and matching of multi-granular interaction features. In addition, supervised learning relies on a large labeled data. However, labeled training set for PI is small in comparison with the high complexity of the task. To solve the problems, we propose a semi-supervised deep learning framework for PI. We use a neural encoder with word-by-word attention mechanism to reason equivalence or contradiction over pairs of words, phrases and sentences. We employ a two-stage training procedure. First, we use a language modeling objective to learn the initial parameters on the unlabeled corpora of more than one million pairs of sentences. This is followed by a supervised training, where we adapt these parameters to a specific classification task with labeled data. Experimental results on MRPC (Microsoft Research Paraphrase Corpus) and SICK (Sentences Involving Compositional Knowledge) datasets demonstrate the effectiveness of our approach. Compared with the previous neural network models, we achieve absolute improvements in accuracy of 7.6% and F1 of 5.4% on MRPC, Pearson's r of 4.5% and Spearman's ρ of 5.1% on SICK.
first_indexed	2024-12-23T23:39:37Z
format	Article
id	doaj.art-62cf5a9e927a43e3800160a865b2e89d
institution	Directory Open Access Journal
issn	2169-3536
language	English
last_indexed	2024-12-23T23:39:37Z
publishDate	2020-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj.art-62cf5a9e927a43e3800160a865b2e89d2022-12-21T17:25:44ZengIEEEIEEE Access2169-35362020-01-018607906080010.1109/ACCESS.2020.29840099050515A Semi-Supervised Paraphrase Identification Model Based on Multi-Granularity Interaction ReasoningXu Li0https://orcid.org/0000-0002-4450-5305Fanxu Zeng1Chunlong Yao2School of Information Science and Engineering, Dalian Polytechnic University, Dalian, ChinaSchool of Information Science and Engineering, Dalian Polytechnic University, Dalian, ChinaSchool of Information Science and Engineering, Dalian Polytechnic University, Dalian, ChinaConventional paraphrase identification (PI) models based on deep learning usually focus on text representation and ignore the mining and matching of multi-granular interaction features. In addition, supervised learning relies on a large labeled data. However, labeled training set for PI is small in comparison with the high complexity of the task. To solve the problems, we propose a semi-supervised deep learning framework for PI. We use a neural encoder with word-by-word attention mechanism to reason equivalence or contradiction over pairs of words, phrases and sentences. We employ a two-stage training procedure. First, we use a language modeling objective to learn the initial parameters on the unlabeled corpora of more than one million pairs of sentences. This is followed by a supervised training, where we adapt these parameters to a specific classification task with labeled data. Experimental results on MRPC (Microsoft Research Paraphrase Corpus) and SICK (Sentences Involving Compositional Knowledge) datasets demonstrate the effectiveness of our approach. Compared with the previous neural network models, we achieve absolute improvements in accuracy of 7.6% and F1 of 5.4% on MRPC, Pearson's r of 4.5% and Spearman's ρ of 5.1% on SICK.https://ieeexplore.ieee.org/document/9050515/Natural language processingparaphrase identificationdeep learningrecurrent neural network (RNN)attention mechanismunsupervised pre-training
spellingShingle	Xu Li Fanxu Zeng Chunlong Yao A Semi-Supervised Paraphrase Identification Model Based on Multi-Granularity Interaction Reasoning IEEE Access Natural language processing paraphrase identification deep learning recurrent neural network (RNN) attention mechanism unsupervised pre-training
title	A Semi-Supervised Paraphrase Identification Model Based on Multi-Granularity Interaction Reasoning
title_full	A Semi-Supervised Paraphrase Identification Model Based on Multi-Granularity Interaction Reasoning
title_fullStr	A Semi-Supervised Paraphrase Identification Model Based on Multi-Granularity Interaction Reasoning
title_full_unstemmed	A Semi-Supervised Paraphrase Identification Model Based on Multi-Granularity Interaction Reasoning
title_short	A Semi-Supervised Paraphrase Identification Model Based on Multi-Granularity Interaction Reasoning
title_sort	semi supervised paraphrase identification model based on multi granularity interaction reasoning
topic	Natural language processing paraphrase identification deep learning recurrent neural network (RNN) attention mechanism unsupervised pre-training
url	https://ieeexplore.ieee.org/document/9050515/
work_keys_str_mv	AT xuli asemisupervisedparaphraseidentificationmodelbasedonmultigranularityinteractionreasoning AT fanxuzeng asemisupervisedparaphraseidentificationmodelbasedonmultigranularityinteractionreasoning AT chunlongyao asemisupervisedparaphraseidentificationmodelbasedonmultigranularityinteractionreasoning AT xuli semisupervisedparaphraseidentificationmodelbasedonmultigranularityinteractionreasoning AT fanxuzeng semisupervisedparaphraseidentificationmodelbasedonmultigranularityinteractionreasoning AT chunlongyao semisupervisedparaphraseidentificationmodelbasedonmultigranularityinteractionreasoning

A Semi-Supervised Paraphrase Identification Model Based on Multi-Granularity Interaction Reasoning

Similar Items