A Semi-Supervised Paraphrase Identification Model Based on Multi-Granularity Interaction Reasoning

Conventional paraphrase identification (PI) models based on deep learning usually focus on text representation and ignore the mining and matching of multi-granular interaction features. In addition, supervised learning relies on a large labeled data. However, labeled training set for PI is small in...

Full description

Bibliographic Details
Main Authors: Xu Li, Fanxu Zeng, Chunlong Yao
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9050515/
_version_ 1819276400290430976
author Xu Li
Fanxu Zeng
Chunlong Yao
author_facet Xu Li
Fanxu Zeng
Chunlong Yao
author_sort Xu Li
collection DOAJ
description Conventional paraphrase identification (PI) models based on deep learning usually focus on text representation and ignore the mining and matching of multi-granular interaction features. In addition, supervised learning relies on a large labeled data. However, labeled training set for PI is small in comparison with the high complexity of the task. To solve the problems, we propose a semi-supervised deep learning framework for PI. We use a neural encoder with word-by-word attention mechanism to reason equivalence or contradiction over pairs of words, phrases and sentences. We employ a two-stage training procedure. First, we use a language modeling objective to learn the initial parameters on the unlabeled corpora of more than one million pairs of sentences. This is followed by a supervised training, where we adapt these parameters to a specific classification task with labeled data. Experimental results on MRPC (Microsoft Research Paraphrase Corpus) and SICK (Sentences Involving Compositional Knowledge) datasets demonstrate the effectiveness of our approach. Compared with the previous neural network models, we achieve absolute improvements in accuracy of 7.6% and F1 of 5.4% on MRPC, Pearson's r of 4.5% and Spearman's ρ of 5.1% on SICK.
first_indexed 2024-12-23T23:39:37Z
format Article
id doaj.art-62cf5a9e927a43e3800160a865b2e89d
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-23T23:39:37Z
publishDate 2020-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-62cf5a9e927a43e3800160a865b2e89d2022-12-21T17:25:44ZengIEEEIEEE Access2169-35362020-01-018607906080010.1109/ACCESS.2020.29840099050515A Semi-Supervised Paraphrase Identification Model Based on Multi-Granularity Interaction ReasoningXu Li0https://orcid.org/0000-0002-4450-5305Fanxu Zeng1Chunlong Yao2School of Information Science and Engineering, Dalian Polytechnic University, Dalian, ChinaSchool of Information Science and Engineering, Dalian Polytechnic University, Dalian, ChinaSchool of Information Science and Engineering, Dalian Polytechnic University, Dalian, ChinaConventional paraphrase identification (PI) models based on deep learning usually focus on text representation and ignore the mining and matching of multi-granular interaction features. In addition, supervised learning relies on a large labeled data. However, labeled training set for PI is small in comparison with the high complexity of the task. To solve the problems, we propose a semi-supervised deep learning framework for PI. We use a neural encoder with word-by-word attention mechanism to reason equivalence or contradiction over pairs of words, phrases and sentences. We employ a two-stage training procedure. First, we use a language modeling objective to learn the initial parameters on the unlabeled corpora of more than one million pairs of sentences. This is followed by a supervised training, where we adapt these parameters to a specific classification task with labeled data. Experimental results on MRPC (Microsoft Research Paraphrase Corpus) and SICK (Sentences Involving Compositional Knowledge) datasets demonstrate the effectiveness of our approach. Compared with the previous neural network models, we achieve absolute improvements in accuracy of 7.6% and F1 of 5.4% on MRPC, Pearson's r of 4.5% and Spearman's ρ of 5.1% on SICK.https://ieeexplore.ieee.org/document/9050515/Natural language processingparaphrase identificationdeep learningrecurrent neural network (RNN)attention mechanismunsupervised pre-training
spellingShingle Xu Li
Fanxu Zeng
Chunlong Yao
A Semi-Supervised Paraphrase Identification Model Based on Multi-Granularity Interaction Reasoning
IEEE Access
Natural language processing
paraphrase identification
deep learning
recurrent neural network (RNN)
attention mechanism
unsupervised pre-training
title A Semi-Supervised Paraphrase Identification Model Based on Multi-Granularity Interaction Reasoning
title_full A Semi-Supervised Paraphrase Identification Model Based on Multi-Granularity Interaction Reasoning
title_fullStr A Semi-Supervised Paraphrase Identification Model Based on Multi-Granularity Interaction Reasoning
title_full_unstemmed A Semi-Supervised Paraphrase Identification Model Based on Multi-Granularity Interaction Reasoning
title_short A Semi-Supervised Paraphrase Identification Model Based on Multi-Granularity Interaction Reasoning
title_sort semi supervised paraphrase identification model based on multi granularity interaction reasoning
topic Natural language processing
paraphrase identification
deep learning
recurrent neural network (RNN)
attention mechanism
unsupervised pre-training
url https://ieeexplore.ieee.org/document/9050515/
work_keys_str_mv AT xuli asemisupervisedparaphraseidentificationmodelbasedonmultigranularityinteractionreasoning
AT fanxuzeng asemisupervisedparaphraseidentificationmodelbasedonmultigranularityinteractionreasoning
AT chunlongyao asemisupervisedparaphraseidentificationmodelbasedonmultigranularityinteractionreasoning
AT xuli semisupervisedparaphraseidentificationmodelbasedonmultigranularityinteractionreasoning
AT fanxuzeng semisupervisedparaphraseidentificationmodelbasedonmultigranularityinteractionreasoning
AT chunlongyao semisupervisedparaphraseidentificationmodelbasedonmultigranularityinteractionreasoning