Contrastive Refinement for Dense Retrieval Inference in the Open-Domain Question Answering Task

In recent years, dense retrieval has emerged as the primary method for open-domain question-answering (OpenQA). However, previous research often focused on the query side, neglecting the importance of the passage side. We believe that both the query and passage sides are equally important and should...

Full description

Bibliographic Details
Main Authors: Qiuhong Zhai, Wenhao Zhu, Xiaoyu Zhang, Chenyun Liu
Format: Article
Language:English
Published: MDPI AG 2023-03-01
Series:Future Internet
Subjects:
Online Access:https://www.mdpi.com/1999-5903/15/4/137
Description
Summary:In recent years, dense retrieval has emerged as the primary method for open-domain question-answering (OpenQA). However, previous research often focused on the query side, neglecting the importance of the passage side. We believe that both the query and passage sides are equally important and should be considered for improved OpenQA performance. In this paper, we propose a contrastive pseudo-labeled data constructed around passages and queries separately. We employ an improved pseudo-relevance feedback (PRF) algorithm with a knowledge-filtering strategy to enrich the semantic information in dense representations. Additionally, we proposed an Auto Text Representation Optimization Model (AOpt) to iteratively update the dense representations. Experimental results demonstrate that our methods effectively optimize dense representations, making them more distinguishable in dense retrieval, thus improving the OpenQA system’s overall performance.
ISSN:1999-5903