Aggretriever: A Simple Approach to Aggregate Textual Representations for Robust Dense Passage Retrieval

Sheng-Chieh Lin, Minghan Li, Jimmy Lin

doi:10.1162/tacl_a_00556


Open Access

https://doi.org/10.1162/tacl_a_00556


Abstract

Pre-trained language models have been successful in many knowledge-intensive NLP tasks. However, recent work has shown that models such as BERT are not “structurally ready” to aggregate textual information into a [CLS] vector for dense passage retrieval (DPR). This “lack of readiness” results from the gap between language model pre-training and DPR fine-tuning. Previous solutions call for computationally expensive techniques such as hard negative mining, cross-encoder distillation, and further pre-training to learn a robust DPR model. In this work, we instead propose to fully exploit knowledge in a pre-trained language model for DPR by aggregating the contextualized token embeddings into a dense vector, which we call agg★. By concatenating vectors from the [CLS] token and agg★, our Aggretriever model substantially improves the effectiveness of dense retrieval models on both in-domain and zero-shot evaluations without introducing substantial training overhead. Code is available at https://github.com/castorini/dhr.
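
The concatenation step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how agg★ is computed, so the example below substitutes element-wise max pooling over token embeddings purely as a placeholder; it is not the paper's aggregation method. All function names and the toy vectors are hypothetical, and a real system would take its embeddings from a fine-tuned encoder rather than hand-written lists.

```python
from typing import List

Vector = List[float]


def max_pool(token_embeddings: List[Vector]) -> Vector:
    """Element-wise max over token vectors.

    ASSUMPTION: a stand-in aggregator for illustration only,
    not the agg* computation from the paper.
    """
    if not token_embeddings:
        raise ValueError("need at least one token embedding")
    return [max(dims) for dims in zip(*token_embeddings)]


def concat_representation(cls_vec: Vector, token_embeddings: List[Vector]) -> Vector:
    """Concatenate the [CLS] vector with the aggregated token vector."""
    return cls_vec + max_pool(token_embeddings)


def dot(u: Vector, v: Vector) -> float:
    """Inner-product relevance score between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    return sum(a * b for a, b in zip(u, v))


# Toy 2-dimensional embeddings (hypothetical values).
query_vec = concat_representation([0.2, 0.1], [[0.5, 0.0], [0.1, 0.4]])
passage_vec = concat_representation([0.3, 0.3], [[0.2, 0.6], [0.7, 0.1]])
score = dot(query_vec, passage_vec)
```

Because the two parts are concatenated, the inner product splits into a [CLS]-to-[CLS] term plus an aggregate-to-aggregate term, so each part contributes to the relevance score independently.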


Journal: Transactions of the Association for Computational Linguistics
Publication Date: May 18, 2023
Citations: 5
License type: CC BY 4.0

Similar Papers

Better Few-Shot Text Classification with Pre-trained Language Model
Zheng Chen ... Yunchen Zhang
01 Jan 2020

Neural Transfer Learning For Vietnamese Sentiment Analysis Using Pre-trained Contextual Language Models
An Pha Le ... Tran Vu Pham
16 Dec 2021

A Comparison of Pre-Trained Language Models for Multi-Class Text Classification in the Financial Domain
Yusuf Arslan ... Lisa Veiber
19 Apr 2021

A Study on the Integration of Pre-Trained SSL, ASR, LM and SLU Models for Spoken Language Understanding
Yifan Peng ... Siddharth Dalmia
09 Jan 2023
