Efficient Neural Ranking Using Forward Indexes and Lightweight Encoders

Jurek Leonhardt,Avishek Anand,Abhijit Anand,Henrik Müller,Megha Khosla,Koustav Rudra

doi:10.1145/3631939

Abstract

Dual-encoder-based dense retrieval models have become the standard in IR. They employ large Transformer-based language models, which are notoriously inefficient in terms of resources and latency. We propose Fast-Forward indexes—vector forward indexes which exploit the semantic matching capabilities of dual-encoder models for efficient and effective re-ranking. Our framework enables re-ranking at very high retrieval depths and combines the merits of both lexical and semantic matching via score interpolation. Furthermore, in order to mitigate the limitations of dual-encoders, we tackle two main challenges: Firstly, we improve computational efficiency by either pre-computing representations, avoiding unnecessary computations altogether, or reducing the complexity of encoders. This allows us to considerably improve ranking efficiency and latency. Secondly, we optimize the memory footprint and maintenance cost of indexes; we propose two complementary techniques to reduce the index size and show that, by dynamically dropping irrelevant document tokens, the index maintenance efficiency can be improved substantially. We perform an evaluation to show the effectiveness and efficiency of Fast-Forward indexes—our method has low latency and achieves competitive results without the need for hardware acceleration, such as GPUs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: ACM Transactions on Information Systems	Publication Date: Apr 29, 2024
Citations: 1	License type: cc-by-sa

R Discovery Prime

R Discovery Prime

Efficient Neural Ranking Using Forward Indexes and Lightweight Encoders

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Information Systems

Lead the way for us

Similar Papers

Commonsense Knowledge in Foundation and Large Language Models
Harsh Bhardwaj ... Maniya Tadhiyal
International Journal of Advanced Research in Science, Communication and Technology | VOL. -
Harsh Bhardwaj, et. al. Harsh Bhardwaj ... Maniya Tadhiyal
08 Feb 2024
International Journal of Advanced Research in Science, Communication and Technology | VOL. -

Forwarding and Optical Indices in an All-Optical BCube Networks
Suzhen Wang ... Yuan-Hsun Lo
-
Suzhen Wang, et. al.Suzhen Wang ... Yuan-Hsun Lo
01 Nov 2018
01 Nov 2018

Validation of large language models for detecting pathologic complete response in breast cancer using population-based pathology reports
Ken Cheligeer ... Yuan Xu
BMC Medical Informatics and Decision Making | VOL. 24
Ken Cheligeer, et. al.Ken Cheligeer ... Yuan Xu
03 Oct 2024
BMC Medical Informatics and Decision Making | VOL. 24

Chatbots and Large Language Models in Radiology: A Practical Primer for Clinical and Research Applications.
Rajesh Bhayana
Radiology | VOL. 310
Rajesh BhayanaRajesh Bhayana
01 Jan 2024
Radiology | VOL. 310

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient Neural Ranking Using Forward Indexes and Lightweight Encoders

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Information Systems