Effective Inference-Free Retrieval for Learned Sparse Representations

Keywords: FOS: Computer and information sciences, Information Retrieval (cs.IR), Learned Sparse Retrieval, Inference-Free, Efficiency, Computer Science - Information Retrieval

Franco Maria Nardini; Thong Nguyen; Cosimo Rulli; Rossano Venturini; Andrew Yates


Data sources:
- IRIS Cnr: Conference object, 2025. License: CC BY. Full-Text: https://iris.cnr.it/bitstream/20.500.14243/549723/1/3726302.3730185.pdf
- arXiv.org e-Print Archive: Preprint, 2025.
- Crossref: Article, 2025, peer-reviewed. https://doi.org/10.1145/372630...
- Datacite: Article, 2025. License: CC BY. https://dx.doi.org/10.48550/ar...
- DBLP: Conference object.
- DBLP: Article.


Publication type: Article, Preprint, Conference object
Date: 13 Jul 2025
Embargo end date: 01 Jan 2025
Country: Italy
Publisher: ACM
Journal: Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval


doi: 10.1145/3726302.3730185, 10.48550/arxiv.2505.01452

arXiv: 2505.01452

handle: 20.500.14243/549723


Abstract

Learned Sparse Retrieval (LSR) is an effective IR approach that exploits pre-trained language models for encoding text into a learned bag of words. Several efforts in the literature have shown that sparsity is key to enabling a good trade-off between the efficiency and effectiveness of the query processor. To induce the right degree of sparsity, researchers typically use regularization techniques when training LSR models. Recently, new efficient retrieval engines based on inverted indexes have been proposed, leading to a natural question: has the role of regularization changed in training LSR models? In this paper, we conduct an extended evaluation of regularization approaches for LSR, discussing their effectiveness, efficiency, and out-of-domain generalization capabilities. We first show that regularization can be relaxed to produce more effective LSR encoders. We also show that query encoding is now the bottleneck limiting the overall query processor performance. To remove this bottleneck, we advance the state of the art of inference-free LSR by proposing Learned Inference-free Retrieval (Li-LSR). At training time, Li-LSR learns a score for each token, turning the query encoding step into a simple table lookup. Our approach yields state-of-the-art effectiveness for both in-domain and out-of-domain evaluation, surpassing Splade-v3-Doc by 1 point of MRR@10 on MS MARCO and 1.8 points of nDCG@10 on BEIR.
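
The table-lookup idea in the abstract can be illustrated with a small Python sketch. This is not the authors' implementation: the names `TOKEN_SCORES`, `encode_query`, and `score`, the whitespace tokenization, and all numeric weights are hypothetical and chosen only for illustration. It assumes that each vocabulary token has one fixed score learned at training time, so encoding a query needs no model inference, and that relevance is the dot product between the sparse query and document vectors.

```python
# Hypothetical per-token scores. In an inference-free model these would be
# learned at training time and then frozen into a static table.
TOKEN_SCORES = {
    "sparse": 1.5,
    "retrieval": 1.0,
    "learned": 0.5,
}


def encode_query(tokens, token_scores):
    """Build a sparse query vector using only table lookups (no model call)."""
    # Assumption: tokens missing from the table are dropped, and a repeated
    # token contributes its score once. Other conventions are possible.
    return {tok: token_scores[tok] for tok in dict.fromkeys(tokens) if tok in token_scores}


def score(query_vec, doc_vec):
    """Dot product between two sparse vectors stored as dicts."""
    return sum(weight * doc_vec.get(tok, 0.0) for tok, weight in query_vec.items())


# Made-up document weights, standing in for the output of a document encoder.
doc_vec = {"sparse": 2.0, "retrieval": 1.0, "index": 3.0}

query_tokens = "learned sparse retrieval models".split()
query_vec = encode_query(query_tokens, TOKEN_SCORES)
relevance = score(query_vec, doc_vec)
```

In this sketch, the query-time cost is one dictionary lookup per query token, while the document weights are assumed to have been computed offline at indexing time.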


