
Breaking the Lens of the Telescope: Online Relevance Estimation over Large Retrieval Sets

Long conference paper, to appear

Authors: Mandeep Rathee, Venktesh V, Sean MacAvaney, Avishek Anand

Appearing in: Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2025)

Links/IDs:
arXiv: 2504.09353
Google Scholar: 7wWfoDgAAAAJ:SP6oXDckpogC
Enlighten: 352749
smac.pub: sigir2025-ore

Abstract:

Advanced relevance models, such as those that use large language models (LLMs), provide highly accurate relevance estimations. However, their computational costs make them infeasible for processing large document corpora. To address this, retrieval systems often employ a telescoping approach, where computationally efficient but less precise lexical and semantic retrievers filter potential candidates for further ranking. However, this approach heavily depends on the quality of early-stage retrieval, which can potentially exclude relevant documents early in the process. In this work, we propose a novel paradigm for re-ranking called online relevance estimation that continuously updates relevance estimates for a query throughout the ranking process. Instead of re-ranking a fixed set of top-k documents in a single step, online relevance estimation iteratively re-scores smaller subsets of the most promising documents while adjusting relevance scores for the remaining pool based on the estimations from the final model using an online bandit-based algorithm. This dynamic process mitigates the recall limitations of telescoping systems by re-prioritizing documents initially deemed less relevant by earlier stages, including those completely excluded by earlier-stage retrievers. We validate our approach on TREC benchmarks under two scenarios: hybrid retrieval and adaptive retrieval. Experimental results demonstrate that our method is sample-efficient and significantly improves recall, highlighting the effectiveness of our online relevance estimation framework for modern search systems.
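The loop described in the abstract (score a small batch with the expensive model, use those scores to revise estimates for the unscored pool, repeat) can be illustrated with a toy sketch. This is not the paper's algorithm: it swaps in a greedy linear estimator trained by a simple least-mean-squares update, and every name here (`online_relevance_estimation`, `expensive_scorer`, `features`, the toy document IDs and scores) is a hypothetical chosen for illustration.

```python
from typing import Callable, Dict, List, Sequence


def online_relevance_estimation(
    features: Dict[str, Sequence[float]],
    expensive_scorer: Callable[[str], float],
    budget: int,
    batch_size: int = 2,
    learning_rate: float = 0.1,
) -> List[str]:
    """Rank up to `budget` documents, re-estimating the pool after each batch.

    Assumption: each document has cheap first-stage features (for example a
    lexical and a semantic score), and relevance is estimated as a weighted sum.
    """
    dim = len(next(iter(features.values())))
    weights = [1.0] * dim  # start by trusting every cheap signal equally
    scored: Dict[str, float] = {}

    def estimate(doc_id: str) -> float:
        return sum(w * x for w, x in zip(weights, features[doc_id]))

    while len(scored) < min(budget, len(features)):
        # Pick the currently most promising unscored documents.
        remaining = [d for d in features if d not in scored]
        remaining.sort(key=estimate, reverse=True)
        take = min(batch_size, budget - len(scored))
        for doc_id in remaining[:take]:
            target = expensive_scorer(doc_id)  # the costly model call
            scored[doc_id] = target
            # Least-mean-squares step: move the estimator toward the target.
            error = target - estimate(doc_id)
            for i, x in enumerate(features[doc_id]):
                weights[i] += learning_rate * error * x

    # Final ranking uses only scores from the expensive model.
    return sorted(scored, key=scored.get, reverse=True)


# Toy pool: features are [lexical, semantic]; the stand-in expensive model
# agrees with the semantic feature, so the lexical signal is misleading.
toy_features = {
    "a": [1.0, 0.0],
    "b": [0.8, 0.1],
    "c": [0.6, 0.0],
    "d": [0.2, 0.6],
    "e": [0.0, 0.55],
    "f": [0.5, 0.0],
}
calls: List[str] = []


def toy_scorer(doc_id: str) -> float:
    calls.append(doc_id)
    return toy_features[doc_id][1]


ranking = online_relevance_estimation(
    toy_features, toy_scorer, budget=4, batch_size=2, learning_rate=0.5
)
print(ranking)
```

In this toy pool, a fixed top-4 cut by summed first-stage scores would keep a, b, d, and c and never score e. The sketch scores a and b first, learns that the lexical feature is unreliable, and spends the remaining budget on d and e instead.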

BibTeX:

@inproceedings{rathee:sigir2025-ore,
  author = {Rathee, Mandeep and V, Venktesh and MacAvaney, Sean and Anand, Avishek},
  title = {Breaking the Lens of the Telescope: Online Relevance Estimation over Large Retrieval Sets},
  booktitle = {Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval},
  year = {2025},
  url = {https://arxiv.org/abs/2504.09353}
}