Skip to main content

Showing 1–16 of 16 results for author: Jagerman, R

Searching in archive cs. Search in all archives.
.
  1. Reliable Confidence Intervals for Information Retrieval Evaluation Using Generative A.I

    Authors: Harrie Oosterhuis, Rolf Jagerman, Zhen Qin, Xuanhui Wang, Michael Bendersky

    Abstract: The traditional evaluation of information retrieval (IR) systems is generally very costly as it requires manual relevance annotation from human experts. Recent advancements in generative artificial intelligence -- specifically large language models (LLMs) -- can generate relevance annotations at an enormous scale with relatively small computational costs. Potentially, this could alleviate the cost… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: KDD '24

  2. arXiv:2404.11791  [pdf, other

    cs.IR

    Consolidating Ranking and Relevance Predictions of Large Language Models through Post-Processing

    Authors: Le Yan, Zhen Qin, Honglei Zhuang, Rolf Jagerman, Xuanhui Wang, Michael Bendersky, Harrie Oosterhuis

    Abstract: The powerful generative abilities of large language models (LLMs) show potential in generating relevance labels for search applications. Previous work has found that directly asking about relevancy, such as ``How relevant is document A to query Q?", results in sub-optimal ranking. Instead, the pairwise ranking prompting (PRP) approach produces promising ranking performance through asking about pai… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  3. Can Query Expansion Improve Generalization of Strong Cross-Encoder Rankers?

    Authors: Minghan Li, Honglei Zhuang, Kai Hui, Zhen Qin, Jimmy Lin, Rolf Jagerman, Xuanhui Wang, Michael Bendersky

    Abstract: Query expansion has been widely used to improve the search results of first-stage retrievers, yet its influence on second-stage, cross-encoder rankers remains under-explored. A recent work of Weller et al. [44] shows that current expansion techniques benefit weaker models such as DPR and BM25 but harm stronger rankers such as MonoT5. In this paper, we re-examine this conclusion and raise the follo… ▽ More

    Submitted 30 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  4. arXiv:2306.17563  [pdf, other

    cs.IR cs.CL cs.LG

    Large Language Models are Effective Text Rankers with Pairwise Ranking Prompting

    Authors: Zhen Qin, Rolf Jagerman, Kai Hui, Honglei Zhuang, Junru Wu, Le Yan, Jiaming Shen, Tianqi Liu, Jialu Liu, Donald Metzler, Xuanhui Wang, Michael Bendersky

    Abstract: Ranking documents using Large Language Models (LLMs) by directly feeding the query and candidate documents into the prompt is an interesting and practical problem. However, researchers have found it difficult to outperform fine-tuned baseline rankers on benchmark datasets. We analyze pointwise and listwise ranking prompts used by existing methods and argue that off-the-shelf LLMs do not fully unde… ▽ More

    Submitted 28 March, 2024; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: Accepted to NAACL 2024. Corrected results of RankT5 on TREC-DL19

  5. arXiv:2306.04455  [pdf, ps, other

    cs.IR

    RD-Suite: A Benchmark for Ranking Distillation

    Authors: Zhen Qin, Rolf Jagerman, Rama Pasumarthi, Honglei Zhuang, He Zhang, Aijun Bai, Kai Hui, Le Yan, Xuanhui Wang

    Abstract: The distillation of ranking models has become an important topic in both academia and industry. In recent years, several advanced methods have been proposed to tackle this problem, often leveraging ranking information from teacher rankers that is absent in traditional classification settings. To date, there is no well-established consensus on how to evaluate this class of models. Moreover, inconsi… ▽ More

    Submitted 12 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 15 pages, 2 figures. arXiv admin note: text overlap with arXiv:2011.04006 by other authors

    ACM Class: H.3.3

  6. arXiv:2305.03653  [pdf, other

    cs.IR

    Query Expansion by Prompting Large Language Models

    Authors: Rolf Jagerman, Honglei Zhuang, Zhen Qin, Xuanhui Wang, Michael Bendersky

    Abstract: Query expansion is a widely used technique to improve the recall of search systems. In this paper, we propose an approach to query expansion that leverages the generative abilities of Large Language Models (LLMs). Unlike traditional query expansion approaches such as Pseudo-Relevance Feedback (PRF) that relies on retrieving a good set of pseudo-relevant documents to expand queries, we rely on the… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: 7 pages, 2 figures

    ACM Class: H.3.3

  7. arXiv:2211.01494  [pdf, other

    cs.IR

    Regression Compatible Listwise Objectives for Calibrated Ranking with Binary Relevance

    Authors: Aijun Bai, Rolf Jagerman, Zhen Qin, Le Yan, Pratyush Kar, Bing-Rong Lin, Xuanhui Wang, Michael Bendersky, Marc Najork

    Abstract: As Learning-to-Rank (LTR) approaches primarily seek to improve ranking quality, their output scores are not scale-calibrated by design. This fundamentally limits LTR usage in score-sensitive applications. Though a simple multi-objective approach that combines a regression and a ranking objective can effectively learn scale-calibrated scores, we argue that the two objectives are not necessarily com… ▽ More

    Submitted 21 August, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

  8. arXiv:2210.10634  [pdf, other

    cs.IR cs.CL

    RankT5: Fine-Tuning T5 for Text Ranking with Ranking Losses

    Authors: Honglei Zhuang, Zhen Qin, Rolf Jagerman, Kai Hui, Ji Ma, Jing Lu, Jianmo Ni, Xuanhui Wang, Michael Bendersky

    Abstract: Recently, substantial progress has been made in text ranking based on pretrained language models such as BERT. However, there are limited studies on how to leverage more powerful sequence-to-sequence models such as T5. Existing attempts usually formulate text ranking as classification and rely on postprocessing to obtain a ranked list. In this paper, we propose RankT5 and study two T5-based rankin… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: 13 pages

  9. arXiv:2005.10615  [pdf, other

    cs.LG cs.IR stat.ML

    Accelerated Convergence for Counterfactual Learning to Rank

    Authors: Rolf Jagerman, Maarten de Rijke

    Abstract: Counterfactual Learning to Rank (LTR) algorithms learn a ranking model from logged user interactions, often collected using a production system. Employing such an offline learning approach has many benefits compared to an online one, but it is challenging as user feedback often contains high levels of bias. Unbiased LTR uses Inverse Propensity Scoring (IPS) to enable unbiased learning from logged… ▽ More

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: SIGIR 2020 full conference paper

  10. arXiv:2002.00467  [pdf, other

    cs.IR cs.LG

    Safe Exploration for Optimizing Contextual Bandits

    Authors: Rolf Jagerman, Ilya Markov, Maarten de Rijke

    Abstract: Contextual bandit problems are a natural fit for many information retrieval tasks, such as learning to rank, text classification, recommendation, etc. However, existing learning methods for contextual bandit problems have one of two drawbacks: they either do not explore the space of all possible document rankings (i.e., actions) and, thus, may miss the optimal ranking, or they present suboptimal r… ▽ More

    Submitted 2 February, 2020; originally announced February 2020.

    Comments: 23 pages, 3 figures

  11. arXiv:1907.07260  [pdf, ps, other

    cs.IR

    Unbiased Learning to Rank: Counterfactual and Online Approaches

    Authors: Harrie Oosterhuis, Rolf Jagerman, Maarten de Rijke

    Abstract: This tutorial covers and contrasts the two main methodologies in unbiased Learning to Rank (LTR): Counterfactual LTR and Online LTR. There has long been an interest in LTR from user interactions, however, this form of implicit feedback is very biased. In recent years, unbiased LTR methods have been introduced to remove the effect of different types of bias caused by user-behavior in search. For in… ▽ More

    Submitted 16 July, 2019; originally announced July 2019.

    Comments: Abstract for tutorial appearing at SIGIR 2019

  12. To Model or to Intervene: A Comparison of Counterfactual and Online Learning to Rank from User Interactions

    Authors: Rolf Jagerman, Harrie Oosterhuis, Maarten de Rijke

    Abstract: Learning to Rank (LTR) from user interactions is challenging as user feedback often contains high levels of bias and noise. At the moment, two methodologies for dealing with bias prevail in the field of LTR: counterfactual methods that learn from historical data and model user behavior to deal with biases; and online methods that perform interventions to deal with bias but use no explicit user mod… ▽ More

    Submitted 15 July, 2019; originally announced July 2019.

    Comments: SIGIR 2019

  13. arXiv:1707.07493  [pdf, other

    cs.IR cs.NE stat.ML

    Modeling Label Ambiguity for Neural List-Wise Learning to Rank

    Authors: Rolf Jagerman, Julia Kiseleva, Maarten de Rijke

    Abstract: List-wise learning to rank methods are considered to be the state-of-the-art. One of the major problems with these methods is that the ambiguous nature of relevance labels in learning to rank data is ignored. Ambiguity of relevance labels refers to the phenomenon that multiple documents may be assigned the same relevance label for a given query, so that no preference order should be learned for th… ▽ More

    Submitted 24 July, 2017; originally announced July 2017.

  14. arXiv:1605.07422  [pdf, other

    cs.DC cs.IR cs.LG stat.ML

    Computing Web-scale Topic Models using an Asynchronous Parameter Server

    Authors: Rolf Jagerman, Carsten Eickhoff, Maarten de Rijke

    Abstract: Topic models such as Latent Dirichlet Allocation (LDA) have been widely used in information retrieval for tasks ranging from smoothing and feedback methods to tools for exploratory search and discovery. However, classical methods for inferring topic models do not scale up to the massive size of today's publicly available Web-scale data sets. The state-of-the-art approaches rely on custom strategie… ▽ More

    Submitted 18 June, 2017; v1 submitted 24 May, 2016; originally announced May 2016.

    Comments: To appear in SIGIR 2017

  15. arXiv:1511.03464  [pdf, other

    cs.CV

    A Directional Diffusion Algorithm for Inpainting

    Authors: Jan Deriu, Rolf Jagerman, Kai-En Tsay

    Abstract: The problem of inpainting involves reconstructing the missing areas of an image. Inpainting has many applications, such as reconstructing old damaged photographs or removing obfuscations from images. In this paper we present the directional diffusion algorithm for inpainting. Typical diffusion algorithms are bad at propagating edges from the image into the unknown masked regions. The directional d… ▽ More

    Submitted 11 November, 2015; originally announced November 2015.

  16. arXiv:1404.4818  [pdf, other

    cs.CY cs.CR cs.NI

    The fifteen year struggle of decentralizing privacy-enhancing technology

    Authors: Rolf Jagerman, Wendo Sabée, Laurens Versluis, Martijn de Vos, Johan Pouwelse

    Abstract: Ever since the introduction of the internet, it has been void of any privacy. The majority of internet traffic currently is and always has been unencrypted. A number of anonymous communication overlay networks exist whose aim it is to provide privacy to its users. However, due to the nature of the internet, there is major difficulty in getting these networks to become both decentralized and anonym… ▽ More

    Submitted 18 April, 2014; originally announced April 2014.