Skip to main content

Showing 1–7 of 7 results for author: Rau, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.09252  [pdf, other

    cs.CL cs.IR

    Context Embeddings for Efficient Answer Generation in RAG

    Authors: David Rau, Shuai Wang, Hervé Déjean, Stéphane Clinchant

    Abstract: Retrieval-Augmented Generation (RAG) allows overcoming the limited knowledge of LLMs by extending the input with external information. As a consequence, the contextual inputs to the model become much longer which slows down decoding time directly translating to the time a user has to wait for an answer. We address this challenge by presenting COCOM, an effective context compression method, reducin… ▽ More

    Submitted 23 July, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

    Comments: 10 pages

  2. arXiv:2407.01463  [pdf, other

    cs.CL cs.AI

    Retrieval-augmented generation in multilingual settings

    Authors: Nadezhda Chirkova, David Rau, Hervé Déjean, Thibault Formal, Stéphane Clinchant, Vassilina Nikoulina

    Abstract: Retrieval-augmented generation (RAG) has recently emerged as a promising solution for incorporating up-to-date or domain-specific knowledge into large language models (LLMs) and improving LLM factuality, but is predominantly studied in English-only settings. In this work, we consider RAG in the multilingual setting (mRAG), i.e. with user queries and the datastore in 13 languages, and investigate w… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2407.01102  [pdf, other

    cs.CL cs.IR

    BERGEN: A Benchmarking Library for Retrieval-Augmented Generation

    Authors: David Rau, Hervé Déjean, Nadezhda Chirkova, Thibault Formal, Shuai Wang, Vassilina Nikoulina, Stéphane Clinchant

    Abstract: Retrieval-Augmented Generation allows to enhance Large Language Models with external knowledge. In response to the recent popularity of generative LLMs, many RAG approaches have been proposed, which involve an intricate number of different configurations such as evaluation datasets, collections, metrics, retrievers, and LLMs. Inconsistent benchmarking poses a major challenge in comparing approache… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 29 pages

  4. The Role of Complex NLP in Transformers for Text Ranking?

    Authors: David Rau, Jaap Kamps

    Abstract: Even though term-based methods such as BM25 provide strong baselines in ranking, under certain conditions they are dominated by large pre-trained masked language models (MLMs) such as BERT. To date, the source of their effectiveness remains unclear. Is it their ability to truly understand the meaning through modeling syntactic aspects? We answer this by manipulating the input order and position in… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: Proceedings of the 2022 ACM SIGIR International Conference on the Theory of Information Retrieval (ICTIR '22)

  5. arXiv:2204.07233  [pdf, other

    cs.IR cs.AI cs.CL

    How Different are Pre-trained Transformers for Text Ranking?

    Authors: David Rau, Jaap Kamps

    Abstract: In recent years, large pre-trained transformers have led to substantial gains in performance over traditional retrieval models and feedback approaches. However, these results are primarily based on the MS Marco/TREC Deep Learning Track setup, with its very particular setup, and our understanding of why and how these models work better is fragmented at best. We analyze effective BERT-based cross-en… ▽ More

    Submitted 5 April, 2022; originally announced April 2022.

    Comments: ECIR 2022

  6. arXiv:1906.01634  [pdf, other

    cs.CL cs.AI cs.LG

    On the Realization of Compositionality in Neural Networks

    Authors: Joris Baan, Jana Leible, Mitja Nikolaus, David Rau, Dennis Ulmer, Tim Baumgärtner, Dieuwke Hupkes, Elia Bruni

    Abstract: We present a detailed comparison of two types of sequence to sequence models trained to conduct a compositional task. The models are architecturally identical at inference time, but differ in the way that they are trained: our baseline model is trained with a task-success signal only, while the other model receives additional supervision on its attention mechanism (Attentive Guidance), which has s… ▽ More

    Submitted 6 June, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: To appear at BlackboxNLP 2019, ACL

  7. arXiv:1905.01975  [pdf, other

    cs.CL

    Point-less: More Abstractive Summarization with Pointer-Generator Networks

    Authors: Freek Boutkan, Jorn Ranzijn, David Rau, Eelco van der Wel

    Abstract: The Pointer-Generator architecture has shown to be a big improvement for abstractive summarization seq2seq models. However, the summaries produced by this model are largely extractive as over 30% of the generated sentences are copied from the source text. This work proposes a multihead attention mechanism, pointer dropout, and two new loss functions to promote more abstractive summaries while main… ▽ More

    Submitted 18 April, 2019; originally announced May 2019.

    Comments: 7 pages