LettuceDetect: A Hallucination Detection Framework for RAG Applications
Abstract
Retrieval-Augmented Generation (RAG) systems remain vulnerable to hallucinated answers despite incorporating external knowledge sources. We present LettuceDetect, a framework that addresses two critical limitations of existing hallucination detection methods: (1) the context-window constraints of traditional encoder-based methods, and (2) the computational inefficiency of LLM-based approaches. Building on ModernBERT's extended context capabilities (up to 8k tokens) and trained on the RAGTruth benchmark dataset, our approach outperforms all previous encoder-based models and most prompt-based models, while being approximately 30 times smaller than the best models. LettuceDetect is a token-classification model that processes context-question-answer triples, allowing unsupported claims to be identified at the token level. Evaluations on the RAGTruth corpus demonstrate an example-level F1 score of 79.22%, a 14.8% improvement over Luna, the previous state-of-the-art encoder-based architecture. Additionally, the system can process 30 to 60 examples per second on a single GPU, making it practical for real-world RAG applications.
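The abstract describes LettuceDetect as a ModernBERT-based token classifier over concatenated context-question-answer triples. The sketch below shows how such token-level detection could be run with the Hugging Face transformers API; the checkpoint name, the input concatenation scheme, and the label convention (label 1 marking unsupported tokens) are assumptions for illustration, not details confirmed by the abstract.

```python
# Minimal sketch of token-level hallucination detection with a ModernBERT-based
# token classifier. Checkpoint name and label mapping are assumptions.
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

MODEL_ID = "KRLabsOrg/lettucedect-base-modernbert-en-v1"  # assumed checkpoint name

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForTokenClassification.from_pretrained(MODEL_ID)
model.eval()

context = "France is a country in Europe. The capital of France is Paris."
question = "What is the capital of France?"
answer = "The capital of France is Lyon."

# Concatenate context, question, and answer into one sequence; the exact
# separator scheme used during training is an assumption here. The 8k-token
# limit follows ModernBERT's extended context window.
text = f"{context}\n{question}\n{answer}"
inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=8192)

with torch.no_grad():
    logits = model(**inputs).logits  # shape: (1, seq_len, num_labels)

# Assume label 1 marks tokens unsupported by the context.
pred = logits.argmax(dim=-1)[0]
flagged_ids = [
    tok_id.item()
    for tok_id, label in zip(inputs["input_ids"][0], pred)
    if label.item() == 1
]
print("Tokens flagged as unsupported:", tokenizer.decode(flagged_ids))
```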
Community
We released LettuceDetect, a lightweight hallucination detection framework for Retrieval-Augmented Generation (RAG) pipelines.
LettuceDetect addresses two critical challenges:
The context-window limits of prior encoder-only models.
The high compute costs associated with LLM-based detectors.
Built on ModernBERT, our encoder-based model is released under the MIT license and comes with ready-to-use Python packages and pretrained models.
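As a quick illustration of the released package, a minimal usage sketch follows. The import path, class name, and call signature (a HallucinationDetector with a predict method returning spans) are assumptions inferred from the announcement rather than verified API details; consult the LettuceDetect repository for the actual interface.

```python
# Hedged sketch of span-level detection with the released Python package.
# Module path, class name, and keyword arguments are assumptions.
from lettucedetect.models.inference import HallucinationDetector  # assumed import path

detector = HallucinationDetector(
    method="transformer",  # assumed option selecting the encoder-based model
    model_path="KRLabsOrg/lettucedect-base-modernbert-en-v1",  # assumed pretrained checkpoint
)

contexts = ["The capital of France is Paris. France's population is about 67 million."]
question = "What is the capital of France, and what is its population?"
answer = "The capital of France is Paris. The population of France is 91 million."

# Assumed call signature returning character spans of unsupported claims.
spans = detector.predict(
    context=contexts, question=question, answer=answer, output_format="spans"
)
print(spans)
```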
The following related papers were recommended by the Semantic Scholar API:
- REFIND: Retrieval-Augmented Factuality Hallucination Detection in Large Language Models (2025)
- FilterRAG: Zero-Shot Informed Retrieval-Augmented Generation to Mitigate Hallucinations in VQA (2025)
- MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models (2025)
- How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the Wild (2025)
- SelfCheckAgent: Zero-Resource Hallucination Detection in Generative Large Language Models (2025)
- Bi'an: A Bilingual Benchmark and Model for Hallucination Detection in Retrieval-Augmented Generation (2025)
- Reducing Hallucinations of Medical Multimodal Large Language Models with Visual Retrieval-Augmented Generation (2025)