T2-RAGBench: Text-and-Table Benchmark for Evaluating Retrieval-Augmented Generation (2026)
- Natural Language Processing
- LLM Evaluation
Kutay Isgörür, Jan Strich, Martin Semmann, Maximilian Trescher, Chris Biemann
Since many real-world documents combine textual and tabular data, robust Retrieval Augmented Generation (RAG) systems are essential for effectively accessing and analyzing such content to support complex reasoning tasks. Therefore, this paper introduces T2-RAGBench, a benchmark comprising 23,088 question-context-answer triples, designed to evaluate RAG methods on real-world text-and-table data.