This paper introduces a scalable framework for constructing evaluation benchmarks that challenge RAG systems to integrate information across distinct sources and generate long-form responses. Using our framework, we build two new benchmarks for Multi-Source Retrieval and Synthesis (MSRS): MSRS-Story and MSRS-Meet.
The datasets for MSRS-Story and MSRS-Meet are provided in the data directory.
The retrieval code and the settings created by each retrieval model, which serve as inputs for summarization, are located in the code/retrieval directory.
The summarization code is included in code/summarization.
The evaluation code is located in the code/evaluation directory, along with the generated summaries and their corresponding evaluation results (e.g., ROUGE-2, G-Eval).
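Putting these together, the repository layout is roughly as follows (only the directories mentioned above are shown; the file names inside each directory may differ):

data/                  # MSRS-Story and MSRS-Meet datasets
code/
  retrieval/           # retrieval scripts and the retrieval settings used as summarization inputs
  summarization/       # summarization scripts
  evaluation/          # evaluation scripts, generated summaries, and metric results
requirements.txt       # Python dependencies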
Install the required packages (Python >= 3.9 is required):
pip install -r requirements.txt
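For example, one way to set up an isolated environment before installing (the python3 executable name and the activation command are assumptions that may differ on your system):

python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt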
Examples for running the retrieval, summarization, and evaluation scripts are provided in usage.sh files alongside the scripts.
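A rough end-to-end pass might look like the following, assuming each usage.sh can be run directly from its directory; if not, copy the relevant commands out of the corresponding usage.sh instead:

cd code/retrieval && bash usage.sh       # build the retrieval settings used as summarization inputs
cd ../summarization && bash usage.sh     # generate summaries from the retrieved inputs
cd ../evaluation && bash usage.sh        # score the summaries (e.g., ROUGE-2, G-Eval)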
Retrieval Performance for MSRS-Story
Retrieval Performance for MSRS-Meet
Summarization Performance for MSRS-Story
Summarization Performance for MSRS-Meet
Oracle Summarization Performance for Reasoning Models
If you find our work helpful, please consider citing it:
@inproceedings{phanse2025msrs,
  title={{MSRS}: Evaluating Multi-Source Retrieval-Augmented Generation},
  author={Rohan Phanse and Yijie Zhou and Kejian Shi and Wencai Zhang and Yixin Liu and Yilun Zhao and Arman Cohan},
  booktitle={Second Conference on Language Modeling},
  year={2025},
  url={https://openreview.net/forum?id=KtGsJm8bOC}
}