Skip to content
/ MSRS Public

Data and Code for COLM 2025 Paper "MSRS: Evaluating Multi-Source Retrieval-Augmented Generation"

License

Notifications You must be signed in to change notification settings

yale-nlp/MSRS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MSRS: Evaluating Multi-Source Retrieval-Augmented Generation

Overview

This paper introduces a scalable framework for constructing evaluation benchmarks that challenge RAG systems to integrate information across distinct sources and generate long-form responses. Using our framework, we build two new benchmarks on Multi-Source Retrieval and Synthesis: MSRS-Story and MSRS-Meet.

image-20230930180329172

Dataset Statistics

image-20230930180308855

Repository Structure

The datasets for MSRS-Story and MSRS-Meet are provided in the data directory.

The retrieval code and the settings created by each retrieval model, which serve as inputs for summarization, are located in the code/retrieval directory.

The summarization code is included in code/summarization.

The evaluation code, along with the generated summaries and their corresponding evaluation results (e.g., ROUGE-2, G-Eval), are located in the code/evaluation directory.

Quickstart

1. Setup

Install the required packages using Python version >=3.9.

pip install -r requirements.txt

2. Run

Examples for running the retrieval, summarization, and evaluation scripts are provided in usage.sh files alongside the scripts.

Experimental Results

Retrieval Peformance for MSRS-Story

image-20230930180329172

Retrieval Peformance for MSRS-Meet

image-20230930180329172

Summarization Performance for MSRS-Story

image-20230930180329172

Summarization Performance for MSRS-Meet

image-20230930180329172

Oracle Summarization Performance for Reasoning Models

image-20230930180329172

Citation

If you find our work helpful, please consider citing it:

@inproceedings{
    phanse2025msrs,
    title={{MSRS}: Evaluating Multi-Source Retrieval-Augmented Generation},
    author={Rohan Phanse and Yijie Zhou and Kejian Shi and Wencai Zhang and Yixin Liu and Yilun Zhao and Arman Cohan},
    booktitle={Second Conference on Language Modeling},
    year={2025},
    url={https://openreview.net/forum?id=KtGsJm8bOC}
}

About

Data and Code for COLM 2025 Paper "MSRS: Evaluating Multi-Source Retrieval-Augmented Generation"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published