Skip to content

colaudiolab/AudioSet-R

Repository files navigation

AudioSet-R

Official implementation: "AudioSet-R: A Refined AudioSet with Multi-Stage LLM Label Reannotation"

If you think this repo is useful to you, please give it to us ⭐

📢 News:

August 1, 2025: The AudioSet-R paper was accepted by the ACM MM2025 Dataset Track.

May 31, 2025: We submitted our paper on the AudioSet-R Dataset to the ACM MM2025 Dataset track.

💁 Overview:

image

Illustrates the proposed three-stage relabeling framework for AudioSet.

🎼 Datasets:

The dataset used in this study is AudioSet dataset: Balanced training set (20550 .wav files), Evaluate set (18885 .wav files).

AudioSet official website: AudioSet, Github

We provide the json file of the audioset-R audio label and the json file of the original AudioSet audio label:

👍AudioSet-R: The json file of balanced training set and evaluate set

Original AudioSet: The json file of balanced training set and evaluate set

Example:

image

The detailed analysis for three-round audio content extraction.

🎉 Statistic:

We provide a comparison of the training and fine-tuning performance of AudioSet-R on various supervised and self-supervised audio classification models: image

🏃 Run:

If you want to perform label prediction on other datasets, please follow the steps below:

cd ./pipeline

Please read ./pipeline/README.md

Cites:

If you found this repo is helpful, please consider citing our papers:

@article{sun2025audiosetr,
  title={AudioSet-R: A Refined AudioSet with Multi-Stage LLM Label Reannotation},
  author={Yulin Sun and Qisheng Xu and Yi Su and Qian Zhu and Yong Dou and Xinwang Liu and Kele Xu},
  journal={arXiv preprint arXiv:2508.15429},
  year={2025}
}

About

Official implementation: "AudioSet-R: A Refined AudioSet with Multi-Stage LLM Label Reannotation"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors