AudioSet-R

Official implementation: "AudioSet-R: A Refined AudioSet with Multi-Stage LLM Label Reannotation"

If you find this repo useful, please give it a ⭐

📢 News:

August 1, 2025: The AudioSet-R paper was accepted by the ACM MM2025 Dataset Track.

May 31, 2025: We submitted our paper on the AudioSet-R Dataset to the ACM MM2025 Dataset track.

💁 Overview:

Figure: the proposed three-stage relabeling framework for AudioSet.

🎼 Datasets:

This study uses the AudioSet dataset: the balanced training set (20,550 .wav files) and the evaluation set (18,885 .wav files).
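After downloading the audio, it can help to confirm that a local copy matches the file counts stated above. The following is a minimal sketch, not part of this repo: the directory layout, the split names in `EXPECTED`, and the helpers `count_wavs` and `check_split` are all hypothetical.

```python
from pathlib import Path

# File counts stated in this README; the split names are hypothetical labels.
EXPECTED = {"balanced_train": 20550, "eval": 18885}


def count_wavs(root):
    """Count .wav files anywhere under root (the pattern is case-sensitive on most systems)."""
    return sum(1 for p in Path(root).rglob("*.wav") if p.is_file())


def check_split(root, split):
    """Return (found, expected, matches) for one split directory."""
    found = count_wavs(root)
    expected = EXPECTED[split]
    return found, expected, found == expected
```

For example, `check_split("path/to/eval_wavs", "eval")` returns the number of files found, the expected 18885, and whether the two agree; the path here is a placeholder.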

AudioSet official website: AudioSet, Github

We provide JSON files containing the AudioSet-R labels and the original AudioSet labels:

👍 AudioSet-R: JSON files for the balanced training set (AudioSet-R_train.json) and the evaluation set (AudioSet-R_eval.json)

Original AudioSet: JSON files for the balanced training set (balanced_train.json) and the evaluation set (evaluate_set.json)
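The refined and original label files can be compared clip by clip to see which labels changed. The sketch below is illustrative only: it assumes each JSON file parses into a mapping from clip ID to a list of label strings, which may not match the actual schema of the released files, and the helpers `load_labels` and `compare_labels` are hypothetical names. Adapt the loading step to the real structure before use.

```python
import json
from pathlib import Path


def load_labels(path):
    """Load a label file. ASSUMPTION: the JSON is a dict of clip ID -> list of labels."""
    with Path(path).open(encoding="utf-8") as f:
        return json.load(f)


def compare_labels(original, refined):
    """Return label differences for clips that appear in both mappings."""
    diffs = {}
    for clip_id in original.keys() & refined.keys():
        old, new = set(original[clip_id]), set(refined[clip_id])
        if old != new:
            diffs[clip_id] = {
                "removed": sorted(old - new),  # labels dropped by relabeling
                "added": sorted(new - old),    # labels introduced by relabeling
            }
    return diffs


# Toy data standing in for the real files (clip IDs and labels are made up).
original = {"clip_a": ["Speech", "Music"], "clip_b": ["Dog"]}
refined = {"clip_a": ["Speech"], "clip_b": ["Dog"]}
diffs = compare_labels(original, refined)
print(diffs)
```

Only clips whose label sets differ appear in the result, so the size of `diffs` gives a quick count of relabeled clips.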

Example:

Figure: a detailed analysis of the three-round audio content extraction.

🎉 Statistics:

We provide a comparison of the training and fine-tuning performance of AudioSet-R on various supervised and self-supervised audio classification models:

🏃 Run:

If you want to perform label prediction on other datasets, please follow the steps below:

cd ./pipeline

Please read ./pipeline/README.md

Citation:

If you find this repo helpful, please consider citing our paper:

@article{sun2025audiosetr,
  title={AudioSet-R: A Refined AudioSet with Multi-Stage LLM Label Reannotation},
  author={Yulin Sun and Qisheng Xu and Yi Su and Qian Zhu and Yong Dou and Xinwang Liu and Kele Xu},
  journal={arXiv preprint arXiv:2508.15429},
  year={2025}
}

