Saber: An Efficient Sampling with Adaptive Acceleration and Backtracking Enhanced Remasking for Diffusion Language Model

Our work introduces Saber, a training-free sampling algorithm for diffusion language models that enhances code generation by adaptively accelerating inference and incorporating backtracking, thereby improving output quality and speed while narrowing the performance gap with autoregressive models.

Installation environment

conda create -n saber python=3.11
pip install -r requirements.txt

or

uv venv saber --python 3.11
source saber/bin/activate
uv pip install -r requirements.txt

Setup

1. Download Model

Download the LLaDA-8B-Instruct model and place it in the models/ directory. You can download the model from Hugging Face.

2. Prepare Datasets

We provide the HumanEval, MBPP, HumanEval-ET, and MBPP-ET datasets in the data/ directory. For LiveCodeBench, you will need to download it separately and place it in data/livecodebench/.

Evaluation of Saber

Firstly, ensure the model is placed correctly in the models/ directory (the configs use the path ./models/LLaDA-8B-Instruct by default). Secondly, verify the datasets are in the correct path. Finally, execute the following command to evaluate:

python eval.py --config ./configs/humaneval.yaml

If you want to test other methods, change the method in the yaml file. For the humaneval and MBPP datasets, our code will print pass@1 And steps. For the livecodebench dataset, our code will save the generated results, and you need to run the evaluation program yourself

Citation

@article{dong2025saber,
  title={Saber: An Efficient Sampling with Adaptive Acceleration and Backtracking Enhanced Remasking for Diffusion Language Model},
  author={Yihong Dong and Zhaoyu Ma and Xue Jiang and Zhiyuan Fan and Jiaru Qian and Yongmin Li and Jianha Xiao and Zhi Jin and Rongyu Cao and Binhua Li and Fei Huang and Yongbin Li and Ge Li},
  journal={arXiv preprint arXiv:2510.18165},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Dream		Dream
configs		configs
data		data
dataset_utils		dataset_utils
.gitignore		.gitignore
Readme.md		Readme.md
configuration_llada.py		configuration_llada.py
configuration_llada_fast.py		configuration_llada_fast.py
decoding.py		decoding.py
eval.py		eval.py
modeling_llada.py		modeling_llada.py
modeling_llada_fast.py		modeling_llada_fast.py
requirements.txt		requirements.txt
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Saber: An Efficient Sampling with Adaptive Acceleration and Backtracking Enhanced Remasking for Diffusion Language Model

Installation environment

Setup

1. Download Model

2. Prepare Datasets

Evaluation of Saber

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Saber: An Efficient Sampling with Adaptive Acceleration and Backtracking Enhanced Remasking for Diffusion Language Model

Installation environment

Setup

1. Download Model

2. Prepare Datasets

Evaluation of Saber

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages