D-OPSD
On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models

🛕 Environment Setup

git clone https://github.com/vvvvvjdy/D-OPSD.git
cd D-OPSD
conda create -n dopsd python=3.12 -y
conda activate dopsd
pip install -r requirements.txt

🍬 Training

1. D-OPSD Z-Image-Turbo LoRA training with self-distilled VLM context (aligned with our paper):

Refer to (z-image-turbo_self-distill-vlm) for training guidance.

2. D-OPSD FLUX2-klein LoRA training with self-distilled editing-branch context, for scenarios that require high ID accuracy:

Refer to (flux2-klein_self-distill-edit) for training guidance and more discussion.

3. D-OPSD FLUX2-klein LoRA training for image editing, with the self-distilled target image as the reference image in the teacher:

Refer to (flux2-klein-edit_self-distill-edit) for training guidance and more discussion.

🍬 Evaluation

Refer to (z-image-turbo_self-distill-vlm-eval) for evaluation guidance with Z-Image-Turbo and more discussion.

🎀 Highlight

D-OPSD is an on-policy self-distillation training framework for diffusion models, especially timestep-distilled ones. Its highlights:

D-OPSD identifies an emergent property of modern text-to-image diffusion models with LLM/VLM encoders and uses this property for the continuous tuning of step-distilled diffusion models.
D-OPSD is a novel on-policy self-distillation framework for diffusion models. By assigning the same model two roles with different contexts, D-OPSD enables supervised tuning on the student's own roll-outs without requiring any external reward function or extra modules.
D-OPSD is validated in different settings. The results show that our method enables the model to learn new concepts, styles, and domain preferences while preserving its original few-step inference capability and previous knowledge.
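
The two-role idea can be illustrated with a toy sketch. This is not the repository's training code: the linear `predict` function, the weight names, the scalar `text` and `ref` contexts, and the squared-error loss are all assumptions made for illustration. It only shows the shape of one update, where a single set of weights produces a roll-out, acts as a teacher with richer context (held constant), and is tuned as a student with plain context.

```python
import random

def predict(w, x, text, ref):
    """Hypothetical stand-in for the diffusion model: linear in its inputs."""
    return w["x"] * x + w["text"] * text + w["ref"] * ref + w["b"]

def self_distill_step(w, text, ref, lr=0.05, rng=None):
    """One toy update; returns (new_weights, loss, rollout, teacher_target)."""
    rng = rng or random.Random(0)
    # On-policy roll-out: the training input is produced by the student itself.
    x = predict(w, rng.gauss(0.0, 1.0), text, 0.0)
    # Teacher role: same weights, richer context, output held constant.
    target = predict(w, x, text, ref)
    # Student role: same weights, plain context (no reference signal).
    err = predict(w, x, text, 0.0) - target
    # Gradient of err**2 through the student pass only; the roll-out and the
    # teacher output are treated as constants (stop-gradient).
    grads = {"x": 2 * err * x, "text": 2 * err * text, "ref": 0.0, "b": 2 * err}
    new_w = {k: w[k] - lr * grads[k] for k in w}
    return new_w, err * err, x, target

w0 = {"x": 0.5, "text": 1.0, "ref": 2.0, "b": 0.0}
w1, loss, x, target = self_distill_step(w0, text=1.0, ref=1.0)
gap_before = abs(predict(w0, x, 1.0, 0.0) - target)
gap_after = abs(predict(w1, x, 1.0, 0.0) - target)
print(f"loss={loss:.3f}  gap before={gap_before:.3f}  gap after={gap_after:.3f}")
```

No reward model or extra network appears in the step: the supervision signal is the same model's own output under a different context. For the actual losses, contexts, and schedules, refer to the training folders listed above.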

In full fine-tuning, D-OPSD adapts the model toward the target domain (anime) while retaining original-domain knowledge and few-step inference capability.

In small customized LoRA training, D-OPSD learns new concepts from only a few image-text pairs while maintaining few-step generation quality and generalizing to unseen prompts.

🌺 Citation

If you find D-OPSD useful, please cite our paper:

@article{jiang2026dopsd,
      title={D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models},
      author={Jiang, Dengyang and Jin, Xin and Liu, Dongyang and Wang, Zanyi and Zheng, Mingzhe and Du, Ruoyi and Yang, Xiangpeng and Wu, Qilong and Li, Zhen and Gao, Peng and Yang, Harry and Hoi, Steven},
      journal={arXiv preprint arXiv:2605.05204},
      year={2026}
}

