A toolkit designed for the CapsBench Caption Evaluation Framework, as introduced in the paper Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models.

Python 3 Updated Jan 19, 2025

QwenLM / Qwen-Image

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 6,513 367 Updated Dec 24, 2025

multimodal-reasoning-lab / Bagel-Zebra-CoT

https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT

Python 110 7 Updated Nov 1, 2025

huggingface / finetrainers

Scalable and memory-optimized training of diffusion models

Python 1,312 143 Updated Jun 4, 2025

ali-vilab / FACM

FACM: Flow-Anchored Consistency Models

Python 134 2 Updated Aug 6, 2025

pfloos / QuAcK

QuAcK: a software for emerging quantum electronic structure methods

Fortran 30 13 Updated Dec 23, 2025

GoatWu / Self-Forcing-Plus

Forked from guandeh17/Self-Forcing

Unofficial extension of Self-Forcing that adds support for I2V and 14B training.

Python 299 20 Updated Sep 29, 2025

CodingMan jianlong-yuan

Stars

linkedin / Liger-Kernel

zhiyuanyou / DeQA-Score

zwx8981 / LIQE

Fzkuji / swat-attention

amorehead / jvp_flash_attention

NVIDIA-NeMo / Automodel

RiseAI-Sys / attention-gym

KellerJordan / Muon

facebookresearch / dinov3

NVIDIA / tilus

linzhiqiu / t2v_metrics

stepfun-ai / NextStep-1

alexfdom / capsbench

QwenLM / Qwen-Image

multimodal-reasoning-lab / Bagel-Zebra-CoT

huggingface / finetrainers

ali-vilab / FACM

pfloos / QuAcK

GoatWu / Self-Forcing-Plus

NVlabs / Sana

hao-ai-lab / FastVideo

ByteDance-Seed / SeedVR

xlite-dev / Awesome-DiT-Inference

ByteDance-Seed / Bagel

NVIDIA-NeMo / NeMo

lllyasviel / FramePack

aigc-apps / VideoX-Fun

tianweiy / CausVid

tianweiy / DMD2

TrajectoryCrafter / TrajectoryCrafter