jianlong-yuan
  • Alibaba-DAMO
  • beijing


Efficient Triton Kernels for LLM Training

Python 5,976 455 Updated Dec 24, 2025

[CVPR 2025] Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution

Python 210 4 Updated Dec 16, 2025

[CVPR2023] Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective

Python 233 14 Updated Jan 10, 2025

🚀 Sliding Window Attention Training for Efficient Large Language Models

Python 13 Updated Dec 8, 2025

Flash Attention Triton kernel with support for second-order derivatives

Python 125 11 Updated Dec 21, 2025

PyTorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

Python 214 37 Updated Dec 24, 2025

Triton based sparse quantization attention kernel collection

Python 38 4 Updated Aug 29, 2025

Muon is an optimizer for hidden layers in neural networks

Python 2,123 100 Updated Nov 23, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,017 664 Updated Nov 20, 2025

Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.

Python 433 15 Updated Dec 16, 2025

Evaluating text-to-image/video/3D models with VQAScore

Jupyter Notebook 367 30 Updated Sep 22, 2025
Python 583 16 Updated Dec 24, 2025

A toolkit designed for the CapsBench Caption Evaluation Framework, as introduced in the paper Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models.

Python 3 Updated Jan 19, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 6,513 367 Updated Dec 24, 2025

https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT

Python 110 7 Updated Nov 1, 2025

Scalable and memory-optimized training of diffusion models

Python 1,312 143 Updated Jun 4, 2025

FACM: Flow-Anchored Consistency Models

Python 134 2 Updated Aug 6, 2025

QuAcK: a software for emerging quantum electronic structure methods

Fortran 30 13 Updated Dec 23, 2025

Unofficial extension implementation of Self-Forcing to support I2V && 14B training.

Python 299 20 Updated Sep 29, 2025

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 4,841 321 Updated Dec 21, 2025

A unified inference and post-training framework for accelerated video generation.

Python 2,857 228 Updated Dec 24, 2025

Repo for SeedVR2 & SeedVR (CVPR2025 Highlight)

Python 853 51 Updated Jul 2, 2025

📚 A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc. 🎉

Python 479 24 Updated Nov 28, 2025

Open-source unified multimodal model

Python 5,503 481 Updated Oct 27, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,353 3,243 Updated Dec 24, 2025

Let's make video diffusion practical!

Python 16,393 1,597 Updated Oct 16, 2025

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,731 129 Updated Dec 23, 2025

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,123 63 Updated Aug 7, 2025

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 1,136 54 Updated Mar 5, 2025

[ICCV 2025, Oral] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

Python 808 39 Updated Dec 17, 2025