Skip to content
View qqingzheng's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report qqingzheng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Native Multimodal Models are World Learners

Python 1,130 39 Updated Nov 5, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,312 531 Updated Nov 5, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,255 419 Updated Nov 3, 2025

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,075 372 Updated Nov 5, 2025

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 1,264 42 Updated Jun 12, 2025

(NeurIPS 2025) Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation

Python 49 Updated Oct 14, 2025

Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback

Python 149 2 Updated Oct 28, 2025

DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Python 391 11 Updated Sep 22, 2025
Python 1 Updated Sep 21, 2025

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Python 4,908 664 Updated Sep 26, 2025

Taming Stable Diffusion for Lip Sync!

Python 5,078 818 Updated Jun 20, 2025

​​Unlimited-length talking video generation​​ that supports image-to-video and video-to-video generation

Python 2,948 457 Updated Aug 25, 2025

A collection of paper/projects that trains flow matching model/policies via RL.

286 9 Updated Oct 9, 2025

Fundamentals of Digital Media Technology(04713901) | Peking University ECE Course Materials

C 23 1 Updated Feb 4, 2022

Large language model review prompts

JavaScript 269 27 Updated Oct 24, 2025

Efficient Triton Kernels for LLM Training

Python 5,802 426 Updated Nov 5, 2025

Official repository for the UAE paper, unified-GRPO, and unified-Bench

Python 147 6 Updated Sep 12, 2025

A simple code to implement diffusion algos

Python 4 Updated Sep 8, 2025

Minimal PyTorch implementation of TP, SP, and FSDP

Python 9 2 Updated Oct 10, 2025

High quality training free inpaint for every stable diffusion model. Supports ComfyUI

Python 659 24 Updated Oct 26, 2025

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 2,031 340 Updated Jul 14, 2024

Odysseus: Playground of LLM Sequence Parallelism

Python 78 5 Updated Jun 17, 2024

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 588 68 Updated Oct 14, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,106 1,902 Updated Nov 1, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 5,921 323 Updated Sep 30, 2025

Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP

Python 89 7 Updated Aug 20, 2025

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 38,656 4,650 Updated Aug 19, 2024

Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).

Python 388 10 Updated Aug 26, 2025

GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset

Python 233 5 Updated Aug 15, 2025
Next