Skip to content
View jinjinw's full-sized avatar

Block or report jinjinw

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 2,123 138 Updated Nov 4, 2025

HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation

Python 2,937 150 Updated Feb 3, 2026

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

Python 1,264 42 Updated Feb 24, 2026

[SIGGRAPH'24] CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization

JavaScript 781 60 Updated Apr 11, 2025

Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)

Python 278 16 Updated Dec 5, 2025

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 1,557 76 Updated Oct 16, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,823 1,793 Updated Mar 17, 2026
Python 333 27 Updated Mar 23, 2026

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,692 251 Updated Nov 12, 2025

[NeurIPS 2025 Spotlight] FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities

Python 73 3 Updated Dec 21, 2025

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,940 94 Updated Aug 15, 2024

The author's implementation for the ICML 2024 paper.

Python 6 1 Updated Sep 25, 2024

[CVPR 2025] Implementation of "Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models"

Python 37 Updated Apr 28, 2025

[ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"

Python 87 12 Updated Feb 14, 2025

Official implementation of ICML 2025 Oral 🏆 paper "Orthogonal Subspace Decomposition for Generalizable AI-Generated Image Detection".

Python 202 19 Updated Jul 14, 2025

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"

Python 895 112 Updated Jan 28, 2026

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 5,023 336 Updated Mar 17, 2026

一个第三方哔哩哔哩客户端,A third-party bilibili client。

17,345 535 Updated Feb 28, 2025

MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning

Python 772 31 Updated Sep 7, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,942 658 Updated Mar 23, 2026

[TMLR 2025🔥] A survey for the autoregressive models in vision.

790 23 Updated Nov 8, 2025

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 4,258 320 Updated Jan 5, 2026

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,869 1,214 Updated Nov 21, 2025

Official repository of In-Context LoRA for Diffusion Transformers

2,063 95 Updated Dec 20, 2024

The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"

Python 41 Updated Oct 11, 2024

Mamba SSM architecture

Python 17,683 1,653 Updated Mar 23, 2026

LaVIT: Empower the Large Language Model to Understand and Generate Visual Content

Jupyter Notebook 606 31 Updated Oct 6, 2024

[ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models

Python 96 4 Updated Sep 14, 2024
Python 900 125 Updated Dec 11, 2024

[ECCV 2024] Sparse Beats Dense: Rethinking Supervision in Radar-Camera Depth Completion

Python 31 3 Updated Oct 9, 2024
Next