Skip to content
View wusize's full-sized avatar

Block or report wusize

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"

Python 25 3 Updated Dec 2, 2025

[ICLR 2026] Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Python 417 13 Updated Feb 18, 2026

[ICML 2026] a unified reinforcement learning toolbox for joint RL on language models and diffusion models

Python 89 2 Updated May 26, 2026

An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.

Rust 193,943 109,962 Updated Jun 8, 2026

the official repo for "D-AR: Diffusion via Autoregressive Models"

Python 139 3 Updated Jan 29, 2026

[Preprint] ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer

Python 66 5 Updated Mar 31, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 379,111 79,344 Updated Jun 17, 2026

Implementation of "Hyperspherical Latents Improve Continuous-Token Autoregressive Generation"

Python 102 7 Updated Feb 28, 2026

[ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers

Python 505 30 Updated Dec 6, 2025

Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better

Python 190 18 Updated Apr 7, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,016 4,089 Updated Jun 16, 2026

A PyTorch native platform for training generative AI models

Python 5,440 863 Updated Jun 17, 2026

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

Python 1,277 41 Updated May 11, 2026

Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.

Python 87 4 Updated May 4, 2025

[ICLR 2026] Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Lear…

Python 404 17 Updated May 23, 2026

DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support

Python 766 71 Updated Mar 22, 2024

Open-source SOTA multi-image editing model

Python 871 43 Updated Jan 24, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,173 2,095 Updated Jun 9, 2026

GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset

Python 244 5 Updated Aug 15, 2025

Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"

Jupyter Notebook 194 4 Updated Feb 24, 2026

Official code for ICCV 2025 paper, X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation

Python 90 3 Updated Jun 26, 2025

A framework that allows you to apply Sparse AutoEncoder on any models

Python 54 3 Updated Jul 11, 2025

SigLIP-based Aesthetic Score Predictor

Python 420 10 Updated Dec 18, 2024

Open protocol for communication between AI agents, applications, and humans.

Python 1,013 119 Updated Aug 25, 2025

[NeurIPS 2025] Controllable Human-centric Keyframe Interpolation with Generative Prior

Python 32 Updated Mar 31, 2026

Official Implementation of Paper Transfer between Modalities with MetaQueries

Python 323 14 Updated Oct 12, 2025

DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning Guidance

Python 30 2 Updated Sep 7, 2025
Next