Skip to content
View Aoko955's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Aoko955

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2026] Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO

Python 109 Updated Feb 28, 2026

CastleHill: Separable Causal Diffusion / Varitaion Flow Maps for LTX-2 long-form video generation

Python 11 Updated Mar 27, 2026

Official implementation of "OmniForcing: Unleashing Real-time Joint Audio-Visual Generation"[arXiv:2603.11647]. OmniForcing is the first framework to distill bidirectional audio-visual diffusion mo…

Python 132 1 Updated Mar 29, 2026

🧂 Salt: Self-Consistent Distribution Matching with Cache-Aware Training for Fast Video Generation

12 Updated Apr 6, 2026

[Tech Report] Alive: A Unified Audio-Video Generation Model

503 36 Updated Mar 31, 2026

Official inference code for SoulX-LiveAct: Towards Hour-Scale Real-Time Human Animation with Neighbor Forcing and ConvKV Memory

Python 1,197 102 Updated Apr 15, 2026

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory

Python 2,152 229 Updated Mar 30, 2026

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 5,081 343 Updated Apr 14, 2026

Code Implementation of "WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation"

Python 152 2 Updated Apr 15, 2026

Try X-Dub to sync any character in a video with any audio you like | Official repository for "From Inpainting to Editing: Unlocking Robust Mask-Free Visual Dubbing via Generative Bootstrapping"

Python 185 2 Updated Mar 19, 2026

Unified Codebase for Advanced World Models.

Python 670 33 Updated Apr 15, 2026

LeetCode 101:力扣刷题指南

10,028 1,254 Updated Feb 12, 2026
Python 94 2 Updated Mar 24, 2026

Official Pytorch implementation of AvatarForcing: One-Step Streaming Talking Avatars via Local-Future Sliding-Window Denoising

Python 47 2 Updated Mar 29, 2026

Generate high resolution videos with a custom voice and appearance, based on LTX-2/LTX-2.3 + Identity In-Context LoRA

Python 265 28 Updated Mar 24, 2026

Codebase for PrismMirror: Real-Time Human Frontal View Synthesis from a Single Image

9 Updated Mar 16, 2026

Codebase for Flash-VAED: Plug-and-Play VAE Decoders for Efficient Video Generation

15 Updated Feb 19, 2026

[ICLR 2026] LumiTex: Towards High-Fidelity PBR Texture Generation with Illumination Context

Python 56 1 Updated Apr 6, 2026

[ICLR 2025] GI-GS: Global Illumination Decomposition on Gaussian Splatting for Inverse Rendering

Python 121 10 Updated Mar 12, 2026

[CVPR26] RemedyGS: Defend 3D Gaussian Splatting Against Computation Cost Attacks

1 Updated Mar 28, 2026

[ICLR'26] code for paper "Token-level Data Selection for Safe LLM Fine-tuning"

Python 7 1 Updated Apr 4, 2026

SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.

Python 3,291 429 Updated Dec 11, 2025

SoulX-FlashHead: A unified 1.3B-parameter framework designed for high-fidelity, infinite-length, and real-time streaming portrait video generation.

Python 679 56 Updated Apr 15, 2026

SoulX-FlashTalk is the first 14B model to achieve sub-second start-up latency (0.87s) while maintaining a real-time throughput of 32 FPS on an 8xH800 node.

Python 1,189 116 Updated Apr 2, 2026

🏠 [ECCV 2024] The core gsplat component for GaussianImage

Cuda 39 13 Updated Mar 13, 2026

[IEEE TCSVT] Preprocessing Enhanced Image Compression for Machine Vision

Python 17 Updated Mar 23, 2025

🏠[ECCV 2024] GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting

Python 405 28 Updated Aug 26, 2024
Next