Aoko955

🎯

Focusing

Aoko955

🎯

Focusing

10 followers · 61 following

Stars

KlingAIResearch / VANS

[CVPR 2026] Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO

Python 109 Updated Feb 28, 2026

johndpope / ltx2-castlehill

CastleHill: Separable Causal Diffusion / Varitaion Flow Maps for LTX-2 long-form video generation

Python 11 Updated Mar 27, 2026

GAIR-NLP / daVinci-MagiHuman

Python 1,882 183 Updated Apr 11, 2026

OmniForcing / OmniForcing

Official implementation of "OmniForcing: Unleashing Real-time Joint Audio-Visual Generation"[arXiv:2603.11647]. OmniForcing is the first framework to distill bidirectional audio-visual diffusion mo…

Python 132 1 Updated Mar 29, 2026

XingtongGe / Salt

🧂 Salt: Self-Consistent Distribution Matching with Cache-Aware Training for Fast Video Generation

12 Updated Apr 6, 2026

FoundationVision / Alive

[Tech Report] Alive: A Unified Audio-Video Generation Model

503 36 Updated Mar 31, 2026

ShandaAI / PackForcing

235 3 Updated Mar 27, 2026

Soul-AILab / SoulX-LiveAct

Official inference code for SoulX-LiveAct: Towards Hour-Scale Real-Time Human Animation with Neighbor Forcing and ConvKV Memory

Python 1,197 102 Updated Apr 15, 2026

SkyworkAI / Matrix-Game

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory

Python 2,152 229 Updated Mar 30, 2026

NVlabs / Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 5,081 343 Updated Apr 14, 2026

cvlab-kaist / WorldCam

Code Implementation of "WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation"

Python 152 2 Updated Apr 15, 2026

KlingAIResearch / X-Dub

Try X-Dub to sync any character in a video with any audio you like | Official repository for "From Inpainting to Editing: Unlocking Robust Mask-Free Visual Dubbing via Generative Bootstrapping"

Python 185 2 Updated Mar 19, 2026

OpenDCAI / OpenWorldLib

Unified Codebase for Advanced World Models.

Python 670 33 Updated Apr 15, 2026

cgao96 / leetcode_101

LeetCode 101：力扣刷题指南

10,028 1,254 Updated Feb 12, 2026

AlanBaade / LatentForcing

Python 94 2 Updated Mar 24, 2026

KlingAIResearch / AvatarForcing

Official Pytorch implementation of AvatarForcing: One-Step Streaming Talking Avatars via Local-Future Sliding-Window Denoising

Python 47 2 Updated Mar 29, 2026

MoonshotAI / Attention-Residuals

3,134 164 Updated Mar 17, 2026

ID-LoRA / ID-LoRA

Generate high resolution videos with a custom voice and appearance, based on LTX-2/LTX-2.3 + Identity In-Context LoRA

Python 265 28 Updated Mar 24, 2026

rslinfy / PrismMirror

Codebase for PrismMirror: Real-Time Human Frontal View Synthesis from a Single Image

9 Updated Mar 16, 2026

Aoko955 / Flash-VAED

Codebase for Flash-VAED: Plug-and-Play VAE Decoders for Efficient Video Generation

15 Updated Feb 19, 2026

LumiTexPBR / LumiTex

[ICLR 2026] LumiTex: Towards High-Fidelity PBR Texture Generation with Illumination Context

Python 56 1 Updated Apr 6, 2026

stopaimme / GI-GS

[ICLR 2025] GI-GS: Global Illumination Decomposition on Gaussian Splatting for Inverse Rendering

Python 121 10 Updated Mar 12, 2026

Polly-LYP / RemedyGS

[CVPR26] RemedyGS: Defend 3D Gaussian Splatting Against Computation Cost Attacks

1 Updated Mar 28, 2026

Polly-LYP / TOSS

[ICLR'26] code for paper "Token-level Data Selection for Safe LLM Fine-tuning"

Python 7 1 Updated Apr 4, 2026

Soul-AILab / SoulX-Podcast

SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.

Python 3,291 429 Updated Dec 11, 2025

Soul-AILab / SoulX-FlashHead

SoulX-FlashHead: A unified 1.3B-parameter framework designed for high-fidelity, infinite-length, and real-time streaming portrait video generation.

Python 679 56 Updated Apr 15, 2026

Soul-AILab / SoulX-FlashTalk

SoulX-FlashTalk is the first 14B model to achieve sub-second start-up latency (0.87s) while maintaining a real-time throughput of 32 FPS on an 8xH800 node.

Python 1,189 116 Updated Apr 2, 2026

XingtongGe / gsplat

🏠 [ECCV 2024] The core gsplat component for GaussianImage

Cuda 39 13 Updated Mar 13, 2026

XingtongGe / PreprocessingICM

[IEEE TCSVT] Preprocessing Enhanced Image Compression for Machine Vision

Python 17 Updated Mar 23, 2025

Xinjie-Q / GaussianImage

🏠[ECCV 2024] GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting

Python 405 28 Updated Aug 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly