Skip to content
View Aoko955's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Aoko955

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🚀 Self-hosted open-source WebRTC video conferencing platform built on peer-to-peer (P2P) architecture for fast, secure real-time communication with end-to-end privacy.

JavaScript 4,475 722 Updated Apr 30, 2026

[SIGGRAPH 2025] LAM: Large Avatar Model for One-shot Animatable Gaussian Head

Python 972 91 Updated Apr 30, 2026

Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation"

Python 589 36 Updated Apr 30, 2026

【Accepted by TPAMI】Human Motion Video Generation: A Survey (https://ieeexplore.ieee.org/document/11106267)

325 14 Updated Apr 29, 2026

Unified Codebase for Advanced World Models.

Python 727 38 Updated Apr 28, 2026

A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone

Python 24,480 1,905 Updated Apr 27, 2026

Open source impl of **MV-DUSt3R+ Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds** from Meta Reality Labs. Project page https://mv-dust3rp.github.io/

Python 588 25 Updated Apr 27, 2026

✨ Self-hosted open-source WebRTC cam-to-cam peer-to-peer video calling platform for immersive 1-to-1 real-time communication with end-to-end privacy. Each room is limited to two participants for ma…

JavaScript 506 93 Updated Apr 27, 2026

📡 Self-hosted open-source WebRTC live broadcasting platform for real-time video, audio, and screen streaming to unlimited connected viewers.

JavaScript 201 50 Updated Apr 27, 2026

A curated list of awesome human-human interaction resources.

93 2 Updated Apr 27, 2026

Official implementation of "OmniForcing: Unleashing Real-time Joint Audio-Visual Generation"[arXiv:2603.11647]. OmniForcing is the first framework to distill bidirectional audio-visual diffusion mo…

Python 141 1 Updated Apr 27, 2026

Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.

Python 6,225 994 Updated Apr 23, 2026

CastleHill: Separable Causal Diffusion / Varitaion Flow Maps for LTX-2 long-form video generation

Python 11 Updated Apr 23, 2026
Python 64 8 Updated Apr 16, 2026

[SIGGRAPH‘2026] PEAR :Pixel-aligned Expressive humAn mesh Recovery

Python 250 19 Updated Apr 16, 2026

Code Implementation of "WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation"

Python 158 4 Updated Apr 15, 2026

Official inference code for SoulX-LiveAct: Towards Hour-Scale Real-Time Human Animation with Neighbor Forcing and ConvKV Memory

Python 1,221 106 Updated Apr 15, 2026

SoulX-FlashHead: A unified 1.3B-parameter framework designed for high-fidelity, infinite-length, and real-time streaming portrait video generation.

Python 726 65 Updated Apr 15, 2026

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 5,117 345 Updated Apr 14, 2026

Official repository of paper "CoMoVi: Co-Generation of 3D Human Motions and Realistic Videos"

Python 99 Updated Apr 9, 2026

Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"

Python 2,056 236 Updated Apr 8, 2026

🧂 Salt: Self-Consistent Distribution Matching with Cache-Aware Training for Fast Video Generation

16 Updated Apr 6, 2026

[ICLR 2026] LumiTex: Towards High-Fidelity PBR Texture Generation with Illumination Context

Python 58 1 Updated Apr 6, 2026

[ICLR'26] code for paper "Token-level Data Selection for Safe LLM Fine-tuning"

Python 7 1 Updated Apr 4, 2026

SoulX-FlashTalk is the first 14B model to achieve sub-second start-up latency (0.87s) while maintaining a real-time throughput of 32 FPS on an 8xH800 node.

Python 1,238 118 Updated Apr 2, 2026

[Tech Report] Alive: A Unified Audio-Video Generation Model

501 36 Updated Mar 31, 2026

[ACM MM 2024] GS3LAM: Gaussian Semantic Splatting SLAM

Python 89 9 Updated Mar 31, 2026
Next