lxa9867

Xiang Li lxa9867

xAI | ex-Google Deepmind, CMU | Multimodal Understanding & Generation

93 followers · 16 following

xAI
Bellevue, WA
https://lxa9867.github.io/

Achievements

Stars

wenet-e2e / wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 1,192 178 Updated Jan 20, 2026

OpenVE-Team / OpenVE-3M

OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing

Python 36 Updated Jan 9, 2026

ByteVisionLab / NextFlow

NextFlow🚀: Unified Sequential Modeling Activates Multimodal Understanding and Generation

306 15 Updated Jan 9, 2026

Tencent-Hunyuan / HY-WorldPlay

HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency

Python 1,112 90 Updated Jan 13, 2026

yuemingPAN / SFD

Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion

Python 300 3 Updated Dec 21, 2025

EzioBy / Ditto

[Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Python 565 47 Updated Oct 29, 2025

baaivision / Emu3.5

Native Multimodal Models are World Learners

Python 1,451 56 Updated Dec 30, 2025

zhenye234 / X-Codec-2.0

Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 349 48 Updated Jul 21, 2025

AvaLovelace1 / BrickGPT

Official repository for BrickGPT, the first approach for generating physically stable toy brick models from text prompts.

Python 1,594 98 Updated Feb 7, 2026

bytetriper / RAE

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,753 65 Updated Jan 20, 2026

zelaki / ReDi

[NeurIPS'25 Spotlight] Boosting Generative Image Modeling via Joint Image-Feature Synthesis

Python 111 5 Updated Nov 3, 2025

qiuk2 / RobusTok

Image Tokenizer Needs Post-Training

Python 24 2 Updated Oct 4, 2025

Yikai-Wang / nvg

[ICLR 2026] Code for our paper "Next Visual Granularity Generation".

Python 49 1 Updated Jan 26, 2026

FoundationVision / Waver

Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.

881 106 Updated Aug 27, 2025

Kai-46 / KnapFormer

Python 126 5 Updated Aug 10, 2025

stepfun-ai / NextStep-1

[🚀 ICLR 2026 Oral]NextStep-1: SOTA Autogressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s Multimodal Intelligence team.

Python 602 18 Updated Dec 25, 2025

QwenLM / Qwen-Image

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,241 422 Updated Dec 31, 2025

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,085 1,688 Updated Dec 17, 2025

ali-vilab / TTS-VAR

Test-time Scaling for VAR models

Python 31 4 Updated Sep 19, 2025

Jiawei-Yang / DeTok

Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"

Jupyter Notebook 172 4 Updated Dec 17, 2025

guandeh17 / Self-Forcing

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 3,122 238 Updated Sep 12, 2025

lxtGH / DenseWorld-1M

Code and dataset link for "DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World"

122 2 Updated Oct 2, 2025

JIA-Lab-research / VisionThink

[NeurIPS 2025] Efficient Reasoning Vision Language Models

Python 448 29 Updated Sep 18, 2025

dc-ai-projects / DC-AR

Python 81 Updated Oct 18, 2025

HW-whistleblower / True-Story-of-Pangu

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,389 1,341 Updated Jul 9, 2025

camlab-ethz / GAOT

[NeurIPS 2025] Geometry Aware Operator Transformer As An Efficient And Accurate Neural Surrogate For PDEs On Arbitrary Domains

Python 73 19 Updated Oct 23, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 17,306 2,477 Updated Feb 7, 2026

XueZeyue / DanceGRPO

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 1,501 75 Updated Oct 16, 2025

ByteVisionLab / DetailFlow

🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"

Python 165 8 Updated Jul 10, 2025

Paper2Poster / Paper2Poster

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 3,077 208 Updated Dec 21, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Xiang Li lxa9867

Achievements

Achievements

Block or report lxa9867

Stars

wenet-e2e / wespeaker

OpenVE-Team / OpenVE-3M

ByteVisionLab / NextFlow

Tencent-Hunyuan / HY-WorldPlay

yuemingPAN / SFD

EzioBy / Ditto

baaivision / Emu3.5

zhenye234 / X-Codec-2.0

AvaLovelace1 / BrickGPT

bytetriper / RAE

zelaki / ReDi

qiuk2 / RobusTok

Yikai-Wang / nvg

FoundationVision / Waver

Kai-46 / KnapFormer

stepfun-ai / NextStep-1

QwenLM / Qwen-Image

Wan-Video / Wan2.2

ali-vilab / TTS-VAR

Jiawei-Yang / DeTok

guandeh17 / Self-Forcing

lxtGH / DenseWorld-1M

JIA-Lab-research / VisionThink

dc-ai-projects / DC-AR

HW-whistleblower / True-Story-of-Pangu

camlab-ethz / GAOT

huggingface / trl

XueZeyue / DanceGRPO

ByteVisionLab / DetailFlow

Paper2Poster / Paper2Poster