- xAI
- Bellevue, WA
- https://lxa9867.github.io/
Stars
A generative world for general-purpose robotics & embodied AI learning.
Official inference repo for FLUX.1 models
Train transformer language models with reinforcement learning.
Janus-Series: Unified Multimodal Understanding and Generation Models
Let's make video diffusion practical!
Wan: Open and Advanced Large-Scale Video Generative Models
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
HunyuanVideo: A Systematic Framework for Large Video Generation Models
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Witness the "aha moment" of VLMs for less than $3.
MAGI-1: Autoregressive Video Generation at Scale
[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers
Sky-T1: Train your own O1-preview model for under $450
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
Efficient vision foundation models for high-resolution generation and perception.
Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
[ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer
[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
Official repository for BrickGPT, the first approach for generating physically stable toy brick models from text prompts.
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think