Skip to content
View lxa9867's full-sized avatar

Block or report lxa9867

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
128 stars written in Python
Clear filter

The official Meta Llama 3 GitHub site

Python 29,291 3,528 Updated Jan 26, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 28,458 2,654 Updated Apr 9, 2026

Official inference repo for FLUX.1 models

Python 25,386 1,871 Updated Jul 31, 2025

Train transformer language models with reinforcement learning.

Python 17,986 2,628 Updated Apr 9, 2026

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,697 2,233 Updated Feb 1, 2025

Lets make video diffusion practical!

Python 16,729 1,650 Updated Oct 16, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,770 2,534 Updated Mar 5, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,150 1,839 Updated Mar 17, 2026

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 14,310 2,114 Updated Apr 4, 2026

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,618 1,276 Updated Nov 4, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,940 1,222 Updated Nov 21, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,743 476 Updated Feb 10, 2026

Open-source unified multimodal model

Python 5,791 512 Updated Oct 27, 2025

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 5,118 414 Updated Apr 9, 2026

Witness the aha moment of VLM with less than $3.

Python 4,045 285 Updated May 19, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,673 236 Updated Jun 17, 2025

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 3,597 250 Updated Dec 21, 2025

Sky-T1: Train your own O1 preview model within $450

Python 3,373 343 Updated Jul 12, 2025

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Python 3,309 507 Updated Jul 29, 2024

Efficient vision foundation models for high-resolution generation and perception.

Python 3,279 240 Updated Sep 5, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 3,277 256 Updated Sep 12, 2025

[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 3,178 301 Updated Dec 21, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,942 93 Updated Aug 15, 2024

[ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer

Python 1,910 146 Updated Jul 3, 2025

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,910 90 Updated Jan 8, 2026

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,891 120 Updated Feb 20, 2026

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,846 75 Updated Feb 25, 2026

Official repository for BrickGPT, the first approach for generating physically stable toy brick models from text prompts.

Python 1,628 102 Updated Feb 7, 2026

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,601 85 Updated Mar 16, 2025
Next