Starred repositories
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…
The official GitHub page for the survey paper "A Survey of Large Language Models".
No fortress, purely open ground. OpenManus is Coming.
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
This repo contains the code for 1D tokenizer and generator
Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
[INTERSPEECH'24] Official repository for "MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset"
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
VideoSys: An easy and efficient system for video generation
Code for SCIS-2025 Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
[NeurIPS 2024] Boosting the performance of consistency models with PCM!
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
[CVPR 2024] The official repo for FlashAvatar
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
[ECCV 2024] codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Open-Sora: Democratizing Efficient Video Production for All
[CVPR 2024 Highlight] Official repository for paper "SIFU: Side-view Conditioned Implicit Function for Real-world Usable Clothed Human Reconstruction"
TripoSR: Fast 3D Object Reconstruction from a Single Image