Starred repositories
Industry leading face manipulation platform
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Stable Diffusion web UI
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…
The collection of pre-trained, state-of-the-art AI models for ailia SDK
ncnn is a high-performance neural network inference framework optimized for the mobile platform
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
A library for efficient similarity search and clustering of dense vectors.
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Implementation of popular deep learning networks with TensorRT network definition API
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Generative Models by Stability AI
No fortress, purely open ground. OpenManus is Coming.
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
BoxMOT: Pluggable SOTA multi-object tracking modules for segmentation, object detection and pose estimation models
The minimal OpenCV for Android, iOS, ARM Linux, Windows, Linux, macOS, HarmonyOS, WebAssembly, watchOS, tvOS, visionOS
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
A curated list of recent diffusion models for video generation, editing, and various other applications.
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
[CVPR 2024 Highlight] Official repository for paper "SIFU: Side-view Conditioned Implicit Function for Real-world Usable Clothed Human Reconstruction"
Digital Human Resource: 2D/3D/4D Human Modeling, Avatar Generation & Animation, Clothed People Digitalization, Virtual Try-On, and Others.
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
A collection of papers and codes for human pose transfer