Starred repositories
Stable Diffusion web UI
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
No fortress, purely open ground. OpenManus is Coming.
High-Resolution Image Synthesis with Latent Diffusion Models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Open-Sora: Democratizing Efficient Video Production for All
Generative Models by Stability AI
Industry leading face manipulation platform
Image-to-Image Translation in PyTorch
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
WebUI extension for ControlNet
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
The official GitHub page for the survey paper "A Survey of Large Language Models".
Official implementation of AnimateDiff.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…
PyTorch package for the discrete VAE used for DALL·E.
[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/