Stars
Stable Diffusion web UI
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
A feature-rich command-line audio/video downloader
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
AI agents running research on single-GPU nanochat training automatically
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
The world's simplest facial recognition api for Python and the command line
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
AiLearning:数据分析+机器学习实战+线性代数+PyTorch+NLTK+TF2
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Official Code for DragGAN (SIGGRAPH 2023)
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
OpenMMLab Detection Toolbox and Benchmark
⚡ A Fast, Extensible Progress Bar for Python and CLI
Open-Sora: Democratizing Efficient Video Production for All
State-of-the-art 2D and 3D Face Analysis Project
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.