Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
The official gpt4free repository | various collection of powerful language models | o4, o3 and deepseek r1, gpt-4.1, gemini 2.5
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Official Code for DragGAN (SIGGRAPH 2023)
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Open-Sora: Democratizing Efficient Video Production for All
A generative world for general-purpose robotics & embodied AI learning.
Generative Models by Stability AI
FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Official inference repo for FLUX.1 models
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
WebUI extension for ControlNet
Stable Diffusion with Core ML on Apple Silicon
Avatars for Zoom, Skype and other video-conferencing apps.
Lets make video diffusion practical!
Wan: Open and Advanced Large-Scale Video Generative Models
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.