Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks, for both inference and training.
The largest collection of PyTorch image encoders / backbones. Includes train, eval, inference, and export scripts, plus pretrained weights -- ResNet, ResNeXt, EfficientNet, NFNet, Vision Transformer (V…
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
DSPy: The framework for programming—not prompting—language models
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
An LLM agent that conducts deep research (local and web) on any given topic and generates a long report with citations.
A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.
Graph Neural Network Library for PyTorch
DeepSeek Coder: Let the Code Write Itself
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Python package built to ease deep learning on graphs, on top of existing DL frameworks.
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
A Library for Advanced Deep Time Series Models for General Time Series Analysis.
YOLOv3 in PyTorch > ONNX > CoreML > TFLite
Fast and Accurate ML in 3 Lines of Code
The power of Claude Code / GeminiCLI / CodexCLI + [Gemini / OpenAI / OpenRouter / Azure / Grok / Ollama / Custom Model / All Of The Above] working as one.
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
Pretrained ConvNets for PyTorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.
Python client for Baidu Yun / Baidu Netdisk (Personal Cloud Storage)
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoising, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also …
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Official TensorFlow implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation (ICLR 2020)