Highlights
- Pro
Stars
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
A high-throughput and memory-efficient inference and serving engine for LLMs
30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than100 days, follow your own pace. These videos ma…
Fully open reproduction of DeepSeek-R1
deep learning for image processing including classification and object-detection etc.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Fast and memory-efficient exact attention
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
A PyTorch implementation of the Transformer model in "Attention is All You Need".
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
深度学习与PyTorch入门实战视频教程 配套源代码和PPT
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
OpenMMLab Foundational Library for Training Deep Learning Models
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language
Ready-to-use code and tutorial notebooks to boost your way into few-shot learning for image classification.
[ICCV2021] Official PyTorch implementation of Segmenter: Transformer for Semantic Segmentation
(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
A Change Detection Repo Standing on the Shoulders of Giants
A full Python Implementation of the ROUGE Metric (not a wrapper)