-
Zhejiang University, China
- Hangzhou, China
- luohao.site
Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Models and examples built with TensorFlow
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
📄 Awesome CV is LaTeX template for your outstanding job application
The most cited deep learning papers
Official inference repo for FLUX.1 models
Image-to-Image Translation in PyTorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习
LAVIS - A One-stop Library for Language-Vision Intelligence
🐍 Geometric Computer Vision Library for Spatial AI
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
⚡ The Open Research Copilot. Build high-perf Portfolios, Lab Sites & Docs in Markdown + Jupyter. 100% Data Control. 🦫 数据科学家的开源 Copilot。一键部署 👇
Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
Pytorch implementation of convolutional neural network visualization techniques
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO