Stars
(CVPR 2023) PyTorch implementation of “T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations”
AI-based tool for removing hard-coded subtitles and text-like watermarks from images and videos. It generates the cleaned image/video files without loss of resolution and runs locally, with no need to apply for a third-party API.
Demonstrations of Loss of Plasticity and Implementation of Continual Backpropagation
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
Dynamic Spatial-Temporal Aggregation for Skeleton-Aware Sign Language Recognition (COLING2024)
Using AI to measure Parkinson’s disease severity at home
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
papers about Face Detection; Face Alignment; Face Recognition && Face Identification && Face Verification && Face Representation; Face Reconstruction; Face Tracking; Face Super-Resolution && Face D…
ydl832 / pytorch-loss
Forked from CoinCheung/pytorch-loss
label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Official implementation of CrossViT. https://arxiv.org/abs/2103.14899
🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥
State-of-the-art 2D and 3D Face Analysis Project
PyTorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"
Reimplementation of Mutual-Channel Loss for Fine-Grained Image Classification.
PyTorch implementation of LS-CNN: Characterizing Local Patches at Multiple Scales for Face Recognition
Pretrained PyTorch face detection (MTCNN) and facial recognition (InceptionResnet) models
PyTorch-based CNN implementation for estimating age from face images
Demo code for "LOHO: Latent Optimization of Hairstyles via Orthogonalization".
PyTorch implementation of multi-task learning architectures, incl. MTI-Net (ECCV2020).
A multi-image stitching algorithm robust to outliers.
Code for ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks
Official PyTorch code for "BAM: Bottleneck Attention Module (BMVC2018)" and "CBAM: Convolutional Block Attention Module (ECCV2018)"
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.