- Singapore
- yptheangel.github.io
Stars
Clone a voice in 5 seconds to generate arbitrary speech in real-time
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Deezer source separation library including pretrained models.
State-of-the-art 2D and 3D Face Analysis Project
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
Command-line program to download image galleries and collections from several image hosting sites
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
End-to-End Object Detection with Transformers
StyleGAN2 - Official TensorFlow Implementation
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
Real-Time and Accurate Full-Body Multi-Person Pose Estimation & Tracking System
Official TensorFlow implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation (ICLR 2020)
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
Pretrained PyTorch face detection (MTCNN) and facial recognition (InceptionResnet) models
Pre-trained Deep Learning models and demos (high quality and extremely fast)
A Trimap-Free Portrait Matting Solution in Real Time [AAAI 2022]
Model summary in PyTorch similar to `model.summary()` in Keras
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
mean Average Precision - This code evaluates the performance of your neural net for object recognition.
A Python package for Bayesian forecasting with object-oriented design and probabilistic models under the hood.
[CVPR 2020] Interpreting the Latent Space of GANs for Semantic Face Editing
Realtime human head pose estimation with ONNXRuntime and OpenCV.
A generalized framework for prototyping full-stack cooperative driving automation applications under CARLA+SUMO.
My best practices for training on large datasets using PyTorch.
Real-time pose estimation accelerated with NVIDIA TensorRT
SOTA Semantic Segmentation Models in PyTorch
Implementation of Super Resolution CNN in Keras.