Stars
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Foundational Models for State-of-the-Art Speech and Text Translation
Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
A simple tool for easily using the Montreal Forced Aligner. Also provides alignments (TextGrid) retrieved from ESD.
Official PyTorch implementation of "Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image", ICCV 2019
An end-to-end library for editing and rendering motion of 3D characters with deep learning [SIGGRAPH 2020]
Ultralytics YOLOv5 in PyTorch > ONNX > CoreML > TFLite
Official PyTorch implementation of "Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image", ICCV 2019
Official PyTorch implementation of "Accurate 3D Hand Pose Estimation for Whole-Body 3D Human Mesh Estimation", CVPRW 2022 (Oral)