Starred repositories
An Open Source Machine Learning Framework for Everyone
Robust Speech Recognition via Large-Scale Weak Supervision
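Chinese and English sensitive words, language detection, domestic and international mobile/landline number location and carrier lookup, gender inference from names, phone number extraction, ID card number extraction, email extraction, Chinese and Japanese name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, job title lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English text segmentation, various Chinese word vectors, company name collection, classical poetry corpus, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …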
Clone a voice in 5 seconds to generate arbitrary speech in real-time
As little as 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
Port of OpenAI's Whisper model in C/C++
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Deezer source separation library including pretrained models.
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters, extended Kalman filters, unscented Kalman filters, particle filte…
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Avatars for Zoom, Skype and other video-conferencing apps.
A technical report on convolution arithmetic in the context of deep learning
StyleGAN - Official TensorFlow Implementation
An open source library for face detection in images. The face detection speed can reach 1000FPS.
cvpr2024/cvpr2023/cvpr2022/cvpr2021/cvpr2020/cvpr2019/cvpr2018/cvpr2017 collection of papers, code, paper walkthroughs, and livestreams, compiled by the 极市 team
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Code for the paper Hybrid Spectrogram and Waveform Source Separation