Stars
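Server edition of 阅读3 (Reading 3), with a desktop client; usable on iOS. Backend: Kotlin + Spring Boot + Vert.x + Coroutine; frontend: Vue.js + Element. Please give it a star and follow the WeChat official account 【假装大佬】❗️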
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
A PyTorch implementation of Maximum Mean Discrepancy (MMD) loss
🧠 A PyTorch implementation of 'Deep CORAL: Correlation Alignment for Deep Domain Adaptation.', ECCV 2016
Code for the paper "Minimal-Entropy Correlation Alignment for Unsupervised Deep Domain Adaptation", ICLR 2018
Voice Conversion by CycleGAN (voice cloning / voice conversion): CycleGAN-VC3
Voice Conversion by CycleGAN (voice cloning / voice conversion): CycleGAN-VC2
A PyTorch implementation of StarGAN-VC2
Neighborhood Attention Transformer with Progressive Channel Fusion for Speaker Verification
Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.
BELLE: Be Everyone's Large Language model Engine (open-source Chinese conversational large language model)
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…
ISO 639 language codes with names in English, French & German, provided in JSON & CSV or as a NodeJS module.
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…
dockerd as a compliant Container Runtime Interface for Kubernetes
The repo provides information about KeSpeech dataset.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Robust Speech Recognition via Large-Scale Weak Supervision
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
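Chinese and English sensitive words, language detection, Chinese and foreign mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, job title lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English segmentation, various Chinese word vectors, comprehensive company name list, classical poetry database, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …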
Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.
GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…
Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.
MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, …