w-okada

Follow

w-okada

Follow

1.1k followers · 2 following

Achievements

Achievements

Stars

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 58,683 6,417 Updated Apr 30, 2026

reinehonoka / Voice-Design-Cloner

録音不要でオリジナルAI音声の教師データを作るGUIツール

Python 148 16 Updated Jun 10, 2026

unhappychoice / mdts

A local markdown preview server. npx mdts — and you're done.

TypeScript 204 16 Updated Jun 11, 2026

pipecat-ai / smart-turn

Python 1,425 85 Updated Jan 29, 2026

yahoojapan / JGLUE

JGLUE: Japanese General Language Understanding Evaluation

Python 344 20 Updated Mar 31, 2025

kujirahand / book-local-llm-sample

ローカルLLMの解説本のサンプル一式

Python 24 2 Updated Jun 12, 2026

FireRedTeam / FireRedTTS2

Long-form streaming TTS system for multi-speaker dialogue generation

Python 1,404 124 Updated Oct 26, 2025

Ant-Brain / EfficientWord-Net

OneShot Learning-based hotword detection.

Jupyter Notebook 314 50 Updated Feb 11, 2026

ocavue / vad-web

Voice activity detector (VAD) for the browser

TypeScript 9 2 Updated Jan 12, 2025

codename0og / codename-rvc-fork-3

Codename's rvc fork version 3, based on Applio.

Python 38 4 Updated Aug 2, 2025

kinopeee / cursorrules

1,117 60 Updated Dec 17, 2025

litagin02 / Style-Bert-VITS2

Forked from fishaudio/Bert-VITS2

Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.

Python 1,295 206 Updated Dec 7, 2025

sdbds / Zonos-for-windows

Forked from Zyphra/Zonos

Python 500 63 Updated Mar 7, 2025

massao000 / add-dictionary

OpenJTalkのユーザ辞書をGUIで追加するアプリ

Python 3 Updated Oct 8, 2025

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 9,326 785 Updated Mar 26, 2026

Plachtaa / seed-vc

zero-shot voice conversion & singing voice conversion, with real-time support

Python 3,810 495 Updated Apr 20, 2025

wavlab-speech / versa

Versatile Evaluation of Speech and Audio

Python 415 48 Updated May 29, 2026

prj-beatrice / beatrice-vst

声質変換 VST

C++ 75 9 Updated May 16, 2026

yl4579 / StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 6,288 691 Updated Aug 10, 2024

niel-blue / beatrice-trainer-webui

Python 12 1 Updated Jun 4, 2026

FunAudioLLM / SenseVoice

Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.

Python 8,570 780 Updated Jun 9, 2026

JarodMica / beatrice_trainer_webui

Python 47 13 Updated Mar 27, 2026

wiseman / py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

C 2,488 430 Updated Jul 4, 2024

Textualize / textual

The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.

Python 36,271 1,219 Updated May 27, 2026

prompt-toolkit / python-prompt-toolkit

Library for building powerful interactive command line applications in Python

Python 10,495 793 Updated May 14, 2026

uthree / tinyvc

a lightweight voice conversion

Python 86 13 Updated Feb 25, 2026

ConsistencyVC / ConsistencyVC-voive-conversion

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Python 154 23 Updated Oct 16, 2023

fishaudio / Bert-VITS2

vits2 backbone with multilingual-bert

Python 8,764 1,289 Updated Jun 8, 2026

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 23,630 1,935 Updated Nov 19, 2025

KoeAI / LLVC

Python 428 44 Updated Nov 6, 2023