Just 1 minute of voice data can be used to train a good TTS model (few-shot voice cloning).

Python 58,683 6,417 Updated Apr 30, 2026

A GUI tool for creating training data for original AI voices, with no recording required

Python 148 16 Updated Jun 10, 2026

A local markdown preview server. npx mdts — and you're done.

TypeScript 204 16 Updated Jun 11, 2026
Python 1,425 85 Updated Jan 29, 2026

JGLUE: Japanese General Language Understanding Evaluation

Python 344 20 Updated Mar 31, 2025

Complete set of sample code for a book explaining local LLMs

Python 24 2 Updated Jun 12, 2026

Long-form streaming TTS system for multi-speaker dialogue generation

Python 1,404 124 Updated Oct 26, 2025

OneShot Learning-based hotword detection.

Jupyter Notebook 314 50 Updated Feb 11, 2026

Voice activity detector (VAD) for the browser

TypeScript 9 2 Updated Jan 12, 2025

Codename's rvc fork version 3, based on Applio.

Python 38 4 Updated Aug 2, 2025

Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.

Python 1,295 206 Updated Dec 7, 2025
Python 500 63 Updated Mar 7, 2025

An app for adding entries to the OpenJTalk user dictionary through a GUI

Python 3 Updated Oct 8, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 9,326 785 Updated Mar 26, 2026

zero-shot voice conversion & singing voice conversion, with real-time support

Python 3,810 495 Updated Apr 20, 2025

Versatile Evaluation of Speech and Audio

Python 415 48 Updated May 29, 2026

Voice conversion VST

C++ 75 9 Updated May 16, 2026

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 6,288 691 Updated Aug 10, 2024

Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.

Python 8,570 780 Updated Jun 9, 2026

Python interface to the WebRTC Voice Activity Detector

C 2,488 430 Updated Jul 4, 2024

The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.

Python 36,271 1,219 Updated May 27, 2026

Library for building powerful interactive command line applications in Python

Python 10,495 793 Updated May 14, 2026

a lightweight voice conversion

Python 86 13 Updated Feb 25, 2026

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Python 154 23 Updated Oct 16, 2023

vits2 backbone with multilingual-bert

Python 8,764 1,289 Updated Jun 8, 2026

Faster Whisper transcription with CTranslate2

Python 23,630 1,935 Updated Nov 19, 2025
Python 428 44 Updated Nov 6, 2023