CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech

Jupyter Notebook 365 41 Updated Aug 14, 2025

SoTA open-source TTS

Python 14,415 1,932 Updated Sep 25, 2025

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,441 782 Updated Nov 5, 2025

Towards Human-Sounding Speech

Python 5,691 481 Updated May 6, 2025

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Python 287 37 Updated May 16, 2025
Python 4,539 362 Updated Jun 12, 2025

eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.

C 5,778 1,134 Updated Oct 27, 2025

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 8,704 962 Updated Nov 5, 2025

https://hf.co/hexgrad/Kokoro-82M

JavaScript 4,714 530 Updated Aug 6, 2025

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 3,589 290 Updated Aug 14, 2025

An open source real-time AI inference engine for seamless scaling

Python 20 4 Updated Jul 2, 2025

High performance UI layout library in C.

C 15,866 612 Updated Oct 23, 2025

TTS support with GGML

C++ 190 23 Updated Oct 5, 2025

Tensor library for machine learning

C++ 13,367 1,374 Updated Nov 4, 2025

GGUF Quantization support for native ComfyUI models

Python 2,715 189 Updated Nov 4, 2025

Diffusion model (SD, Flux, Wan, Qwen Image, ...) inference in pure C/C++

C++ 4,515 438 Updated Nov 3, 2025

A markup-based typesetting system that is powerful and easy to learn.

Rust 47,751 1,296 Updated Nov 4, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 17,121 1,872 Updated Oct 21, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,521 1,988 Updated Nov 3, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 114,252 15,917 Updated Nov 4, 2025

The developer platform for on-demand cloud development environments to create software faster and more securely.

TypeScript 13,519 1,334 Updated Nov 5, 2025

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 155,380 13,536 Updated Nov 5, 2025

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 4,222 485 Updated Apr 15, 2025

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…

Go 36,963 2,922 Updated Nov 4, 2025

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 6,032 628 Updated Aug 10, 2024

High-resolution models for human tasks.

Python 5,198 305 Updated Nov 18, 2024

SOTA Open Source TTS

Python 23,971 1,957 Updated Nov 3, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 52,048 5,702 Updated Sep 10, 2025

Silero Models: pre-trained text-to-speech models made embarrassingly simple

Jupyter Notebook 5,540 347 Updated Oct 31, 2025

A fast, local neural text to speech system

C++ 10,210 850 Updated Aug 26, 2025