Skip to content
View ayutaz's full-sized avatar

Block or report ayutaz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Burst accelerated mesh simplification.

C# 75 9 Updated Aug 30, 2025

On-device TTS model by Neuphonic

Python 2,221 182 Updated Oct 8, 2025

About Build minimal ONNX Runtime with GitHub Actions

8 Updated Jun 29, 2025

VoiceStar: Robust, Duration-controllable TTS that can Extrapolate

Python 288 24 Updated May 31, 2025
Python 166 24 Updated Sep 19, 2025

PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind

Python 52 4 Updated Sep 22, 2025
Python 3 2 Updated Oct 8, 2025

A Survey of Spoken Dialogue Models (60 pages)

310 17 Updated Nov 28, 2024

A program that automatically transcribes karaoke MIDI scores and lyrics, along with another program to play them

Python 8 1 Updated Sep 10, 2025

Privacy-focused local‑first platform that scales.

TypeScript 1,725 67 Updated Oct 9, 2025

Real-time & local speech-to-text server.

Python 7,626 700 Updated Oct 6, 2025

Frontier Open-Source Text-to-Speech

9,511 1,166 Updated Sep 5, 2025

ncnn android piper the fast and local neural text-to-speech engine

C++ 33 2 Updated Oct 1, 2025

KawaiiPhysics : Simple Bone Physics for UnrealEngine 4 & 5

C++ 2,658 339 Updated Jul 27, 2025

Unity Editor integration with Model Context Protocol (MCP) enabling AI assistants like Claude to interact with Unity projects. Features a TypeScript MCP server and C# Unity plugin with extensible c…

C# 113 6 Updated May 28, 2025

A Tensor Computation Library with JIT Compilation

Rust 3 Updated Oct 9, 2025

A real-time and light-weight software for generation of non-linguistic behaviors (turn-taking, backchannel, and head-nodding) in conversational AIs

Python 55 6 Updated Oct 7, 2025

State-of-the-art TTS model under 25MB 😻

Python 8,864 435 Updated Aug 23, 2025

tsumiki

TypeScript 775 52 Updated Sep 8, 2025

Prominence-Based Segmentation with Clustering for Word Discovery and Lexicon Learning in Python

Python 1 1 Updated May 8, 2025

Browser-based visual editor for building WebGL, WebGPU, WebXR apps

TypeScript 757 98 Updated Oct 9, 2025
HTML 1 Updated May 26, 2025

PyTorch native quantization and sparsity for training and inference

Python 2,394 344 Updated Oct 9, 2025
Python 282 33 Updated Jul 22, 2025

Simplify your onnx model (Support for Python 3.12/3.13 & Linux aarch64 pre-built wheels)

C++ 2 Updated Jul 2, 2025

Fast and local neural text-to-speech engine

C++ 1,145 127 Updated Sep 10, 2025

uLoopMCP enables AI to autonomously compile, test, debug, and manipulate Unity projects. It bridges Unity Editor with AI coding assistants (Claude Code, Cursor, GitHub Copilot, Windsurf) using Mode…

C# 66 5 Updated Oct 2, 2025

PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models

751 59 Updated Sep 16, 2025

Survey of audio language models

Jupyter Notebook 59 3 Updated Jun 21, 2025

Fine-tuning Moshi/J-Moshi on your own spoken dialogue data

Python 71 8 Updated Jul 31, 2025
Next