Skip to content
View solaoi's full-sized avatar

Highlights

  • Pro

Block or report solaoi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

The swiss army knife of lossless video/audio editing

TypeScript 38,343 1,865 Updated Feb 18, 2026

VOICE → WORDS

Swift 1,103 82 Updated Jan 15, 2026

C inference for Qwen3-ASR 0.6b and 1.7b transcriptions models

C 348 25 Updated Feb 17, 2026

A fast and soft pattern search for trillion-scale corpora.

Python 154 3 Updated Feb 13, 2026

Offline streaming speech-to-text in the browser

JavaScript 23 1 Updated Aug 28, 2025

A Streaming-Native Serving Engine for TTS/STS Models

Python 54 5 Updated Feb 13, 2026

マネーフォワードMeを自動化、保有資産の可視化を行います

TypeScript 195 16 Updated Feb 15, 2026

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,790 479 Updated Feb 16, 2026

A real-time and light-weight software for generation of non-linguistic behaviors (turn-taking, backchannel, and head-nodding) in conversational AIs

Python 80 11 Updated Feb 3, 2026

Zero-copy deserialization framework for Rust

Rust 4,027 217 Updated Feb 10, 2026

Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.

Python 246 22 Updated Mar 7, 2025

VoiceBench: Benchmarking LLM-Based Voice Assistants

Python 331 20 Updated Jan 29, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 208,380 38,327 Updated Feb 18, 2026

🎙️ AI Dictation App - Open Source and Local-first ⚡ Type 3x faster, no keyboard needed. 🆓 Powered by open source models, works offline, fast and accurate.

TypeScript 796 77 Updated Feb 18, 2026

Chrome extension that analyzes tweets on X timeline based on the X algorithm weights

JavaScript 109 9 Updated Feb 1, 2026

Massive open Japanese speech corpus

Python 360 33 Updated Jan 19, 2026
Python 1 Updated Feb 6, 2026

A lightning fast audio upsampler.

Python 723 62 Updated Feb 2, 2026

A free, open source, and extensible speech-to-text application that works completely offline.

Rust 15,414 1,063 Updated Feb 17, 2026

Browser automation CLI for AI agents

TypeScript 14,428 847 Updated Feb 18, 2026

Curated list of design and UI resources from stock photos, web templates, CSS frameworks, UI libraries, tools and much more

64,823 11,977 Updated Jan 31, 2026

Training code for FAcodec presented in NaturalSpeech3

Python 238 21 Updated Aug 26, 2024

Unsupervised Speech Decomposition Via Triple Information Bottleneck

Python 698 96 Updated Oct 23, 2024

Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications

Python 87 9 Updated Dec 20, 2024

Excel to structured JSON (tables, shapes, charts) for LLM/RAG pipelines

Python 118 16 Updated Feb 18, 2026

A lightweight text-to-speech model with zero-shot voice cloning

Python 792 30 Updated Feb 6, 2026

Conversion between Traditional and Simplified Chinese

C++ 9,478 1,042 Updated Jan 27, 2026

Open Audio Watermarking Tool

Python 468 44 Updated Dec 22, 2025

A highly compressive and high-quality neural audio codec for speech models.

Python 253 22 Updated Jan 23, 2026

Hono <-> React Router Adapter

TypeScript 281 16 Updated Mar 25, 2025
Next