Skip to content
View hppRC's full-sized avatar
🏠
sleepy
🏠
sleepy

Block or report hppRC

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,630 356 Updated Jun 21, 2025

High-Quality Voice Cloning TTS for 600+ Languages

Python 6,066 882 Updated May 6, 2026
Python 2 Updated Feb 4, 2026

A SOTA Industrial-Grade Voice Activity Detection & Audio Event Detection, supporting 100+ languages, outperforming Silero-VAD, TEN-VAD, FunASR-VAD and WebRTC-VAD

Python 388 26 Updated May 6, 2026

A Git subcommand that makes `git worktree` simple

Go 522 21 Updated May 14, 2026

Zero-copy deserialization framework for Rust

Rust 4,207 227 Updated May 17, 2026

A highly compressive and high-quality neural audio codec for speech models.

Python 267 26 Updated Jan 23, 2026

A Streaming-Native Serving Engine for TTS/STS Models

Python 66 8 Updated May 15, 2026

Lightweight coding agent that runs in your terminal

Rust 83,294 12,071 Updated May 17, 2026

[EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation

Python 152 5 Updated May 18, 2025
Python 303 40 Updated Jul 22, 2025

Repair malformed JSON from LLMs, APIs, logs, and user input in Python.

Python 4,889 195 Updated May 14, 2026

Grounding Image Matching in 3D with MASt3R

Python 2,915 263 Updated Jun 30, 2025

Things you can do with the token embeddings of an LLM

Python 1,451 50 Updated Dec 1, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 2,341 404 Updated May 14, 2026

Helm charts for llm-d

Shell 52 57 Updated Jul 22, 2025

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 3,198 477 Updated May 16, 2026

Fast neural codec compression and generation for audio waveforms

C++ 230 21 Updated Dec 4, 2024

Analyze coding (agent) CLI token usage and costs from local data.

TypeScript 14,296 562 Updated May 17, 2026

dev tools, env vars, task runner

Rust 28,290 1,134 Updated May 17, 2026

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Python 1,147 100 Updated Nov 24, 2025

Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"

Python 158 18 Updated Mar 3, 2026

Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits for the end of the source utterance to start translating--- H…

Rust 1,459 117 Updated Apr 15, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 10,218 952 Updated May 16, 2026

A neural word aligner based on multilingual BERT

Python 375 62 Updated Mar 10, 2022

Python wrapper for OpenJTalk

Cython 249 84 Updated Apr 8, 2025

pyopenjtalk-plus: A Python wrapper for OpenJTalk with additional improvements

Python 57 4 Updated Mar 30, 2026

A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)

Python 1,183 131 Updated Aug 28, 2024

[ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning

Python 1,487 90 Updated Jun 26, 2025

NAISTの入試で提出した小論文

TeX 35 2 Updated Jan 27, 2023
Next