Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
🇫🇷 Skills pour agents IA spécialisés dans la bureaucratie française : Comptable, Notaire, ...
Open-source orchestration for zero-human companies
Teams-first Multi-agent orchestration for Claude Code
The repo is finally unlocked. enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.
A lightweight library for normalizing speech transcripts before computing WER
FastAPI to serve Qwen-ASR with streaming support. Tested. Benchmarked. Flash Attention 2. Fast & Stable.
Strava for Claude Code / Codex. CLI to track your AI coding output tokens, spend, and streaks. Compete on a global leaderboard and see what your friends are building. Make every agentic coding sess…
An open-source AI agent that lives in your terminal.
Pure C inference of Mistral Voxtral Realtime 4B speech to text model
This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.
MARS5 speech model (TTS) from CAMB.AI
🔥 The API to search, scrape, and interact with the web for AI
High fidelity neural audio codec for TTS models
On-device voice activity detection (VAD) powered by deep learning
Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.
The baselines of ARC-Challenge-Interspeech2026
Train your own speech AI model from scratch
Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models
ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation
a custom node for separation vocals from music based on Music-Source-Separation-Training
CRYFISH: On deep audio analysis with Large Language Models
The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understanding”.
This project is a Python bot that automates the process of logging into Gmail, joining a Google Meet, recording the audio of the meeting, and then generating a summary, key points, action items, an…
C++ library for audio and music analysis, description and synthesis, including Python bindings