-
AI-Research-SKILLs Public
Forked from Orchestra-Research/AI-Research-SKILLsComprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…
TeX MIT License UpdatedMar 16, 2026 -
superpowers Public
Forked from obra/superpowersAn agentic skills framework & software development methodology that works.
Shell MIT License UpdatedMar 16, 2026 -
funasr-api Public
Forked from Quantatirsk/funasr-apiSpeech recognition API service powered by FunASR and Qwen-ASR, supporting 52 languages, compatible with OpenAI API and Alibaba Cloud Speech API. 基于 FunASR 与 Qwen3-ASR 的语音识别 API 服务,支持 52 种语言,兼容 Open…
Python UpdatedFeb 2, 2026 -
FlashFunAsr Public
Forked from lovemefan/FlashFunAsrFlashFunAsr: A lightweight vLLM implementation built from scratch for FunASR nano
Python Apache License 2.0 UpdatedJan 5, 2026 -
pyloudnorm Public
Forked from csteinmetz1/pyloudnormFlexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
Python MIT License UpdatedJan 4, 2026 -
loudness Public
Forked from iver56/loudnessThe world's fastest Python package for calculating integrated loudness (LUFS) from audio data as NumPy arrays
C++ MIT License UpdatedDec 26, 2025 -
Fun-ASR-Nano-2512-Deploy Public
Forked from fengin/Fun-ASR-Nano-2512-DeployFun-ASR-Nano-2512官方发布的仓库内容有点多,部署起来坑也比较多,本项目提供一个简化的部署方案。
Python UpdatedDec 26, 2025 -
error-align Public
Forked from corticph/error-alignText-to-text alignment algorithm for speech recognition error analysis.
Python UpdatedDec 15, 2025 -
-
-
GLM-ASR Public
Forked from zai-org/GLM-ASRGLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters
Python Apache License 2.0 UpdatedDec 10, 2025 -
livekit Public
Forked from livekit/livekitEnd-to-end realtime stack for connecting humans and AI
Go Apache License 2.0 UpdatedDec 9, 2025 -
tiny-audio Public
Forked from alexkroman/tiny-audioTrain your own speech AI model from scratch
Python UpdatedDec 9, 2025 -
minimind Public
Forked from jingyaogong/minimind🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
Python Apache License 2.0 UpdatedNov 27, 2025 -
DiariZen Public
Forked from BUTSpeechFIT/DiariZenA toolkit for speaker diarization.
Jupyter Notebook MIT License UpdatedNov 19, 2025 -
flashlight Public
Forked from flashlight/flashlightA C++ standalone library for machine learning
C++ MIT License UpdatedNov 12, 2025 -
annotated_deep_learning_paper_implementations Public
Forked from labmlai/annotated_deep_learning_paper_implementations🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Python MIT License UpdatedNov 11, 2025 -
langchain Public
Forked from langchain-ai/langchain🦜🔗 The platform for reliable agents.
Python MIT License UpdatedNov 10, 2025 -
flashinfer Public
Forked from flashinfer-ai/flashinferFlashInfer: Kernel Library for LLM Serving
Cuda Apache License 2.0 UpdatedOct 29, 2025 -
audiomentations Public
Forked from iver56/audiomentationsA Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.
Python MIT License UpdatedSep 26, 2025 -
audiolab Public
Forked from pengzhendong/audiolabAn audio reader & writer built on top of PyAV
Python Apache License 2.0 UpdatedSep 26, 2025 -
CarelessWhisper-Streaming Public
Forked from tomer9080/WhisperRT-StreamingCausal streaming adaptation of OpenAI Whisper for real-time transcription on small audio chunks.
-
livekit-plugins-fireredchat-pvad Public
Forked from fireredchat-submodules/livekit-plugins-fireredchat-pvadFireRedChat pVAD plugin for LiveKit Agents
Python Apache License 2.0 UpdatedSep 16, 2025 -
VoiceBench Public
Forked from MatthewCYM/VoiceBenchVoiceBench: Benchmarking LLM-Based Voice Assistants
Python Apache License 2.0 UpdatedAug 22, 2025 -
TouchNet Public
Forked from xingchensong/TouchNetA native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp.
Python Apache License 2.0 UpdatedAug 6, 2025 -
ContextASR-Bench Public
Forked from MrSupW/ContextASR-BenchA Massive Contextual Speech Recognition Benchmark.
Python MIT License UpdatedJul 9, 2025 -
GigaSpeech2 Public
Forked from SpeechColab/GigaSpeech2An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement
Python Apache License 2.0 UpdatedJun 28, 2025 -
cuml Public
Forked from rapidsai/cumlcuML - RAPIDS Machine Learning Library
C++ Apache License 2.0 UpdatedJun 17, 2025 -
dasheng-denoiser Public
Forked from xiaomi-research/dasheng-denoiserOfficial PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative Audioencoders
Python Apache License 2.0 UpdatedJun 16, 2025 -
AISHELL-5 Public
Forked from DaiYvhang/AISHELL-5In-car multi-channel speech transcription system of AISHELL-5.
Python Apache License 2.0 UpdatedJun 9, 2025