A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 14,028 1,455 Updated Dec 19, 2025

FireRedTeam / FireRedASR

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 1,675 153 Updated Sep 22, 2025

slopus / happy

Mobile and Web client for Codex and Claude Code, with realtime voice, encryption and fully featured

TypeScript 5,341 417 Updated Dec 7, 2025

nicolaus625 / CMI-bench

Python 17 1 Updated Jun 24, 2025

mulab-mir / muchomusic

MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.

Jupyter Notebook 43 2 Updated Dec 3, 2024

urinieto / msaf

Music Structure Analysis Framework

Python 535 88 Updated Jul 9, 2025

libAudioFlux / audioFlux

A library for audio and music analysis, feature extraction.

C 3,229 149 Updated May 24, 2024

yizhilll / MERT

Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".

Python 421 27 Updated May 25, 2025

deezer / spleeter

Deezer source separation library including pretrained models.

Python 27,883 3,054 Updated Apr 2, 2025

Nitrogen216 / fxxk-coming-soon

A project to help researchers reproduce research papers using LLMs, addressing the problem of "Coming Soon" repos with no actual code.

6 Updated Aug 11, 2025

skyzh / tiny-llm

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 3,552 242 Updated Dec 18, 2025

hitnology / snoopy

macOS 版本史努比屏幕保护

Objective-C 474 9 Updated Jun 24, 2025

AkojimaSLP / Neural-mask-estimation

Python 44 9 Updated Dec 5, 2019

fgnt / nn-gev

Neural network supported GEV beamformer

Python 212 95 Updated Feb 19, 2018

Irreq / neural-beamformer

Beamformer Skeleton

Python 2 Updated Oct 20, 2023

LCAV / pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,738 475 Updated Dec 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

zhouzhao01

Achievements

Achievements

Block or report zhouzhao01

Stars

xiaomi-research / xares-llm

StellanLi / EchoFree

microsoft / AEC-Challenge

tencent-ailab / SongBloom

m-bain / whisperX

spring-media / DeepPhonemizer

facebookresearch / audiobox-aesthetics

gudgud96 / frechet-audio-distance

declare-lab / jamify

mir-aidj / all-in-one

stanford-cs336 / assignment1-basics

stanford-cs336 / spring2025-lectures

tencent-ailab / MuQ

oraios / serena

modelscope / FunASR