Skip to content
View zhouzhao01's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report zhouzhao01

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

XARES-LLM

Python 29 1 Updated Dec 19, 2025

AEC Challenge

456 143 Updated Jun 4, 2024

The official code repository for SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement

Python 693 75 Updated Dec 4, 2025

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 19,248 2,049 Updated Oct 21, 2025

Grapheme to phoneme conversion with deep learning.

Python 414 53 Updated Dec 8, 2023

Unified automatic quality assessment for speech, music, and sound.

Python 649 48 Updated Jun 5, 2025

A lightweight library for Frechet Audio Distance calculation.

Python 302 28 Updated Dec 2, 2025

JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment

Python 116 14 Updated Aug 7, 2025

All-In-One Music Structure Analyzer

Python 685 104 Updated May 9, 2024

Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch

Python 1,003 1,324 Updated Aug 29, 2025

Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".

Python 284 12 Updated Aug 4, 2025

A powerful coding agent toolkit providing semantic retrieval and editing capabilities (MCP server & other integrations)

Python 17,380 1,196 Updated Dec 19, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 14,028 1,455 Updated Dec 19, 2025

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 1,675 153 Updated Sep 22, 2025

Mobile and Web client for Codex and Claude Code, with realtime voice, encryption and fully featured

TypeScript 5,341 417 Updated Dec 7, 2025
Python 17 1 Updated Jun 24, 2025

MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.

Jupyter Notebook 43 2 Updated Dec 3, 2024

Music Structure Analysis Framework

Python 535 88 Updated Jul 9, 2025

A library for audio and music analysis, feature extraction.

C 3,229 149 Updated May 24, 2024

Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".

Python 421 27 Updated May 25, 2025

Deezer source separation library including pretrained models.

Python 27,883 3,054 Updated Apr 2, 2025

A project to help researchers reproduce research papers using LLMs, addressing the problem of "Coming Soon" repos with no actual code.

6 Updated Aug 11, 2025

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 3,552 242 Updated Dec 18, 2025

macOS 版本史努比屏幕保护

Objective-C 474 9 Updated Jun 24, 2025

Neural network supported GEV beamformer

Python 212 95 Updated Feb 19, 2018

Beamformer Skeleton

Python 2 Updated Oct 20, 2023

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,738 475 Updated Dec 15, 2025
Next