Skip to content
View zhouzhao01's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report zhouzhao01

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
81 results for source starred repositories
Clear filter

Multi-task learning using uncertainty to weigh losses for scene geometry and semantics, Auxiliary Tasks in Multi-task Learning

Python 644 88 Updated Jun 20, 2020

Toolbox for Evaluation of AEC/AES Systems

MATLAB 32 4 Updated Jun 9, 2025

Control adaptive filters with neural networks.

Python 267 44 Updated Feb 2, 2025

End-To-End Deep Learning-based Adaptation Control for Linear Acoustic Echo Cancellation

Python 37 15 Updated Nov 17, 2023

Acoustic Echo Cancellation with Nerual Kalman Filtering

HTML 346 76 Updated Feb 21, 2023

This Repostory contains the pretrained DTLN-aec model for real-time acoustic echo cancellation.

Python 362 80 Updated Apr 26, 2022

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Python 4,134 612 Updated Dec 1, 2025

A benchmark for evaluating audio encoders on various audio tasks.

Python 42 7 Updated Dec 11, 2025

XARES-LLM

Python 52 2 Updated Jan 12, 2026

AEC Challenge

468 146 Updated Jun 4, 2024

The official code repository for SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement

Python 747 83 Updated Dec 4, 2025

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 20,066 2,138 Updated Feb 9, 2026

Grapheme to phoneme conversion with deep learning.

Python 419 57 Updated Dec 8, 2023

Unified automatic quality assessment for speech, music, and sound.

Python 672 49 Updated Jun 5, 2025

A lightweight library for Frechet Audio Distance calculation.

Python 308 27 Updated Jan 11, 2026

JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment

Python 152 20 Updated Aug 7, 2025

All-In-One Music Structure Analyzer

Python 716 111 Updated May 9, 2024

Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch

Python 1,230 1,622 Updated Aug 29, 2025

Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".

Python 308 13 Updated Aug 4, 2025

A powerful coding agent toolkit providing semantic retrieval and editing capabilities (MCP server & other integrations)

Python 19,966 1,346 Updated Feb 9, 2026

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 14,860 1,574 Updated Feb 4, 2026

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 1,754 159 Updated Jan 29, 2026

Mobile and Web client for Codex and Claude Code, with realtime voice, encryption and fully featured

TypeScript 11,054 824 Updated Jan 29, 2026
Python 17 1 Updated Jun 24, 2025

MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.

Jupyter Notebook 44 2 Updated Dec 3, 2024

Music Structure Analysis Framework

Python 543 88 Updated Jul 9, 2025

A library for audio and music analysis, feature extraction.

C 3,263 150 Updated May 24, 2024

Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".

Python 431 26 Updated May 25, 2025
Next