yezhangyinge

yezhangyinge yezhangyinge

normal people.

17 followers · 220 following

Achievements

Lists (31)

Sort

Stars

fclearner / Personal-vad-2.0

Implementation of "Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition"

Python 12 3 Updated Jun 9, 2026

LeventureQys / AudioProcesser

This project focuses on audio processing and filter simulation research. It uses Python for simulation experiments and C++ for engineering implementation, covering extensive machine learning practi…

Jupyter Notebook 13 5 Updated Jun 13, 2026

WangHelin1997 / SoloSpeech

SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline

Python 314 32 Updated Jan 19, 2026

Deep-Learning-101 / Speech-Processing-Paper

https://deeplearning101.twman.org/Speech-Processing Speech Processing (語音處理)

HTML 47 4 Updated Jun 2, 2026

zhiyongchenGREAT / Few-shot-Robust-Speaker-TTS

Repo for Paper: Towards Robust Speaker Recognition against Intrinsic Variation with Foundation Model Few-shot Tuning and Effective Speech Synthesis

Python 11 2 Updated Sep 24, 2025

Tomsawyerhu / Chinese-WebNovel-Skill

中文网文小说写作skill

Python 395 52 Updated May 22, 2026

seongq / flowmse

(ICASSP 2025, official code)FlowSE: Flow Matching-based Speech Enhancement

Python 101 4 Updated Jul 23, 2025

Gilgamesh-J / X-ASR

X-ASR is a series of automatic speech recognition models based on the icefall framework, focusing on streaming ASR and low-latency deployment.

Swift 116 11 Updated Jun 11, 2026

huggingface / open_asr_leaderboard

Python 217 103 Updated Jun 11, 2026

speechio / chinese_text_normalization

Chinese text normalization for speech processing

Python 732 151 Updated Mar 18, 2023

Egonex-AI / Understand-Anything

Graphs that teach > graphs that impress. Turn any code into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claude Code, Codex, Cursor, Copilot, Gemini C…

TypeScript 59,786 4,949 Updated Jun 11, 2026

GannotLab / LC-DeepBeam

Python 9 Updated May 17, 2026

crlandsc / moises-light

Unofficial PyTorch implementation of "Moises-Light: Resource-efficient Band-split U-Net For Music Source Separation"

Python 31 1 Updated May 1, 2026

elevoctech / ESMB-corpus

19 1 Updated Oct 7, 2021

popcornell / FastMSS

Python 26 4 Updated May 18, 2026

openmirlab / melband-roformer-infer

Python 8 4 Updated Jan 13, 2026

FIGLAB / SoundBubble

Daehwa Kim and Chris Harrison. "SoundBubble: Finger-Bound Virtual Microphone using Headset/Glasses Beamforming" CHI 2026

C# 10 1 Updated Mar 16, 2026

chenhg5 / cc-connect

Bridge local AI coding agents (Claude Code, Cursor, Gemini CLI, Codex) to messaging platforms (Feishu/Lark, DingTalk, Slack, Telegram, Discord, LINE, WeChat Work). Chat with your AI dev assistant f…

Go 12,412 1,176 Updated Jun 15, 2026

anthropics / skills

Public repository for Agent Skills

Python 150,908 17,807 Updated Jun 9, 2026

Shybert-AI / AEC-Two-Stage-Based

基于两阶段的声学回声消除系统 A Two-Stage-Based Acoustic Echo Cancellation System

Python 17 8 Updated Feb 22, 2026

Andong-Li-speech / DegVoC

This is the repository for the work "DegVoC: Rethinking Neural Vocoder from a Degradation Perspective", which is accepted at AAAI 2026.

Python 11 Updated Apr 29, 2026

multica-ai / andrej-karpathy-skills

A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.

175,800 17,944 Updated Apr 20, 2026

richiejp / deepvqe-ggml

DeepVQE reimplementation in PyTorch and GGML — real-time acoustic echo cancellation with soft delay estimation

Python 40 11 Updated Apr 27, 2026

localai-org / LocalVQE

Lean neural real-time acoustic echo cancellation with soft delay estimation - GGML and PyTorch inference

C++ 106 10 Updated Jun 14, 2026

viewfinder-annn / AnyEnhance-v1

AnyEnhance-based Baseline for the CCF-AATC 2025 Challenge Track 1

Python 57 5 Updated May 21, 2026

woongzip1 / UniverSR

Official implemtation of UniverSR (ICASSP 2026)

Python 52 6 Updated Apr 9, 2026

Imbad0202 / academic-research-skills

Academic Research Skills for Claude Code: research → write → review → revise → finalize

Python 31,583 2,596 Updated Jun 15, 2026

Xiaobin-Rong / unipase

Official repository of UniPASE, a SOTA USE model

30 1 Updated Mar 19, 2026

juice500ml / phonetic-arithmetic

Python 7 1 Updated May 25, 2026

ceva-ip / DPDFNet

DPDFNet: causal single-channel speech enhancement that boosts DeepFilterNet2 with dual-path RNN blocks for stronger long-range temporal and cross-band modeling. Repo includes PyTorch implementation…

Python 95 11 Updated May 27, 2026

yezhangyinge yezhangyinge

Lists (31)

AEC

AFX

ASR

ASV-spoof

AVSE

binaural&spatial

BWE

C/Python SPTK

codec

CT

Dataset

detector

DOA

Interesting

KWS

Light-Weighting

LLM

MCSE

Metric

NN deploy

NS

Others

PNS

rust

SER

Spatial audio

Speaker

speech accessment

SSSL

TTS

VAD

Stars