zhuchb

A PhD student working on speech

1 follower · 3 following

Stars

Tonyyouyou / Auto-Landmark

Python 4 Updated Sep 15, 2024

Berkeley-Speech-Group / Speech-Articulatory-Coding

Jupyter Notebook 63 13 Updated May 29, 2025

speechandlanguageprocessing / ICASSP2022-Depression

Automatic Depression Detection: A GRU/BiLSTM-based Model and an Emotional Audio-Textual Corpus

Python 215 36 Updated Jul 10, 2023

zxzhao0 / C2SER

We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through contextual perception and Chain-of-Thought (CoT) reasoning.

Python 49 3 Updated Mar 3, 2025

AIM3-RUC / RUCM3ED

M3ED: Multi-modal Multi-scene Multi-label Emotional Dialogue Database. ACL 2022

124 7 Updated Sep 24, 2022

OzymandiasChen / SAFM

Sparse Adapter Fusion for Continual Learning in NLP - EACL 2026

Python 15 Updated Apr 9, 2026

OzymandiasChen / ActorMind

ActorMind: Emulating Human Actor Reasoning for Speech Role-Playing - ACL Findings 2026

25 1 Updated Jun 4, 2026

OzymandiasChen / PCGR

Prototype Conditioned Generative Replay for Continual Learning in NLP - NAACL 2025

Python 26 Updated Apr 9, 2026

weitianxin / Awesome-Agentic-Reasoning

A curated list of papers and resources based on the survey "Agentic Reasoning for Large Language Models"

1,279 100 Updated Mar 9, 2026

langchain-ai / langchain

The agent engineering platform.

Python 139,693 23,162 Updated Jun 19, 2026

phioranex / openclaw-docker

Shell 681 100 Updated Jun 16, 2026

thu-coai / PsyQA

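A Chinese mental health support question-answering dataset with rich annotations of support strategies. It can be used to generate long counseling texts that make use of those strategies.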

267 21 Updated Jul 21, 2024

WangHelin1997 / SSR-Speech

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis

Python 151 18 Updated Jan 1, 2025

Zain-Jiang / Speech-Editing-Toolkit

It's a repository for implementations of neural speech editing algorithms.

Python 207 21 Updated Jan 9, 2024

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 9,366 786 Updated Mar 26, 2026

yt-dlp / yt-dlp

A feature-rich command-line audio/video downloader

Python 171,648 14,470 Updated Jun 18, 2026

NVIDIA / audio-flamingo

PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models

1,145 96 Updated Dec 15, 2025

facebookresearch / EmpatheticDialogues

Dialogue model that produces empathetic responses when trained on the EmpatheticDialogues dataset.

Python 553 69 Updated Dec 3, 2021

bootphon / phonemizer

Simple text to phones converter for multiple languages

Python 1,555 198 Updated Sep 26, 2024

Kyubyong / g2p

g2p: English Grapheme To Phoneme Conversion

Python 924 134 Updated Jan 5, 2023

lucidrains / vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

Python 3,964 331 Updated Jun 5, 2026

HITsz-TMG / Uni-MoE

Uni-MoE: Lychee's Large Multimodal Model Family.

Python 1,110 69 Updated Dec 22, 2025

OpenSparseLLMs / LLaMA-MoE-v2

🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training

Python 93 13 Updated Dec 3, 2024

WangHelin1997 / CapSpeech

CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech

Jupyter Notebook 368 41 Updated Aug 14, 2025

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 14,780 2,158 Updated May 18, 2026

dreamtheater123 / Awesome-SpeechLM-Survey

GitHub repository for the ACL 2025 paper "Recent Advances in Speech Language Models: A Survey".

209 10 Updated Jun 17, 2025

spotify / pedalboard

🎛 🔊 A Python library for audio.

C++ 6,171 339 Updated May 21, 2026

jishengpeng / ControlSpeech

[ACL 2025 Main] ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec

Python 274 14 Updated Nov 22, 2024

fishaudio / fish-speech

SOTA Open Source TTS

Python 30,867 2,637 Updated Jun 9, 2026

MiniMax-AI / MiniMax-M2

MiniMax-M2, a model built for Max coding & agentic workflows.

2,599 215 Updated Nov 13, 2025
