jyhan03

Jiangyu Han jyhan03

Audio & Speech Processing

40 followers · 54 following

Brno University of Technology
17:24 (UTC +02:00)

Achievements

Stars

REAL-TSE / REAL-TSE-Challenge

Python 11 1 Updated Apr 10, 2026

REAL-TSE / wesep-real-tse

Python 12 1 Updated Apr 14, 2026

Xiaobin-Rong / gtcrn

The official implementation of GTCRN, an ultra-lightweight SE model.

Python 620 103 Updated Jan 18, 2026

duoan / TorchCode

🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.

Jupyter Notebook 3,507 287 Updated Mar 27, 2026

JusperLee / TIGER

TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation

Python 409 61 Updated Oct 6, 2025

wenet-e2e / wesep

Target Speaker Extraction Toolkit

Python 259 36 Updated Oct 4, 2025

zlin0 / wedefense

WeDefense: A Toolkit to Defend Against Fake Audio

Python 29 1 Updated Feb 20, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 357,938 72,736 Updated Apr 15, 2026

BUTSpeechFIT / MultiSV

MultiSV: scripts for data preparation

Shell 30 3 Updated Jan 18, 2025

langchain-ai / langchain

The agent engineering platform

Python 133,659 22,084 Updated Apr 15, 2026

AI4Hearing / neurohear

Python 8 1 Updated Jan 12, 2026

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 90,823 13,941 Updated Apr 11, 2026

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 4,339 332 Updated Jan 5, 2026

facebookresearch / sam-audio

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,456 307 Updated Jan 5, 2026

pipecat-ai / pipecat

Open Source framework for voice and multimodal conversational AI

Python 11,313 1,936 Updated Apr 15, 2026

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,760 808 Updated Mar 25, 2026

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 15,684 1,647 Updated Mar 17, 2026

mistralai / mistral-inference

Official inference library for Mistral models

Jupyter Notebook 10,771 1,036 Updated Feb 26, 2026

fixie-ai / ultravox

A fast multimodal LLM for real-time voice

Python 4,398 371 Updated Dec 12, 2025

gpt-omni / mini-omni

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,540 310 Updated Nov 5, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 20,582 2,363 Updated Mar 16, 2026

Audio-Reasoning-Challenge / Audio-Reasoning-Challenge-Baselines

The baselines of ARC-Challenge-Interspeech2026

Python 58 5 Updated Dec 1, 2025

X-LANCE / SLAM-LLM

A Framework for Speech, Language, Audio, Music Processing with Large Language Model

Python 1,019 112 Updated Jan 15, 2026

DanielLin94144 / Full-Duplex-Bench

A Benchmark for Evaluating Turn-Taking and Overlap Handling in Full-Duplex Spoken Dialogue Models

Python 162 12 Updated Apr 15, 2026

ddlBoJack / emotion2vec

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 1,095 85 Updated Dec 23, 2024

hiyouga / LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,137 8,580 Updated Apr 12, 2026

liyunlongaaa / NSD-MS2S

CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture

Shell 85 11 Updated Jun 17, 2025

facebookresearch / omnilingual-asr

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python 2,768 247 Updated Dec 30, 2025

dougbrion / pytorch-classification-uncertainty

This repo contains a PyTorch implementation of the paper: "Evidential Deep Learning to Quantify Classification Uncertainty"

Python 520 71 Updated Jan 2, 2024

jxzhangjhu / Awesome-LLM-Uncertainty-Reliability-Robustness

Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models

816 54 Updated Apr 5, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jiangyu Han jyhan03

Achievements

Achievements

Block or report jyhan03

Stars

REAL-TSE / REAL-TSE-Challenge

REAL-TSE / wesep-real-tse

Xiaobin-Rong / gtcrn

duoan / TorchCode

JusperLee / TIGER

wenet-e2e / wesep

zlin0 / wedefense

openclaw / openclaw

BUTSpeechFIT / MultiSV

langchain-ai / langchain

AI4Hearing / neurohear

rasbt / LLMs-from-scratch

facebookresearch / flow_matching

facebookresearch / sam-audio

pipecat-ai / pipecat

open-mmlab / Amphion

modelscope / FunASR

mistralai / mistral-inference

fixie-ai / ultravox

gpt-omni / mini-omni

FunAudioLLM / CosyVoice

Audio-Reasoning-Challenge / Audio-Reasoning-Challenge-Baselines

X-LANCE / SLAM-LLM

DanielLin94144 / Full-Duplex-Bench

ddlBoJack / emotion2vec

hiyouga / LlamaFactory

liyunlongaaa / NSD-MS2S

facebookresearch / omnilingual-asr

dougbrion / pytorch-classification-uncertainty

jxzhangjhu / Awesome-LLM-Uncertainty-Reliability-Robustness