Skip to content
View jyhan03's full-sized avatar
  • Brno University of Technology
  • 17:24 (UTC +02:00)

Block or report jyhan03

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 11 1 Updated Apr 10, 2026
Python 12 1 Updated Apr 14, 2026

The official implementation of GTCRN, an ultra-lightweight SE model.

Python 620 103 Updated Jan 18, 2026

🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.

Jupyter Notebook 3,507 287 Updated Mar 27, 2026

TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation

Python 409 61 Updated Oct 6, 2025

Target Speaker Extraction Toolkit

Python 259 36 Updated Oct 4, 2025

WeDefense: A Toolkit to Defend Against Fake Audio

Python 29 1 Updated Feb 20, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 357,938 72,736 Updated Apr 15, 2026

MultiSV: scripts for data preparation

Shell 30 3 Updated Jan 18, 2025

The agent engineering platform

Python 133,659 22,084 Updated Apr 15, 2026
Python 8 1 Updated Jan 12, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 90,823 13,941 Updated Apr 11, 2026

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 4,339 332 Updated Jan 5, 2026

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,456 307 Updated Jan 5, 2026

Open Source framework for voice and multimodal conversational AI

Python 11,313 1,936 Updated Apr 15, 2026

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,760 808 Updated Mar 25, 2026

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 15,684 1,647 Updated Mar 17, 2026

Official inference library for Mistral models

Jupyter Notebook 10,771 1,036 Updated Feb 26, 2026

A fast multimodal LLM for real-time voice

Python 4,398 371 Updated Dec 12, 2025

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,540 310 Updated Nov 5, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 20,582 2,363 Updated Mar 16, 2026

The baselines of ARC-Challenge-Interspeech2026

Python 58 5 Updated Dec 1, 2025

A Framework for Speech, Language, Audio, Music Processing with Large Language Model

Python 1,019 112 Updated Jan 15, 2026

A Benchmark for Evaluating Turn-Taking and Overlap Handling in Full-Duplex Spoken Dialogue Models

Python 162 12 Updated Apr 15, 2026

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 1,095 85 Updated Dec 23, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,137 8,580 Updated Apr 12, 2026

CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture

Shell 85 11 Updated Jun 17, 2025

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python 2,768 247 Updated Dec 30, 2025

This repo contains a PyTorch implementation of the paper: "Evidential Deep Learning to Quantify Classification Uncertainty"

Python 520 71 Updated Jan 2, 2024

Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models

816 54 Updated Apr 5, 2026
Next