Skip to content
View jyhan03's full-sized avatar
  • Brno University of Technology
  • 03:32 (UTC +02:00)

Block or report jyhan03

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 5 1 Updated Apr 1, 2026
Python 9 1 Updated Apr 2, 2026

The official implementation of GTCRN, an ultra-lightweight SE model.

Python 604 101 Updated Jan 18, 2026

🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.

Jupyter Notebook 3,373 275 Updated Mar 27, 2026

TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation

Python 405 61 Updated Oct 6, 2025

Target Speaker Extraction Toolkit

Python 254 35 Updated Oct 4, 2025

WeDefense: A Toolkit to Defend Against Fake Audio

Python 27 1 Updated Feb 20, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 347,270 69,265 Updated Apr 4, 2026

MultiSV: scripts for data preparation

Shell 30 3 Updated Jan 18, 2025

The agent engineering platform

Python 132,263 21,810 Updated Apr 4, 2026
Python 8 1 Updated Jan 12, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 89,918 13,737 Updated Apr 1, 2026

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 4,297 325 Updated Jan 5, 2026

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,437 305 Updated Jan 5, 2026

Open Source framework for voice and multimodal conversational AI

Python 11,012 1,869 Updated Apr 3, 2026

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,737 804 Updated Mar 25, 2026

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 15,530 1,629 Updated Mar 17, 2026

Official inference library for Mistral models

Jupyter Notebook 10,752 1,034 Updated Feb 26, 2026

A fast multimodal LLM for real-time voice

Python 4,391 370 Updated Dec 12, 2025

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,539 308 Updated Nov 5, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 20,386 2,325 Updated Mar 16, 2026

The baselines of ARC-Challenge-Interspeech2026

Python 57 5 Updated Dec 1, 2025

A Framework for Speech, Language, Audio, Music Processing with Large Language Model

Python 1,014 111 Updated Jan 15, 2026

A Benchmark for Evaluating Turn-Taking and Overlap Handling in Full-Duplex Spoken Dialogue Models

Python 146 12 Updated Feb 23, 2026

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 1,089 84 Updated Dec 23, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 69,478 8,453 Updated Apr 1, 2026

CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture

Shell 85 11 Updated Jun 17, 2025

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python 2,746 246 Updated Dec 30, 2025

This repo contains a PyTorch implementation of the paper: "Evidential Deep Learning to Quantify Classification Uncertainty"

Python 517 71 Updated Jan 2, 2024

Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models

813 54 Updated Mar 28, 2026
Next