Lists (32)
Sort Name ascending (A-Z)
academic
acoustic echo cancellation
AIGC
audio codec
audio codecs
audio separation
audio tools
bandwidth extension
beamforming
computer vision
deep learning
diffusion
entertainments
hearing aid
LLM
mircophone array
music tools
noise reduction
packet loss compensation
programming related
simulation tools
singing voice tools
sound source localization
spatial audio
speaker recognition
speech dereverberation
speech diarization
speech frontend
speech recognition
speech separation
speech signal processing
speech voice tools
Starred repositories
This is an official implementation for "Video Swin Transformers".
tloen / llama-int8
Forked from meta-llama/llamaQuantized inference code for LLaMA models
YapaLab / yolo-face
Forked from ultralytics/ultralyticsYOLO Face 🚀 in PyTorch
davidbrowne17 / csm-streaming
Forked from SesameAILabs/csmRealtime demo, Streaming and Finetuning code for CSM
世界上最好的提示词 (总计估值超过300亿的提示词)外国网友x1xh成功获取了 v0、Manus、Cursor、Same.dev 和 Lovable 的完整官方系统提示词和内部工具。
anan235 / dia-multilingual
Forked from nari-labs/diaA TTS model capable of generating ultra-realistic dialogue in one pass.
ROCm / flash-attention
Forked from Dao-AILab/flash-attentionFast and memory-efficient exact attention
victorchall / genmoai-smol
Forked from genmoai/mochiThe best OSS video generation models
kyutai-labs / nanoGPTaudio
Forked from karpathy/nanoGPTCode for the blog "Neural audio codecs: how to get audio into LLMs"
sisrfeng / n_gpu
Forked from wookayin/gpustat📊 A simple command-line utility for querying and monitoring GPU status
ASLP-lab / DiffRhythm2
Forked from xiaomi-research/diffrhythm2Di♪♪Rhythm 2: Efficient And High Fidelity Song Generation Via Block Flow Matching
orion-labs / opuslib
Forked from svartalf/python-opusPython bindings to the libopus, IETF low-delay audio codec
cogmhear / avse_challenge
Forked from claritychallenge/clarityCOG-MHEAR Audio-Visual Speech Enhancement Challenge
(Interspeech 2025, official code) Speech enhancement based on cascaded two flows
ws-choi / sdx23
Forked from kuielab/mdx-netKUIELAB-MDX-Net got the 2nd place on the Leaderboard A and the 3rd place on the Leaderboard B in the MDX-Challenge ISMIR 2021
Fast and memory-efficient exact attention
Code for "SNR-Aligned Consistent Diffusion for Adaptive Speech Enhancement" (Interspeech 2025)
WenzheLiu-Speech / aac-datasets
Forked from Labbeti/aac-datasetsAudio Captioning datasets for PyTorch.