Lists (32)
Sort Name ascending (A-Z)
academic
acoustic echo cancellation
AIGC
audio codec
audio codecs
audio separation
audio tools
bandwidth extension
beamforming
computer vision
deep learning
diffusion
entertainments
hearing aid
LLM
mircophone array
music tools
noise reduction
packet loss compensation
programming related
simulation tools
singing voice tools
sound source localization
spatial audio
speaker recognition
speech dereverberation
speech diarization
speech frontend
speech recognition
speech separation
speech signal processing
speech voice tools
Starred repositories
ASLP-lab / DiffRhythm2
Forked from xiaomi-research/diffrhythm2Di♪♪Rhythm 2: Efficient And High Fidelity Song Generation Via Block Flow Matching
kyutai-labs / nanoGPTaudio
Forked from karpathy/nanoGPTCode for the blog "Neural audio codecs: how to get audio into LLMs"
This is an official implementation for "Video Swin Transformers".
YapaLab / yolo-face
Forked from ultralytics/ultralyticsYOLO Face 🚀 in PyTorch
世界上最好的提示词 (总计估值超过300亿的提示词)外国网友x1xh成功获取了 v0、Manus、Cursor、Same.dev 和 Lovable 的完整官方系统提示词和内部工具。
(Interspeech 2025, official code) Speech enhancement based on cascaded two flows
Code for "SNR-Aligned Consistent Diffusion for Adaptive Speech Enhancement" (Interspeech 2025)
jsalt2020_simulate toolkit modified by ustc-nercslip for chime-7&8
davidbrowne17 / csm-streaming
Forked from SesameAILabs/csmRealtime demo, Streaming and Finetuning code for CSM
anan235 / dia-multilingual
Forked from nari-labs/diaA TTS model capable of generating ultra-realistic dialogue in one pass.
RooCodeInc / Roo-Code
Forked from cline/clineRoo Code gives you a whole dev team of AI agents in your code editor.
Fast and memory-efficient exact attention
ROCm / flash-attention
Forked from Dao-AILab/flash-attentionFast and memory-efficient exact attention
victorchall / genmoai-smol
Forked from genmoai/mochiThe best OSS video generation models
NVIDIA Linux open GPU with P2P support
cogmhear / avse_challenge
Forked from claritychallenge/clarityCOG-MHEAR Audio-Visual Speech Enhancement Challenge
AlexIII / g729a-python
Forked from ploverlake/g729aG.729А audio codec for python 3
WenzheLiu-Speech / aac-datasets
Forked from Labbeti/aac-datasetsAudio Captioning datasets for PyTorch.
orion-labs / opuslib
Forked from svartalf/python-opusPython bindings to the libopus, IETF low-delay audio codec
ws-choi / sdx23
Forked from kuielab/mdx-netKUIELAB-MDX-Net got the 2nd place on the Leaderboard A and the 3rd place on the Leaderboard B in the MDX-Challenge ISMIR 2021
Generative Agents: Interactive Simulacra of Human Behavior using GPT4All free model which runs on CPU
sisrfeng / n_gpu
Forked from wookayin/gpustat📊 A simple command-line utility for querying and monitoring GPU status
tloen / llama-int8
Forked from meta-llama/llamaQuantized inference code for LLaMA models