-
-
Interspeech2026-Audio-Encoder-Challenge Public
Forked from DataoceanAI/Interspeech2026-Audio-Encoder-ChallengeUpdatedDec 12, 2025 -
Dasheng Public
Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"
-
dasheng_xiaomi Public
Forked from XiaoMi/dashengOfficial PyTorch code for Deep Audio-Signal Holistic Embeddings
Python Apache License 2.0 UpdatedNov 7, 2025 -
xares Public
Forked from jimbozhang/xaresA benchmark for evaluating audio encoders on various audio tasks.
Python Apache License 2.0 UpdatedSep 1, 2025 -
CED Public
Source code for Consistent ensemble distillation for audio tagging
-
hf_transformers_custom_model_ced Public
Forked from jimbozhang/hf_transformers_custom_model_ced🤗 Transformers custom models (CED)
Python Apache License 2.0 UpdatedApr 25, 2025 -
hf_transformers_custom_model_dasheng Public
Forked from jimbozhang/hf_transformers_custom_model_dasheng🤗 Transformers custom models (Dasheng)
-
Baselines and Classifiers for speaker anti-spoofing detection
-
AudioCaption Public
Dataset and baseline for the first Audiocaption task
-
SAT Public
Streaming Audiotransformers for online Audio tagging
-
HEAR_CED Public
Hear evaluation for CED models.
-
nanopi-openwrt Public
Forked from stupidloud/nanopi-openwrtOpenwrt for Nanopi R4S
Shell UpdatedAug 27, 2023 -
hearbenchmark.com Public
Forked from hearbenchmark/hearbenchmark.comHEAR Benchmark website and leaderboard submissions
Apache License 2.0 UpdatedAug 25, 2023 -
CDur Public
Repository for the paper "Towards duration robust weakly supervised sound event detection"
-
GPV Public
Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper
-
Datadriven-GPVAD Public
The codebase for Data-driven general-purpose voice activity detection.
-
text_based_depression Public
Source code for the paper "Text-based Depression Detection: What Triggers An Alert"
-
Dcase2018_pooling Public
Repo for our pooling approach on the DCASE2018 task4
-
torchaudio Public
Forked from pytorch/audioData manipulation and transformation for audio signal processing, powered by PyTorch
Python BSD 2-Clause "Simplified" License UpdatedJun 21, 2023 -
UIT_Mobile Public
Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"
-
PSL Public
Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"
-
HEAR2021_EfficientLatent Public
Submission to the HEAR2021 Challenge
-
ignite Public
Forked from pytorch/igniteHigh-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.
Python BSD 3-Clause "New" or "Revised" License UpdatedDec 23, 2021 -
ImageNet21K Public
Forked from Alibaba-MIIL/ImageNet21KOfficial Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper
-
hifi-gan Public
Forked from jik876/hifi-ganHiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
-
coc-pyright Public
Forked from fannheyward/coc-pyrightPyright extension for coc.nvim
-
-
audioset_tagging_cnn Public
Forked from qiuqiangkong/audioset_tagging_cnnPython MIT License UpdatedMar 13, 2021 -
SpokenLanguageClassifiers Public
Pretrained spoken language classifiers from audio.