RicherMans

Heinrich Dinkel RicherMans

Changing with each passing day

206 followers · 143 following

Xiaomi
China, Beijing
richermans.github.io

Achievements

x2 x2

mecat Public
Forked from xiaomi-research/mecat

Python 1 Apache License 2.0 Updated Dec 17, 2025
Interspeech2026-Audio-Encoder-Challenge Public
Forked from DataoceanAI/Interspeech2026-Audio-Encoder-Challenge

Updated Dec 12, 2025
Dasheng Public

Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"

Python 78 3 Apache License 2.0 Updated Nov 7, 2025
dasheng_xiaomi Public
Forked from XiaoMi/dasheng

Official PyTorch code for Deep Audio-Signal Holistic Embeddings

Python Apache License 2.0 Updated Nov 7, 2025
xares Public
Forked from jimbozhang/xares

A benchmark for evaluating audio encoders on various audio tasks.

Python Apache License 2.0 Updated Sep 1, 2025
CED Public

Source code for Consistent ensemble distillation for audio tagging

audio tagging sound classification

Python 53 6 GNU General Public License v3.0 Updated Jun 12, 2025
hf_transformers_custom_model_ced Public
Forked from jimbozhang/hf_transformers_custom_model_ced

🤗 Transformers custom models (CED)

Python Apache License 2.0 Updated Apr 25, 2025
hf_transformers_custom_model_dasheng Public
Forked from jimbozhang/hf_transformers_custom_model_dasheng

🤗 Transformers custom models (Dasheng)

Python 1 Apache License 2.0 Updated Apr 24, 2025
Speaker-Anti-Spoofing-Classifiers Public

Baselines and Classifiers for speaker anti-spoofing detection

dataset baseline spoofing anti-spoofing spoofing-attack spoofs

Python 18 5 Updated Jul 25, 2024
AudioCaption Public

Dataset and baseline for the first Audiocaption task

dataset baseline audiocaption

Python 79 9 MIT License Updated Jul 25, 2024
SAT Public

Streaming Audiotransformers for online Audio tagging

Python 49 4 GNU General Public License v3.0 Updated Jun 14, 2024
HEAR_CED Public

HEAR evaluation for CED models.

Python 7 2 GNU General Public License v3.0 Updated Mar 13, 2024
nanopi-openwrt Public
Forked from stupidloud/nanopi-openwrt

OpenWrt for NanoPi R4S

Shell Updated Aug 27, 2023
hearbenchmark.com Public
Forked from hearbenchmark/hearbenchmark.com

HEAR Benchmark website and leaderboard submissions

Apache License 2.0 Updated Aug 25, 2023
CDur Public

Repository for the paper "Towards duration robust weakly supervised sound event detection"

Python 23 5 GNU General Public License v3.0 Updated Aug 3, 2023
GPV Public

Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper

machine-learning pytorch voice-activity-detection speech-activity-detection noise-robust-asr sound-activity

Python 141 29 GNU General Public License v3.0 Updated Aug 3, 2023
Datadriven-GPVAD Public

The codebase for Data-driven general-purpose voice activity detection.

machine-learning pytorch voice-activity-detection speech-activity-detection noise-robust

Python 94 23 MIT License Updated Aug 3, 2023
text_based_depression Public

Source code for the paper "Text-based Depression Detection: What Triggers An Alert"

Python 50 9 Updated Jul 6, 2023
Dcase2018_pooling Public

Repo for our pooling approach on the DCASE2018 task4

Python 15 3 Apache License 2.0 Updated Jul 6, 2023
torchaudio Public
Forked from pytorch/audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python BSD 2-Clause "Simplified" License Updated Jun 21, 2023
UIT_Mobile Public

Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"

Python 23 3 GNU General Public License v3.0 Updated Mar 6, 2023
PSL Public

Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"

Python 31 4 GNU General Public License v3.0 Updated Apr 29, 2022
HEAR2021_EfficientLatent Public

Submission to the HEAR2021 Challenge

Python 17 7 Apache License 2.0 Updated Mar 5, 2022
ignite Public
Forked from pytorch/ignite

High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.

Python BSD 3-Clause "New" or "Revised" License Updated Dec 23, 2021
ImageNet21K Public
Forked from Alibaba-MIIL/ImageNet21K

Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper

Python 2 MIT License Updated Aug 7, 2021
hifi-gan Public
Forked from jik876/hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 1 MIT License Updated Apr 28, 2021
coc-pyright Public
Forked from fannheyward/coc-pyright

Pyright extension for coc.nvim

TypeScript 1 MIT License Updated Apr 25, 2021
Nanopi-R4S Public

My NanoPi R4S builds

Shell 1 Updated Apr 24, 2021
audioset_tagging_cnn Public
Forked from qiuqiangkong/audioset_tagging_cnn

Python MIT License Updated Mar 13, 2021
SpokenLanguageClassifiers Public

Pretrained spoken language classifiers from audio.

Python 10 2 MIT License Updated Jan 21, 2021
