Stars
GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning
The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs
PyTorch implementation of JiT (https://arxiv.org/abs/2511.13720)
Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages
OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.
This is the official repo for the paper "LongCat-Flash-Omni Technical Report"
Proxmox VE Helper-Scripts (Community Edition)
Data Pipeline, Models, and Benchmark for Omni-Captioner.
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
Exploration into Discrete Distribution Network, by Lei Yang out of Beijing
Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation
Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"
Official Repository of Paper: "Emilia-NV: A Non-Verbal Speech Dataset with Word-Level Annotation for Human-Like Speech Modeling"
Qwen3-Omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
A transformer-based LLM, written completely in Rust.
MiMo-Audio: Audio Language Models are Few-Shot Learners
Flash Attention Triton kernel with support for second-order derivatives
[ICCV 2025] Implementation of the paper "Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs"