Starred repositories
The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation Model
Towards Scalable Pre-training of Visual Tokenizers for Generation
Official implementation for "What Matters for Representation Alignment: Global Information or Spatial Structure?"
[NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance
Advanced Signal Processing Notebooks and Tutorials
Suggestions for those interested in developing machine-learning applications for audio
GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning
CogView4, CogView3-Plus, and CogView3 (ECCV 2024)
An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone
A simple yet effective Audio-to-MIDI Automatic Piano Transcription system
UniSpeech - Large Scale Self-Supervised Learning for Speech
Variational Autoencoder in the mel-spectrogram domain for one-shot audio synthesis
PyTorch implementation of the paper "M3-TTS: Multi-modal DiT Alignment & Mel-latent for Zero-shot High-fidelity Speech Synthesis"
Official implementation of the paper "Pianist Transformer: Towards Expressive Piano Performance Rendering via Scalable Self-Supervised Pre-Training"
ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation
Official repository for “DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation”