jianganbai

Anbai Jiang jianganbai

PhD student at EE, Tsinghua. Anomaly Detection | Audio Processing

31 followers · 18 following

Tsinghua University
Beijing
https://scholar.google.com/citations?user=w68g1qkAAAAJ&hl=zh-CN&oi=ao

Lists (14)

NSFW

Speech

6 repositories

Time Series

1 repository

Toolkit

2 repositories

Vibration

4 repositories

Stars

liguge / Awesome-large-language-model-for-Prognostics-and-health-management

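Large language models for predictive maintenance and health management (large models for fault diagnosis; large models for remaining useful life prediction)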

94 6 Updated Dec 15, 2025

facebookresearch / ssl-data-curation

PyTorch code for hierarchical k-means -- a data curation method for self-supervised learning

Jupyter Notebook 227 14 Updated Jun 21, 2024

XiaomiMiMo / MiMo-Audio

MiMo-Audio: Audio Language Models are Few-Shot Learners

Python 905 87 Updated Sep 20, 2025

naver-ai / rope-vit

[ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"

Python 428 10 Updated Oct 29, 2025

thuml / Large-Time-Series-Model

Official code, datasets and checkpoints for "Timer: Generative Pre-trained Transformers Are Large Time Series Models" (ICML 2024) and subsequent works

Python 882 91 Updated Jul 22, 2025

Charlie5DH / PredictiveMaintenance-and-Vibration-Resources

Papers and datasets for Vibration Analysis

Jupyter Notebook 198 42 Updated Feb 2, 2025

MoonshotAI / Kimi-Audio

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,393 319 Updated Jun 21, 2025

ddlBoJack / MMAR

[NeurIPS 2025] Benchmark data and code for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

Python 185 4 Updated Dec 13, 2025

jishengpeng / WavTokenizer

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,247 104 Updated Mar 2, 2025

jonnor / machinehearing

Machine Learning applied to sound

Jupyter Notebook 285 48 Updated Jun 8, 2025

facebookresearch / audiobox-aesthetics

Unified automatic quality assessment for speech, music, and sound.

Python 649 48 Updated Jun 5, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,282 7,789 Updated Dec 21, 2025

hustcxl / Rotating-machine-fault-data-set

A curated collection of open-source rotating machinery fault datasets

1,123 303 Updated May 29, 2025

liuzy0708 / MCC5-THU-Gearbox-Benchmark-Datasets

A benchmark fault diagnosis dataset comprising vibration data collected from a gearbox under variable working conditions with intentionally induced faults, encompassing diverse fault severities and …

MATLAB 72 8 Updated Nov 26, 2025

faroit / python_audio_loading_benchmark

Benchmark popular audio i/o packages

Python 152 11 Updated Dec 19, 2023

FoundationVision / VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,562 551 Updated Nov 10, 2025

YuanGongND / whisper-at

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Python 411 35 Updated Feb 21, 2024

Jinbo-Hu / PSELDNets

PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection

Python 28 4 Updated Sep 17, 2025

nttcslab / m2d

Masked Modeling Duo: Towards a Universal Audio Pre-training Framework

Jupyter Notebook 131 7 Updated Dec 6, 2025

descriptinc / descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,662 165 Updated Dec 5, 2025

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python 7,202 668 Updated Aug 15, 2025

RicherMans / Dasheng

Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"

Python 78 3 Updated Nov 7, 2025

nttcslab / dcase2024_task2_evaluator

Python 8 1 Updated Sep 10, 2024

Tele-AI / TeleSpeech-ASR

Python 815 74 Updated Jun 7, 2024

QwenLM / Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 2,014 155 Updated Apr 21, 2025

modelscope / FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis and music generation.

Python 438 34 Updated Jan 25, 2024

nttcslab / dcase2023_task2_evaluator

Python 12 2 Updated Aug 10, 2023

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,864 343 Updated Jan 4, 2024

meta-llama / llama-cookbook

Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…

Jupyter Notebook 18,096 2,671 Updated Nov 3, 2025

X-LANCE / SLAM-LLM

A Framework for Speech, Language, Audio, Music Processing with Large Language Model

Python 939 100 Updated Oct 24, 2025

Lists (14)

AIGC

Anomaly Detection

Audio

Codec

CV

Deep Learning

Federated Learning