Skip to content
View jianganbai's full-sized avatar

Block or report jianganbai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

用于预测性维护与健康管理的大型语言模型(故障诊断大模型;剩余使用寿命预测大模型)

94 6 Updated Dec 15, 2025

PyTorch code for hierarchical k-means -- a data curation method for self-supervised learning

Jupyter Notebook 227 14 Updated Jun 21, 2024

MiMo-Audio: Audio Language Models are Few-Shot Learners

Python 905 87 Updated Sep 20, 2025

[ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"

Python 428 10 Updated Oct 29, 2025

Official code, datasets and checkpoints for "Timer: Generative Pre-trained Transformers Are Large Time Series Models" (ICML 2024) and subsequent works

Python 882 91 Updated Jul 22, 2025

Papers and datasets for Vibration Analysis

Jupyter Notebook 198 42 Updated Feb 2, 2025

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,393 319 Updated Jun 21, 2025

[NeurIPS 2025] Benchmark data and code for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

Python 185 4 Updated Dec 13, 2025

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,247 104 Updated Mar 2, 2025

Machine Learning applied to sound

Jupyter Notebook 285 48 Updated Jun 8, 2025

Unified automatic quality assessment for speech, music, and sound.

Python 649 48 Updated Jun 5, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,282 7,789 Updated Dec 21, 2025

Open rotating mechanical fault datasets (开源旋转机械故障数据集整理)

1,123 303 Updated May 29, 2025

A benchmark fault diagnosis dataset comprises vibration data collected from a gearbox under variable working conditions with intentionally induced faults, encompassing diverse fault severities and …

MATLAB 72 8 Updated Nov 26, 2025

Benchmark popular audio i/o packages

Python 152 11 Updated Dec 19, 2023

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,562 551 Updated Nov 10, 2025

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Python 411 35 Updated Feb 21, 2024

PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection

Python 28 4 Updated Sep 17, 2025

Masked Modeling Duo: Towards a Universal Audio Pre-training Framework

Jupyter Notebook 131 7 Updated Dec 6, 2025

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,662 165 Updated Dec 5, 2025

Multilingual Voice Understanding Model

Python 7,202 668 Updated Aug 15, 2025

Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"

Python 78 3 Updated Nov 7, 2025
Python 815 74 Updated Jun 7, 2024

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 2,014 155 Updated Apr 21, 2025

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

Python 438 34 Updated Jan 25, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,864 343 Updated Jan 4, 2024

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,096 2,671 Updated Nov 3, 2025

A Framework for Speech, Language, Audio, Music Processing with Large Language Model

Python 939 100 Updated Oct 24, 2025
Next