Skip to content
View JishengBai's full-sized avatar

Block or report JishengBai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A mirror of BigVGAN and HiFi-GAN for access via PyTorch Hub.

Python 5 1 Updated Aug 14, 2024

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 1,200 145 Updated Sep 5, 2024

Audio Large Language Models

Python 899 46 Updated Jul 5, 2025

Unleash Next-Level AI! 🚀 💻 Code Generation: DeepSeek r1 + Claude 3.7 Sonnet - Unparalleled Performance! 📝 Content Creation: DeepSeek r1 + Gemini 2.5 Pro - Superior Quality! 🔌 OpenAI-Compatible. 🌊 S…

Python 2,795 503 Updated Feb 23, 2026

每个人都能看懂的大模型知识分享,LLMs春/秋招大模型面试前必看,让你和面试官侃侃而谈

Jupyter Notebook 6,111 574 Updated Mar 30, 2026

Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

Python 154 15 Updated Dec 5, 2024

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 23,830 2,743 Updated Mar 12, 2026

🧑‍🚀 全世界最好的LLM资料总结(多模态生成、Agent、辅助编程、AI审稿、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.

7,982 808 Updated Mar 31, 2026

Collection of awesome test-time (domain/batch/instance) adaptation methods

1,249 78 Updated Nov 14, 2025

WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection

Python 17 Updated Nov 19, 2024

Let your Claude able to think

TypeScript 16,977 1,976 Updated Nov 4, 2025
Python 14 Updated Jan 2, 2025

A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline

Python 198 5 Updated Dec 13, 2024

A Framework for Speech, Language, Audio, Music Processing with Large Language Model

Python 1,012 110 Updated Jan 15, 2026

Image composition toolbox: everything you want to know about image composition or object insertion

Python 714 52 Updated Mar 21, 2026

We'll look into audio categorization using deep learning principles like Artificial Neural Networks (ANN), 1D Convolutional Neural Networks (CNN1D), and CNN2D in this repository. We undertake some …

Jupyter Notebook 56 13 Updated Mar 8, 2022

AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension

Python 126 5 Updated Dec 9, 2024

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 2,061 165 Updated Apr 21, 2025

[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer

Python 224 16 Updated Nov 30, 2025

The calflops is designed to calculate FLOPs、MACs and Parameters in all various neural networks, such as Linear、 CNN、 RNN、 GCN、Transformer(Bert、LlaMA etc Large Language Model)

Python 929 40 Updated Jun 27, 2024

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Python 1,420 86 Updated Apr 21, 2025

code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)

Python 45 6 Updated May 9, 2022

This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fine-tuning of Audio Spectrogram Transformers via Soft Mixture…

Python 39 4 Updated Jul 31, 2024

A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models

Python 846 69 Updated Sep 13, 2023

cuML - RAPIDS Machine Learning Library

C++ 5,165 621 Updated Apr 1, 2026

A collection of implementations of adversarial domain adaptation algorithms

Python 645 107 Updated Sep 21, 2021

Mamba SSM architecture

Python 17,828 1,672 Updated Mar 30, 2026

A library built for easier audio self-supervised training, downstream tasks evaluation

Python 136 12 Updated Sep 25, 2025

This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"

Python 30 4 Updated Sep 18, 2023

A PyTorch-based Speech Toolkit

Python 11,405 1,676 Updated Mar 31, 2026
Next