-
Xi'an Lianfeng Acoustic Technologies Co., Ltd.
- https://jishengbai.github.io
Stars
A mirror of BigVGAN and HiFi-GAN for access via PyTorch Hub.
Official PyTorch implementation of BigVGAN (ICLR 2023)
Unleash Next-Level AI! 🚀 💻 Code Generation: DeepSeek r1 + Claude 3.7 Sonnet - Unparalleled Performance! 📝 Content Creation: DeepSeek r1 + Gemini 2.5 Pro - Superior Quality! 🔌 OpenAI-Compatible. 🌊 S…
每个人都能看懂的大模型知识分享,LLMs春/秋招大模型面试前必看,让你和面试官侃侃而谈
Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
🧑🚀 全世界最好的LLM资料总结(多模态生成、Agent、辅助编程、AI审稿、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.
Collection of awesome test-time (domain/batch/instance) adaptation methods
WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection
Let your Claude able to think
A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline
A Framework for Speech, Language, Audio, Music Processing with Large Language Model
Image composition toolbox: everything you want to know about image composition or object insertion
We'll look into audio categorization using deep learning principles like Artificial Neural Networks (ANN), 1D Convolutional Neural Networks (CNN1D), and CNN2D in this repository. We undertake some …
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer
The calflops is designed to calculate FLOPs、MACs and Parameters in all various neural networks, such as Linear、 CNN、 RNN、 GCN、Transformer(Bert、LlaMA etc Large Language Model)
MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)
This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fine-tuning of Audio Spectrogram Transformers via Soft Mixture…
A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models
A collection of implementations of adversarial domain adaptation algorithms
A library built for easier audio self-supervised training, downstream tasks evaluation
This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"