OpenMusic: SOTA Text-to-music (TTM) Generation
-
Updated
Jun 26, 2025 - Python
OpenMusic: SOTA Text-to-music (TTM) Generation
Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.
Mustango: Toward Controllable Text-to-Music Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Local windowed attention multi-instrumental music transformer for supervised music generation
AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
some generative audio tools for ComfyUI
Portable AI music generator — full songs with vocals, covers, music videos. One-click install, 100% offline, NVIDIA GPU.
MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners [ICML 2025]
SOTA Google's Perceiver-AR Music Transformer Implementation and Model
Turn your words into music! Describe a sound (e.g., happy, spooky) and this app generates a short piece based on your text.
[ICASSP'24] Investigating Personalization Methods in Text to Music Generation
[SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-velocity and outro tokens
[DEPRECIATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, optimized for speed, efficiency, and performance
【AIGC 实战入门笔记 —— AIGC 摩天大楼】分享 大语言模型(LLMs),大模型高效微调(SFT),检索增强生成(RAG),智能体(Agent),PPT自动生成, 角色扮演,文生图(Stable Diffusion) ,图像文字识别(OCR),语音识别(ASR),语音合成(TTS),人像分割(SA),多模态(VLM),Ai 换脸(Face Swapping), 文生视频(VD),图生视频(SVD),Ai 动作迁移,Ai 虚拟试衣,数字人,全模态理解(Omni),Ai音乐生成 干货学习 等 实战与经验。
A ComfyUI suite of nodes for Pollinations, LM Studio, Copilot CLI, and OpenAI-compatible generation with prompt enhancing, image gen, video gen, speech/audio gen, for local/cloud multi-engine workflows.
Python library for generating ambient music from text descriptions. No GPU required. Turn text into sound with a single line of code.
Exploring Bark, the Open-Source Text-to-Audio Generative Model
ComfyUI custom nodes for Suno — generate, remix, extend, and shape AI music inside ComfyUI via the muapi.ai API.
Portable offline AI audio studio with web UI & local API – XTTS, Fish Speech, Kokoro, Stable Audio, ACE-Step, voice cloning, music gen (no install)
Add a description, image, and links to the text-to-music topic page so that developers can more easily learn about it.
To associate your repository with the text-to-music topic, visit your repo's landing page and select "manage topics."