Stars
SongGLM: Lyric-to-Melody Generation with 2D Alignment Encoding and Multi-Task Pre-Training
[MMM2025] Official repository for Music2MIDI: Pop Music to MIDI Piano Cover Generation
A curated list of Datasets, Models and Papers for Music Emotion Recognition (MER)
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
MuChin: A Chinese Colloquial Description Benchmark for Evaluating Language Models in the Field of Music
Background Removal written with swift using u2net model
MIDI / symbolic music tokenizers for Deep Learning models 🎶
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
The official GitHub page for the survey paper "A Survey of Large Language Models".
Algorithm and Data for paper "Automatic Detection of Hierarchical Structure and Influence of Structure on Melody, Harmony and Rhythm in Popular Music"
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
Official Implementation for the ICML2022 paper "Directed Acyclic Transformer for Non-Autoregressive Machine Translation"
Text and docs related to Astroport and associated dashboards
YujxZJCN / SpecAugment
Forked from DemisEom/SpecAugmentA Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
YujxZJCN / Swin-Transformer-Semantic-Segmentation
Forked from SwinTransformer/Swin-Transformer-Semantic-SegmentationThis is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Semantic Segmentation.
YujxZJCN / espnet
Forked from espnet/espnetEnd-to-End Speech Processing Toolkit
YujxZJCN / muzic
Forked from microsoft/muzicMuzic: Music Understanding and Generation with Artificial Intelligence
Decode *.ncm file (NetEase Cloud Music format)
Muzic: Music Understanding and Generation with Artificial Intelligence
PyTorch implementation of the paper ‟Beyond Narrative Description: Generating Poetry from Images” by B. Liu et al., 2018.