The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 2,046 144 Updated Dec 19, 2025

ConceptAttention: A method for interpreting multi-modal diffusion transformers.

Jupyter Notebook 401 26 Updated Nov 13, 2025
Python 33 6 Updated Jan 7, 2022

Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach

Jupyter Notebook 12 Updated Oct 30, 2024

Simultaneous speech-to-text model

Python 9,270 912 Updated Dec 19, 2025

Official implementation of YingMusic-SVC.

Python 88 7 Updated Dec 15, 2025
Python 12 Updated Oct 13, 2025

The ArtificialSongGenerator automatically composes and compiles the Artificial Audio Multitrack dataset (AAM).

Java 26 5 Updated Nov 17, 2025
Python 4 Updated Dec 8, 2025

A powerful 3B-parameter, LLM-based reinforcement learning audio editing model that excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech

Python 787 52 Updated Dec 8, 2025

Omni SenseVoice: High-Speed Speech Recognition with word timestamps 🗣️🎯

Python 880 41 Updated Dec 10, 2025

Multilingual Voice Understanding Model

Python 7,189 668 Updated Aug 15, 2025
Python 9 Updated Dec 11, 2025

Official repository for the paper: Scaling Self-Supervised Representation Learning for Symbolic Piano Performance (ISMIR 2025)

Python 89 12 Updated Oct 8, 2025
Python 73 10 Updated Oct 16, 2025

Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRFM

Jupyter Notebook 25 1 Updated Oct 26, 2025

Wrapper around Panako for Spotify Sample ID internship work

Java 1 Updated Dec 13, 2024

An automatic sample identification (ASID) system using a contrastively trained GNN encoder.

Python 10 1 Updated Sep 21, 2025

Implementation of the experiments for "Semi-supervised Neural Chord Estimation Based on a Variational Autoencoder with Latent Chord Labels and Features"

Python 11 1 Updated Dec 3, 2020

Using ML for chord prediction in Jazz

Jupyter Notebook 1 Updated Apr 28, 2022

Chordify Annotator Subjectivity Dataset - A chord-label harmony dataset with multiple reference annotations per song

Python 67 6 Updated Jun 14, 2019

Official Implementation of paper "BACHI: Boundary-Aware Symbolic Chord Recognition Through Masked Iterative Decoding on Pop and Classical Music"

Python 4 Updated Nov 19, 2025

Companion resources for the paper 'Transcribing Rhythmic Patterns of the Guitar Track in Polyphonic Music'

Python 14 Updated Oct 8, 2025

Deep learning based dependency parsing for music sequences

Jupyter Notebook 24 3 Updated Jul 19, 2023

"Joint Transcription of Acoustic Guitar Strumming Directions and Chords" - ISMIR2025

Python 4 Updated Sep 21, 2025

Python library to handle musical chords.

Python 281 49 Updated Dec 31, 2023
Python 9 Updated Oct 10, 2025