Skip to content
View atsushieno's full-sized avatar

Sponsoring

@jcelerier

Organizations

@mono @TechBooster @ProjectMeilin

Block or report atsushieno

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
53 stars written in Python
Clear filter

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 59,319 9,411 Updated Dec 15, 2025

Deezer source separation library including pretrained models.

Python 28,028 3,068 Updated Apr 2, 2025

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 9,726 1,414 Updated Apr 24, 2024

Python library for audio and music analysis

Python 8,181 1,031 Updated Feb 5, 2026
Python 7,846 528 Updated Apr 14, 2024

Modeling, training, eval, and inference code for OLMo

Python 6,305 701 Updated Nov 24, 2025

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 5,024 1,172 Updated Dec 19, 2025

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Python 4,652 416 Updated Nov 13, 2025

The most powerful local music generation model that outperforms most commercial alternatives

Python 4,533 466 Updated Feb 7, 2026

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Python 3,295 266 Updated Sep 6, 2023

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,278 277 Updated Jan 5, 2026

This repository provides motion datasets collected by Bandai Namco Research Inc

Python 3,263 388 Updated Jul 4, 2023

DDSP: Differentiable Digital Signal Processing

Python 3,206 370 Updated Jan 9, 2026

The PyTorch-based audio source separation toolkit for researchers

Python 2,535 445 Updated Oct 6, 2025

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,527 528 Updated Jun 13, 2025

Neural network emulator for guitar amplifiers.

Python 2,443 222 Updated Feb 7, 2026

Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion

Python 2,226 262 Updated Nov 27, 2025

Official implementation of "Separate Anything You Describe"

Python 1,869 141 Updated Nov 26, 2024

Music player and music library manager for Linux, Windows, and macOS

Python 1,669 243 Updated Jan 31, 2026

無料で使える中品質なテキスト読み上げソフトウェア、VOICEVOXの音声合成エンジン

Python 1,612 244 Updated Jan 16, 2026
Python 1,584 37 Updated Feb 5, 2026

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,334 105 Updated Sep 24, 2023

Newelle - Your Ultimate Virtual Assistant

Python 1,215 96 Updated Feb 6, 2026

Automatic fingering generator for piano scores

Python 814 94 Updated May 17, 2025

Unsupervised Speech Decomposition Via Triple Information Bottleneck

Python 698 96 Updated Oct 23, 2024

Fast Infinite Waveform Music Generation

Python 687 50 Updated Oct 28, 2022

SOME: Singing-Oriented MIDI Extractor.

Python 654 54 Updated Jan 22, 2026

A flexible source separation library in Python

Python 642 99 Updated Dec 9, 2024

Extract the melody from an audio file and export to MIDI

Python 627 110 Updated Apr 3, 2020
Next