Real-time streaming voice anonymization & voice conversion

Python 47 5 Updated Feb 5, 2026

The best ChatGPT that $100 can buy.

Python 42,376 5,472 Updated Feb 5, 2026

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,696 3,329 Updated Feb 6, 2026

OceanHackWeek - Tutorials

55 82 Updated Aug 25, 2025

A lightweight psychoacoustic bass enhancement plugin - in stereo where available!

Rust 193 6 Updated Dec 30, 2023

A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗

Python 1,248 97 Updated Feb 4, 2026

PyTorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)

Python 467 39 Updated Sep 3, 2023

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google DeepMind, in PyTorch

Python 1,545 91 Updated Apr 24, 2025

A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.

Python 132 10 Updated Sep 19, 2025

An opinionated docker container for a web-interface around the music organizer beets

TypeScript 320 20 Updated Jan 25, 2026

Beets plugin to manage external files

Python 129 25 Updated Feb 6, 2026

🎛 Stemgen is a Stem file generator. Convert any track into a Stem and have fun with Traktor.

Python 265 49 Updated Sep 1, 2025

Download Tidal tracks, videos, albums, playlists & artists! Tidal downloader that supports master quality.

Python 289 25 Updated Jan 18, 2026

Audio Dataset for training CLAP and other models

Python 729 59 Updated Jan 8, 2026

Official implementation of the paper MGE-LDM: Joint Latent Diffusion for Simultaneous Music Generation and Source Extraction

Python 16 Updated Jul 25, 2025

Generative Model Evaluation Lab - An evaluation suite for your generative models.

Python 7 Updated Dec 13, 2024

an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM

Rust 30,003 2,710 Updated Feb 6, 2026

TorchCFM: a Conditional Flow Matching library

Python 2,285 187 Updated Nov 11, 2025

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 4,534 472 Updated Jan 25, 2026

Python 116 8 Updated Jan 26, 2026

Livecoding networked visuals in the browser

JavaScript 2,574 304 Updated Sep 14, 2025

Awesome list for vjing/visuals-related resources

336 14 Updated Feb 26, 2025

ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning. In ICCV, 2021.

Python 63 7 Updated Nov 18, 2021

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 2,045 162 Updated Apr 21, 2025

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 1,102 81 Updated Aug 14, 2025

Code, slides, and examples from my generative AI video course... taking you all the way from VAEs to near real-time Stable Diffusion with PyTorch and Hugging Face!

Jupyter Notebook 20 9 Updated Dec 19, 2024

Fine-tune Stable Audio Open with DiT ControlNet.

Python 249 9 Updated May 16, 2025

This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages WavLM and DistilHuBERT pre-trained speech models to create vocal embeddings and t…

Python 33 4 Updated Sep 4, 2022

Accurate and general beat tracker

Python 226 42 Updated Feb 2, 2026

Flexible LoRA Implementation to use with stable-audio-tools

Python 79 6 Updated Sep 9, 2024