Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 9,333 785 Updated Mar 26, 2026

ASCEND Chinese-English code-switching dataset

Jupyter Notebook 33 Updated Jul 12, 2022

Google Chromium, sans integration with Google

Python 26,891 1,199 Updated Jun 12, 2026

Skills for Real Engineers. Straight from my .claude directory.

Shell 130,240 11,366 Updated Jun 12, 2026

Peer-to-peer protocol for voice assistants

Python 368 49 Updated Jun 12, 2026

Command line utility for forced alignment using Kaldi

Python 1,834 287 Updated Jun 11, 2026

Robust Speech Recognition via Large-Scale Weak Supervision

Python 102,796 12,541 Updated Apr 15, 2026

[ICASSP'26] Real-time streaming voice anonymization & voice conversion

Python 76 9 Updated Apr 15, 2026

The best ChatGPT that $100 can buy.

Python 55,078 7,508 Updated May 5, 2026

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 17,386 3,437 Updated Jun 16, 2026

OceanHackWeek - Tutorials

55 82 Updated Feb 12, 2026

A lightweight psychoacoustic bass enhancement plugin - in stereo where available!

Rust 206 10 Updated Dec 30, 2023

A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗

Python 1,529 119 Updated Jun 13, 2026

Pytorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)

Python 474 40 Updated Sep 3, 2023

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

Python 1,545 94 Updated Apr 24, 2025

A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.

Python 144 13 Updated Sep 19, 2025

An opinionated docker container for a web-interface around the music organizer beets

TypeScript 406 28 Updated Jun 11, 2026

Beets plugin to manage external files

Python 134 27 Updated Jun 15, 2026

🎛 Stemgen is a Stem file generator. Convert any track into a Stem and have fun with Traktor.

Python 284 50 Updated Sep 1, 2025

Download Tidal tracks, videos, albums, playlists & artists! Tidal downloader that supports master quality.

Python 566 70 Updated Jun 9, 2026

Audio Dataset for training CLAP and other models

Python 740 59 Updated Jan 8, 2026

Official implementation of the paper MGE-LDM: Joint Latent Diffusion for Simultaneous Music Generation and Source Extraction

Python 20 1 Updated Feb 19, 2026

Generative Model Evaluation Lab - An evaluation suite for your generative models.

Python 7 Updated Dec 13, 2024

an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM

Rust 49,507 5,233 Updated Jun 16, 2026

TorchCFM: a Conditional Flow Matching library

Python 2,494 214 Updated Apr 20, 2026

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 4,928 501 Updated Jun 13, 2026
Python 121 10 Updated Jun 11, 2026

Livecoding networked visuals in the browser

JavaScript 2,658 314 Updated Apr 25, 2026

Awesome list for vjing/visuals-related resources

357 14 Updated Feb 26, 2025

ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning. In ICCV, 2021.

Python 64 7 Updated Nov 18, 2021