Skip to content
View R3gm's full-sized avatar

Block or report R3gm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

workflow orchestration UI and nodes editor for your own python codebase

TypeScript 39 3 Updated Oct 30, 2024
C# 282 28 Updated Sep 9, 2024

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 8,679 971 Updated Nov 9, 2025

C++ library for converting text to phonemes for Piper

C++ 134 111 Updated Jul 10, 2025

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 18,705 1,987 Updated Oct 21, 2025

I've been trying quite hard to use the IVONA Amy voice on Linux natively, from trying to reverse engineer the APK's and dll files, to hacking Waydroid to be compatible, and port forwarding/ssh via …

Shell 3 Updated Jun 30, 2023

All Algorithms implemented in Python

Python 213,038 49,245 Updated Nov 6, 2025

A Jupyter widgets-based interactive notebook for Google Colab to generate images using Stable Diffusion.

Jupyter Notebook 18 11 Updated Dec 13, 2023

fMRI-to-image reconstruction on the NSD dataset.

Jupyter Notebook 353 50 Updated May 22, 2024
Python 55 15 Updated Mar 13, 2024

a colab notebook repo for using Diffusers library (not a webui)

Jupyter Notebook 19 Updated Oct 2, 2023

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Python 1,606 175 Updated Aug 27, 2025

Godot Engine – Multi-platform 2D and 3D game engine

C++ 103,213 23,572 Updated Nov 13, 2025

VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram

Python 261 32 Updated Jul 25, 2024

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 12,577 2,733 Updated Jun 22, 2025

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Python 9,394 2,793 Updated Aug 13, 2024

A curated list of open source projects used in nuclear science and engineering

423 82 Updated Nov 3, 2025

Demo Programs for the "Talking Head(?) Anime from a Single Image 3: Now the Body Too" Project

Python 1,011 105 Updated Aug 29, 2023

A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.

TypeScript 8,959 918 Updated Feb 13, 2025

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 13,352 2,550 Updated Jun 26, 2024

A Chess Bot powered by OpenAI's ChatGPT

Python 22 7 Updated Mar 7, 2024

A multi document reader and chatbot using LangChain and ChatGPT

Python 143 54 Updated Feb 7, 2024

template for duplicating and executing Hugging Face Spaces either on SM Studio Lab, Google Colab, or locally.

Jupyter Notebook 11 2 Updated Jan 9, 2023

📚 A collection of sketch based application papers.

673 66 Updated Nov 13, 2025

Panel: The powerful data exploration & web app framework for Python

Python 5,513 567 Updated Nov 12, 2025

A list of awesome beginners-friendly projects.

79,680 7,584 Updated Nov 12, 2025

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, …

TypeScript 2,738 285 Updated Nov 12, 2025

A timeline of the latest AI models for audio generation, starting in 2023!

1,903 70 Updated Jan 4, 2024

Finetuning VITS Efficiently

Python 33 6 Updated Nov 6, 2023
Next