Workflow orchestration UI and node editor for your own Python codebase

TypeScript 39 2 Updated Oct 30, 2024
C# 284 28 Updated Sep 9, 2024

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 8,643 966 Updated Oct 23, 2025

C++ library for converting text to phonemes for Piper

C++ 134 110 Updated Jul 10, 2025

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 18,612 1,974 Updated Oct 21, 2025

I've been trying quite hard to use the IVONA Amy voice on Linux natively, from trying to reverse engineer the APKs and DLL files, to hacking Waydroid to be compatible, and port forwarding/ssh via …

Shell 3 Updated Jun 30, 2023

All Algorithms implemented in Python

Python 212,583 49,164 Updated Nov 6, 2025

A Jupyter widgets-based interactive notebook for Google Colab to generate images using Stable Diffusion.

Jupyter Notebook 18 11 Updated Dec 13, 2023

fMRI-to-image reconstruction on the NSD dataset.

Jupyter Notebook 352 50 Updated May 22, 2024
Python 55 15 Updated Mar 13, 2024

A Colab notebook repo for using the Diffusers library (not a web UI)

Jupyter Notebook 19 Updated Oct 2, 2023

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Python 1,602 175 Updated Aug 27, 2025

Godot Engine – Multi-platform 2D and 3D game engine

C++ 102,968 23,531 Updated Nov 6, 2025

VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram

Python 260 32 Updated Jul 25, 2024

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs

Python 12,562 2,729 Updated Jun 22, 2025

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Python 9,369 2,785 Updated Aug 13, 2024

A curated list of open source projects used in nuclear science and engineering

421 82 Updated Nov 3, 2025

Demo Programs for the "Talking Head(?) Anime from a Single Image 3: Now the Body Too" Project

Python 1,011 105 Updated Aug 29, 2023

An MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.

TypeScript 8,943 915 Updated Feb 13, 2025

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 13,333 2,547 Updated Jun 26, 2024

A Chess Bot powered by OpenAI's ChatGPT

Python 22 7 Updated Mar 7, 2024

A multi document reader and chatbot using LangChain and ChatGPT

Python 143 54 Updated Feb 7, 2024

Template for duplicating and executing Hugging Face Spaces on SM Studio Lab, Google Colab, or locally.

Jupyter Notebook 11 2 Updated Jan 9, 2023

📚 A collection of sketch based application papers.

671 65 Updated Nov 6, 2025

Panel: The powerful data exploration & web app framework for Python

Python 5,508 567 Updated Nov 6, 2025

A list of awesome beginner-friendly projects.

79,452 7,575 Updated Oct 2, 2025

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, …

TypeScript 2,719 285 Updated Nov 1, 2025

A timeline of the latest AI models for audio generation, starting in 2023!

1,904 69 Updated Jan 4, 2024

Finetuning VITS Efficiently

Python 33 6 Updated Nov 6, 2023