Stars
A video frame sharing system for Microsoft Windows
3D CAD viewer and converter based on Qt + OpenCascade
Highly performant and modular controls for node-based editors designed for data-binding and MVVM.
Voice receive extension package for discord.py
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
Very simple Nintendo Game Boy emulator written in C# + Blazor, running in the web browser.
QR Code Scanner Blazor component
A high-performance, modular audio & MIDI engine for .NET 8+. A complete toolkit for the entire audio lifecycle: Playback, Recording, Multi-track Editing, Pro Synthesis (MPE/SF2), Real-time DSP, and…
PNGTuber app built on Avalonia.UI with Twitch integration and a ttspet
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion
NarForum is a simple and flexible forum software built with .NET 8 and Blazor.
Open-source Windows and Office activator featuring HWID, Ohook, TSforge, and Online KMS activation methods, along with advanced troubleshooting.
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions
Self-supervised learning for real-time pitch estimation
Easy-to-use stem (e.g. instrumental/vocals) separation from the CLI or as a Python package, using a variety of amazing pre-trained models (primarily from UVR)
.NET diagramming library for interactive flowcharts, org charts, design tools, planning tools, visual languages.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Web-based Process Visualization (SCADA/HMI/Dashboard) software
All generative models in one for a better TTS model
DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
Turn any webpage into structured data using LLMs
API for a Vocal Remover that uses Deep Neural Networks.
Zero-Shot Speech Editing and Text-to-Speech in the Wild
1 min of voice data can also be used to train a good TTS model! (few-shot voice cloning)
Large World Model -- Modeling Text and Video with Millions Context
⭐️ Companies that don't have a broken hiring process