Stars
.NET DSP library with a lot of audio processing functions
Video, Image and GIF upscale/enlarge (Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IF…
A node-based image processing GUI aimed at making chaining image processing tasks easy and customizable. Born as an AI upscaling application, chaiNNer has grown into an extremely flexible and power…
Repo for SeedVR2 & SeedVR (CVPR2025 Highlight)
Realtime Voice Changer
ncnn is a high-performance neural network inference framework optimized for the mobile platform
SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training
Official SeedVR2 Video Upscaler for ComfyUI
A machine learning-based video super resolution and frame interpolation framework. Est. Hack the Valley II, 2018.
Misc; latest version of waifu2x; 2D video to stereo 3D video conversion
Kinethreads: Soft Full-Body Haptic Exosuit using Low-Cost Motor-Pulley Mechanisms
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
SoftVC VITS Singing Voice Conversion
A handy quick tool for blocking mechanical keyboard chatter.
Easily train a good VC model with voice data <= 10 mins!
A recreation of Neuro-Sama originally created in 7 days.
A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.
Lightweight C# development environment for VSCode
Stable Diffusion web UI
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
1 min voice data can also be used to train a good TTS model! (few-shot voice cloning)
Faster Whisper transcription with CTranslate2
Cross-platform .NET/Mono bindings for LibVLC
Cross-platform, customizable ML solutions for live and streaming media.