Starred repositories
Automated workflow to generate TTS datasets using A.I.
Music-to-cover generator using music style transfer (timbre and other attributes)
HT-Demucs & Spleeter in one UI live on HuggingFace
A full-stack project based on OpenAI's Whisper that transcribes audio and video files, built with React & Django.
On-device neural TTS for React Native. 31 languages. No API keys.
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
LLM Council works together to answer your hardest questions
A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.
🪨 why use many token when few token do trick — Claude Code skill that cuts 65% of tokens by talking like caveman
An AI skill that provides design intelligence for building professional UI/UX across multiple platforms
700+ curated UI sound effects for modern web apps. Browse, preview, and install sounds with a single command. Free and open source
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
An open-source, GPU-accelerated physics simulation engine built upon NVIDIA Warp, specifically targeting roboticists and simulation researchers.
U-Boot builds for Orange Pi 5 (and variants)
Ollama alternative for Rockchip NPU: an efficient solution for running AI and deep learning models on Rockchip devices with optimized NPU support (rkllm)
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Open-source LLM knowledge platform: turn raw documents into a queryable RAG, an autonomous reasoning agent, and a self-maintaining Wiki.
This repo powers my experiment where ChatGPT manages a real-money micro-cap stock portfolio.
A collection of 🤗 Transformers.js demos and example applications
[NeurIPS '25] Benchmark for evaluating TTS models on complex prosodic, expressive, and linguistic challenges.
50+ tutorials and implementations for Generative AI Agent techniques, from basic conversational bots to complex multi-agent systems.
A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.
100+ AI Agent & RAG apps you can actually run — clone, customize, ship.
It generates a valid poToken with visitorData fetched from YouTube.
A collection of resources and papers on the Vector Quantized Variational Autoencoder (VQ-VAE) and its applications
Uses Fourier-space properties to speed up inference and take advantage of parallelization.
Hypernetworks that adapt LLMs to specific benchmark tasks using only a textual task description as input