Stars
AI-powered multi-voice audiobook generator — LLM script annotation, voice cloning, voice design, LoRA training, per-line style control, and export to MP3, chaptered M4B, or Audacity multi-track. Bu…
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
n8n skillset for Claude Code to build flawless n8n workflows
Universal skills loader for AI coding agents - npm i -g openskills
Cognee is the open-source AI memory platform for agents. Give your AI agents persistent long-term memory across sessions with a self-hosted knowledge graph engine.
Patterns and resources of low latency programming.
Gemma open-weight LLM library, from Google DeepMind
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
Fully automatic censorship removal for language models
Implement a reasoning LLM in PyTorch from scratch, step by step
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Efficient Part-level 3D Object Generation via Dual Volume Packing
🔊 Text-Prompted Generative Audio Model
An open-source AI agent that brings the power of Gemini directly into your terminal.
Open standard for machine learning interoperability
Anthropic's educational courses
Model Context Protocol Server for Mobile Automation and Scraping (iOS, Android, Emulators, Simulators and Real Devices)
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Robust Speech Recognition via Large-Scale Weak Supervision
[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/
The Frontend Stack for Agents & Generative UI. React, Angular, Mobile, Slack, and more. Makers of the AG-UI Protocol
GPT-4o-level, real-time spoken dialogue system.
自研零反射,零HooK,全动态化,插件化框架,全网唯一结合启动优化的插件化架构,适合小,中,大型项目均可的插件化架构
ASCII generator (image to text, image to image, video to video)