Stars
FFmpeg bindings for Node.js. Features both low-level and high-level APIs, full hardware acceleration, TypeScript support, and modern async patterns
Super-expressive prompting model based on ltx2.3
Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative modeling.
TokenSpeed is a speed-of-light LLM inference engine.
🎨 Local-first, open-source alternative to Anthropic's Claude Design. ⚡ 19 Skills · ✨ 71 brand-grade Design Systems 🖼 Generate web · desktop · mobile prototypes · slides · images · videos · HyperFra…
ArtifactFS is a filesystem driver designed to mount large git repos as quickly as possible, hydrating file contents on-the-fly instead of blocking on the initial clone. It's ideal for agents, sandb…
Write HTML. Render video. Built for agents.
Agent skills for Manim to create 3Blue1Brown style animations.
Open-source alternative to AI video platforms — Free AI image & video generation studio with 200+ models (Flux, Midjourney, Kling, Sora, Veo). No content filters. Self-hosted, MIT licensed.
Create stunning demos for free. Open-source, no subscriptions, no watermarks, and free for commercial use. An alternative to Screen Studio.
Multilingual neural TTS (6 languages: JA/EN/ZH/ES/FR/PT, code supports SV) — C++, C#, Rust, Go, Python, npm (WASM). VITS + Prosody, streaming, CUDA/CoreML/DirectML. pip install piper-plus | npm ins…
LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar
C++17 library for creating macOS Audio Server plugins.
The open-source AI voice studio. Clone, dictate, create.
Your Creative Copilot for Video Editing
The official repo of UL-UNAS, an ultra-lightweight SE model.
A SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc modules. FireRedASR2 supports Chinese (Mandarin, 20+ dialects/accents), English, code-switching, and both speech and singi…
A tool for running and customizing real-time, interactive generative AI pipelines and models
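Banana Supermarket (香蕉超市) | One-click generation for all kinds of creative modes, no prompts required, with support for local brush selection and continuous editing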
Create graphics with a hand-drawn, sketchy appearance
An open source collection of animated, interactive & fully customizable React components for building memorable websites.
A Semantically Consistent Dataset for Data-Efficient Query-Based Universal Sound Separation