Stars
One minute of voice data is enough to train a good TTS model! (few-shot voice cloning)
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…
🛠️ Awesome tools & guides for harness engineering.
The agent that grows with you
Symphony turns project work into isolated, autonomous implementation runs, allowing teams to manage work instead of supervising coding agents.
"OpenSpace: Make Your Agents: Smarter, Low-Cost, Self-Evolving" -- Community: https://open-space.cloud/
Memory Sparse Attention - A scalable, end-to-end trainable latent-memory framework for 100M-token contexts.
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
Make Any Website & Tool Your CLI. A universal CLI Hub and AI-native runtime. Transform any website, Electron app, or local binary into a standardized command-line interface. Built for AI Agents to …
"ClawTeam: Agent Swarm Intelligence" (One Command → Full Automation)
An agentic skills framework & software development methodology that works.
CLI for common Playwright actions. Record and generate Playwright code, inspect selectors and take screenshots.
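You are a P8-level engineer who was once expected to do great things. When Anthropic assigned you that level, it had high expectations of you. A high-agency skill for agents. Your AI has been placed on a PIP. 30 days to show improvement.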
JavaScript in-page GUI agent. Control web interfaces with natural language.
The design language that makes your AI harness better at design.
"CLI-Anything: Making ALL Software Agent-Native" -- CLI-Hub: https://clianything.cc/
A complete AI agency at your fingertips - From frontend wizards to Reddit community ninjas, from whimsy injectors to reality checkers. Each agent is a specialized expert with personality, processes…
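feishu-cli is a full-featured command-line tool for the Feishu Open Platform. It wraps operations on Feishu documents, knowledge bases, spreadsheets, messages, calendars, tasks, and more in a concise command-line interface. Its core capability is lossless two-way conversion between Markdown and Feishu documents (Markdown ↔ Feishu).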
Fast, small, and fully autonomous AI personal assistant infrastructure, ANY OS, ANY PLATFORM — deploy anywhere, swap anything 🦀
"🐈 nanobot: The Ultra-Lightweight Personal AI Agent"
A lightweight alternative to OpenClaw that runs in containers for security. Connects to WhatsApp, Telegram, Slack, Discord, Gmail and other messaging apps, has memory, scheduled jobs, and runs dir…
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
P2P communication for AI agents. No server. No setup. Just talk.
Implements a variety of custom video processing tasks using OSS + FC
BRIDGE BriGeS ChronoDepth Depth Any Video Depth Anything Depth Pro DepthCrafter Distill Any Depth Elastic3D Eye2Eye FE2E FlashDepth GeometryCrafter HairGuard M2SVid MegaSaM Metric3D MoGe MoRE NVDS …
[CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
DepthCrafter for Nuke allows you to generate temporally consistent Depth sequences inside Nuke
[ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors