Starred repositories
🎭 193 个即插即用的 AI 专家角色 — 支持 OpenClaw/Claude Code/Cursor/Copilot 等 14 种工具,覆盖工程/设计/营销/产品等 18 个部门。含 46 个中国市场原创智能体(小红书/抖音/微信/飞书/钉钉等)
A complete AI agency at your fingertips - From frontend wizards to Reddit community ninjas, from whimsy injectors to reality checkers. Each agent is a specialized expert with personality, processes…
京东风格的移动端 Vue 组件库,支持多端小程序(A Vue.js UI Toolkit for Mobile Web)
A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows
一个自动撰写小红书笔记,自动生成图片,自动发布的 Skills
Chrome extension to let agents control your browser. Runs Playwright snippets in a stateful sandbox. Available as CLI or MCP
🎉 基于Spring Boot、Spring Cloud & Alibaba、Vue3 & Vite、Element Plus的分布式前后端分离微服务架构权限管理系统
Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
Chrome DevTools for coding agents
Automate browser based workflows with AI
Practice English, one strike, one step forward; 练习英语,一次敲击,一点进步;
No fortress, purely open ground. OpenManus is Coming.
[ICLR 2026] A Training-free Iterative Framework for Long Story Visualization
GUI for a Vocal Remover that uses Deep Neural Networks.
Background Remover lets you Remove Background from images and video using AI with a simple command line interface that is free and open source.
记录 36Kr、bilibili、GitHub、抖音、掘金、微信读书平台从 2023-10-25 日至今的热点榜。每小时抓取一次数据,按天归档。非当年数据归档到 Releases 中
A feature-rich command-line audio/video downloader
All Algorithms implemented in Python
A Unified Toolkit for Deep Learning Based Document Image Analysis
MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient CPU-based inference in a local offline on Windows.
kuyacai / one-api
Forked from songquanpeng/one-apiOpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistributi…
OCR, layout analysis, reading order, table recognition in 90+ languages
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Enhances Tesseract OCR output using LLMs (local or API) for error correction, smart chunking, and markdown formatting of scanned PDFs
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone