Stars
Driving all platforms UI automation with vision-based model
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
DigitalPlat FreeDomain: Free Domain For Everyone
Neovate Code is a code agent to enhance your development. You can use it to generate code, fix bugs, review code, add tests, and more. You can run it in interactive mode or headless mode.
🥢像老乡鸡🐔那样做饭。主要部分于2024年完工,非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》,并做归纳、编辑与整理。CookLikeHOC.
#1 PDF Application on GitHub that lets you edit PDFs on any device anywhere
High performance self-hosted photo and video management solution.
Misc; latest version of waifu2x; 2D video to stereo 3D video conversion
Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.
An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.
ScriptCat, a browser extension that can execute userscript; 脚本猫,一个可以执行用户脚本的浏览器扩展
Tongyi Deep Research, the Leading Open-source Deep Research Agent
A cross-platform bilibili toolbox. 跨平台哔哩哔哩工具箱,支持下载视频、番剧等等各类资源
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.
Cross-platform, customizable ML solutions for live and streaming media.
Chrome MCP Server is a Chrome extension-based Model Context Protocol (MCP) server that exposes your Chrome browser functionality to AI assistants like Claude, enabling complex browser automation, c…
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
The Cursor for Designers • An Open-Source AI-First Design tool • Visually build, style, and edit your React App with AI
Open-source, secure environment with real-world tools for enterprise-grade agents.
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
[CVPR 2025] RollingDepth: Video Depth without Video Models
Use Hugging Face with JavaScript
Generate stereogram images (popularized as "Magic Eye") in the browser
A high-quality PDF to Markdown tool based on large language model visual recognition. 一款基于大模型视觉识别的高质量PDF转Markdown工具