Highlights
Lists (7)
Sort Name ascending (A-Z)
Stars
A WebDriver server for iOS and tvOS
Universal skills loader for AI coding agents - npm i -g openskills
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
A simple screen parsing tool towards pure vision based GUI agent
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient task execution.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Valdi is a cross-platform UI framework that delivers native performance without sacrificing developer velocity.
AndroidWorld is an environment and benchmark for autonomous agents
💫 Toolkit to help you get started with Spec-Driven Development
Spec-driven development for AI coding assistants.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
AI-powered reverse engineering assistant that bridges IDA Pro with language models through MCP.
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
Trae Agent is an LLM-based agent for general purpose software engineering tasks.
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
LLM agents built for control. Designed for real-world use. Deployed in minutes.
An on-device debugger/JIT enabler for iOS versions 17.4+, powered by idevice.
Automate your mobile devices with natural language commands - an LLM agnostic mobile Agent 🤖
MS-Agent: Lightweight Framework for Empowering Agents with Autonomous Exploration in Complex Task Scenarios
The open-source CapCut alternative
The glamourous AI coding agent for your favourite terminal 💘
Enable Apple Intelligence on Macs sold in Mainland China with SIP enabled, tested on MacOS 15.4.1+ and 26+
Painless E2E Automation for Mobile and Web