-
autonoma Public
Forked from Autonoma-AI/autonoma
Open-source testing platform where AI agents navigate your app end-to-end and catch regressions on every PR. No test code required.
TypeScript Other Updated Apr 3, 2026 -
chat-langchain Public
Forked from langchain-ai/chat-langchain
Python MIT License Updated Mar 30, 2026 -
mobile-mcp Public
Forked from mobile-next/mobile-mcp
Model Context Protocol Server for Mobile Automation and Scraping (iOS, Android, Emulators, Simulators and Real Devices)
TypeScript Apache License 2.0 Updated Mar 25, 2026 -
DataFlow Public
Forked from OpenDCAI/DataFlow
Easy Data Preparation with latest LLMs-based Operators and Pipelines.
Python Apache License 2.0 Updated Mar 17, 2026 -
playwriter Public
Forked from remorses/playwriter
Chrome extension to let agents control your browser. Runs Playwright snippets in a stateful sandbox. Available as CLI or MCP
HTML MIT License Updated Mar 4, 2026 -
eigent Public
Forked from eigent-ai/eigent
Eigent: The Open Source Cowork Desktop to Unlock Your Exceptional Productivity.
TypeScript Apache License 2.0 Updated Feb 27, 2026 -
openclaw Public
Forked from openclaw/openclaw
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
TypeScript MIT License Updated Feb 26, 2026 -
owl Public
Forked from camel-ai/owl
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
Python Updated Feb 23, 2026 -
PaddleOCR Public
Forked from PaddlePaddle/PaddleOCR
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Python Apache License 2.0 Updated Feb 16, 2026 -
GUI-Actor Public
Forked from microsoft/GUI-Actor
[NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
Python MIT License Updated Feb 11, 2026 -
Qwen3-VL Public
Forked from QwenLM/Qwen3-VL
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Jupyter Notebook Apache License 2.0 Updated Jan 30, 2026 -
react-sip-kit Public
Forked from shervin-ghajar/react-sip-kit
A web phone aiming to ease real time communication between your contacts
TypeScript MIT License Updated Jan 15, 2026 -
agentic_rag_project Public
Forked from serkanyasr/agentic_rag_project
Scalable Agentic RAG system using Pydantic AI, FastAPI & pgvector. Modular, production-ready foundation for document-based AI apps
Python MIT License Updated Nov 3, 2025 -
yoflo-gui Public
Forked from CharlesCNorton/yoflo-gui
Real-time object detection using Florence-2 with a user-friendly GUI.
Python MIT License Updated Aug 7, 2025 -
webrtc-java Public
Forked from devopvoid/webrtc-java
WebRTC for desktop platforms running Java
C++ Apache License 2.0 Updated Jul 25, 2025 -
ari-proxy Public
Forked from retel-io/ari-proxy
Ari-proxy connects Asterisk, an open source communication server, to the Apache Kafka distributed streaming platform.
Java GNU Affero General Public License v3.0 Updated Jul 18, 2025 -
UGround Public
Forked from OSU-NLP-Group/UGround
[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
Python MIT License Updated Jul 18, 2025 -
SeeClick Public
Forked from njucckevin/SeeClick
The model, data and code for the visual GUI Agent SeeClick
HTML Apache License 2.0 Updated Jul 13, 2025 -
mjSIP Public
Forked from haumacher/mjSIP
mjSIP - a complete Java-based SIP stack implementation
Java GNU General Public License v2.0 Updated May 11, 2025 -
Aria-UI Public
Forked from AriaUI/Aria-UI
Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents
Python Updated Feb 8, 2025 -
PDF-Extract-Kit Public
Forked from opendatalab/PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Python GNU Affero General Public License v3.0 Updated Jan 3, 2025 -
visual-gui-grounding Public
Forked from krsx/visual-gui-grounding
Advanced Visual-Only GUI Grounding Framework with Visual Segmentation Model and Large Language Model
Python Updated Dec 23, 2024 -
RealTimeOCR Public
Forked from AmmarMohamed0/RealTimeOCR
RealTimeOCR is a computer vision tool combining YOLO for real-time object detection and PaddleOCR for text recognition in video streams, with customizable ROIs for precise targeting.
Python MIT License Updated Oct 5, 2024 -
chartbrew Public
Forked from chartbrew/chartbrew
Open-source web platform used to create live reporting dashboards from APIs, MongoDB, Firestore, MySQL, PostgreSQL, and more 📈📊
JavaScript MIT License Updated Aug 28, 2024 -
rsocket-py Public
Forked from rsocket/rsocket-py
RSocket implementation in Python
Python MIT License Updated Aug 21, 2024 -
whisper Public
Forked from openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Python MIT License Updated Aug 8, 2024 -
waha Public
Forked from devlikeapro/waha
WAHA - WhatsApp HTTP API (REST API) that you can configure in a click! Two engines: chromium-based WEBJS and pure-websocket NOWEB
TypeScript Apache License 2.0 Updated Aug 7, 2024 -
jitsi-meet Public
Forked from jitsi/jitsi-meet
Jitsi Meet - Secure, Simple and Scalable Video Conferences that you use as a standalone app or embed in your web application.
TypeScript Apache License 2.0 Updated Aug 6, 2024