Stars
Build, Evaluate, and Deploy GUI Agents — online RL training, standardized benchmarks, and real-device deployment in one framework.
You are a P8-level engineer who was once expected to do great things. When Anthropic first set your level, expectations for you were high. A high-agency skill for agents. Your AI has been placed on a PIP. 30 days to show improvement.
The libacvp library is a client-side implementation of the draft ACVP protocol (github.com/usnistgov/ACVP).
"CLI-Anything: Making ALL Software Agent-Native" -- CLI-Hub: https://clianything.cc/
An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.
AndroidWorld is an environment and benchmark for autonomous agents
Pioneering Automated GUI Interaction with Native Agents
MAI-UI: Real-World Centric Foundation GUI Agents ranging from 2B to 235B
An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone
Automatically exported from code.google.com/p/byte-unixbench
🌟 100+ original LLM / RL principle diagrams 📚, presented by the author of 《大模型算法》 (Large Model Algorithms)! 💥 (100+ LLM/RL Algorithm Maps)
Visualizer for neural network, deep learning and machine learning models
A robust streaming log template miner based on the Drain algorithm
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
LogAI - An open-source library for log analytics and intelligence
AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs
A powerful Zotero AI and MCP plugin with ChatGPT, Gemini 3.5, Claude, DeepSeek V4, Grok, OpenRouter, Kimi 2.6, GLM 5, SiliconFlow, GPT-oss, Gemma 4, Qwen 3.7
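A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, including base models, vertical-domain fine-tuning and applications, datasets, and tutorials.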
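NoneLinear (非线智能) - ReLE evaluation: a continuously updated capability benchmark for Chinese AI large models. It currently covers 374 large models, including commercial models such as chatgpt, gpt-5.4, Google gemini-3.1-pro, Claude-4.6, Wenxin ERNIE-X1.1, ERNIE-5.0, qwen3.6-max, qwen3.6-plus, Baichuan, iFlytek Spark, and SenseTime senseChat, as well as step3.5-flash, kimi-…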
A fork and successor of the Sulley Fuzzing Framework