Stars
Cli and MCP for gbox. Enable AI agents to operate Android/Browser/Desktop like human.
Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, DeepSeek, and more. Simple declarative configs with command li…
freeCodeCamp.org's open-source codebase and curriculum. Learn math, programming, and computer science for free.
LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.