Stars
Cli and MCP for gbox. Enable AI agents to operate Android/Browser/Desktop like human.
Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with co…
freeCodeCamp.org's open-source codebase and curriculum. Learn math, programming, and computer science for free.
LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.