Lists (21)
Agent
Agent Framework
AutoTools
Automation tools
Coding Skill
Components
database
develop thought
Development thoughts
Finance
game
Graph Rag Agent
Graph Rag Agent
Learning
LLM
prompt
Prompt-related repositories that could later be turned into skills
RAG
Safety
Sandbox
Skill
Tools
Training LLMs
Frontend
司南 (Sinan)
Stars
OmX - Oh My codeX: Your codex is not alone. Add hooks, agent teams, HUDs, and so much more.
Runnable, modifiable Claude Code source code
A cross-platform desktop All-in-One assistant tool for Claude Code, Codex, OpenCode, openclaw & Gemini CLI.
The repo is finally unlocked. Enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.
Hello, Lobster! 🙋♀️ Adopt from scratch and build your first claw 🦞 Come adopt your first lobster!
📚 "Building Agents from Scratch": a from-scratch tutorial on agent principles and practice
Replace 'hub' with 'ingest' in any GitHub URL to get a prompt-friendly extract of a codebase
🛠️ Awesome tools & guides for harness engineering.
A skill that gives Claude Code full web access: three-tier channel scheduling + browser CDP + parallel divide-and-conquer
TradingAgents: Multi-Agents LLM Financial Trading Framework
Spec-driven development (SDD) for AI coding assistants.
Fast, accurate & comprehensive text measurement & layout
Blazing 💥 fast terminal-ui for git written in rust 🦀
Build and run agents you can see, understand and trust.
SafeVerse: A Generative Evolution Arena for Trustworthy Embodied AI
A simulation platform for versatile Embodied AI research and development.
Flames is a highly adversarial Chinese-language benchmark for evaluating LLM harmlessness, developed by Shanghai AI Lab and Fudan NLP Group.
[ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety
Official Repo of "MMBench: Is Your Multi-modal Model an All-around Player?"
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows
The All-in-one Judge Models introduced by Opencompass
A unified evaluation toolkit and leaderboard for rigorously assessing the scientific intelligence of large language and vision–language models across the full research workflow.
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.