Stars
Control and limit battery charging on Apple Silicon MacBooks.
Never Miss a Claude Code Renewal Window Again!
Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"
[EMNLP 2025] DiagramEval: Evaluating LLM-Generated Diagrams via Graphs
[ACL'25 Main] Graph of Records: Boosting Retrieval Augmented Generation for Long-context Summarization with Graphs
GTAlign applies game-theoretic principles to fine-tune reasoning LLMs.
Time-R1: Framework and resources for endowing LLMs with comprehensive temporal reasoning (understanding, prediction, creative generation) using a novel three-stage RL curriculum. Includes the Time-…
[NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning
Claude API Proxy platform - Seamlessly forward Claude Code requests to any OpenAI-compatible API service with smart model mapping, streaming support, deployed on Cloudflare Worker.
A set of beautifully-designed, accessible components and a code distribution platform. Works with your favorite frameworks. Open Source. Open Code.
A Bulletproof Way to Generate Structured JSON from Language Models
[ICML 2025] "Graph World Model", Tao Feng, Yexin Wu, Guanyu Lin, Jiaxuan You
Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.
An open-source AI agent that brings the power of Gemini directly into your terminal.
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
TradingAgents: Multi-Agents LLM Financial Trading Framework
[ICLR 2025] "GraphRouter: A Graph-based Router for LLM Selections", Tao Feng, Yanzhen Shen, Jiaxuan You
[EMNLP 2025 Demo] TinyScientist: A Lightweight Framework for Building Research Agents
Live-streamed development of RL tuning for LLM agents
[ACL 2025 Main] Code for MultiAgentBench: Evaluating the Collaboration and Competition of LLM Agents https://www.arxiv.org/pdf/2503.01935
Search-R1: An efficient, scalable RL training framework for LLMs that interleave reasoning with search engine calls, based on veRL
Fully open reproduction of DeepSeek-R1
verl: Volcano Engine Reinforcement Learning for LLMs