Starred repositories
Claude-Codex Bridge integrates OpenAI Codex CLI into Claude Code, enabling multi-model collaboration. This allows Claude to consult Codex for code reviews, brainstorming, architecture analysis, and…
OpenRouterBench: A One-Stop Benchmark and Solution Suite for LLM Routing
[DAI 2025] Beyond GPT-5: Making LLMs Cheaper and Better via Performance–Efficiency Optimized Routing
Research Implementation: Large Language Models are Near-Optimal Decision-Makers with a Non-Human Learning Behavior
[AAAI 2026] The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants
The Open-Source Data Annotation Platform
Data annotation toolbox supports image, audio and video data.
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Brain-inspired Cognitive Intelligence Engine (BrainCog) is a brain-inspired spiking neural network based platform for Brain-inspired Artificial Intelligence and simulating brains at multiple scales…
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
A curated list of awesome Deep Reinforcement Learning resources.
This is the official implementation of Multi-Agent PPO (MAPPO).
Python Implementation of Reinforcement Learning: An Introduction
Source code accompanying 'Mathematics of Epidemics on Networks' by Kiss, Miller, and Simon http://www.springer.com/us/book/9783319508047 . Documentation for the software package is at https://epide…
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
person detect based on yolov3 with several Python scripts