OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 6,143 672 Updated Oct 8, 2025

yuchenlin / rebiber

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 2,924 164 Updated Jul 9, 2025

opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,441 173 Updated Oct 10, 2025

OpenBMB / RLPR

Extrapolating RLVR to General Domains without Verifiers

Python 172 8 Updated Aug 12, 2025

KellerJordan / Muon

Muon is an optimizer for hidden layers in neural networks

Python 1,824 84 Updated Jul 12, 2025

MoonshotAI / Moonlight

Muon is Scalable for LLM Training

1,323 69 Updated Aug 3, 2025

Aider-AI / aider

aider is AI pair programming in your terminal

Python 37,878 3,561 Updated Oct 5, 2025

trotsky1997 / openai_grading_fix

Python 6 Updated Feb 17, 2025

kanishkg / cognitive-behaviors

Python 208 12 Updated Mar 26, 2025

ruixin31 / Spurious_Rewards

Python 333 19 Updated Jul 29, 2025

open-thought / reasoning-gym

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,175 95 Updated Oct 6, 2025

JiuhaiChen / BLIP3o

Official implementation of BLIP3o-Series

Python 1,499 65 Updated Oct 3, 2025

MiniMax-AI / One-RL-to-See-Them-All

The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning

Python 318 16 Updated May 31, 2025

volcengine / veScale

A PyTorch Native LLM Training Framework

Python 874 51 Updated Sep 12, 2025

InternLM / InternBootcamp

Python 318 24 Updated Aug 29, 2025

OpenBMB / MiniCPM-V

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,057 1,649 Updated Sep 24, 2025

ByteDance-Seed / Seed1.5-VL

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,450 59 Updated Jun 14, 2025

calubkk / RAAT

[ACL-2024]Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training

Python 37 3 Updated Oct 28, 2024

NVIDIA-NeMo / RL

Scalable toolkit for efficient model reinforcement

Python 921 152 Updated Oct 10, 2025

DreamLM / Dream

Dream 7B, a large diffusion language model

Python 1,001 55 Updated Sep 26, 2025

Joshua-Ren / Learning_dynamics_LLM

Jupyter Notebook 169 7 Updated May 16, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Zefan Wang ZefanW

Achievements

Achievements

Block or report ZefanW

Stars

ML-GSAI / LLaDA

TsinghuaC3I / Unify-Post-Training

RUCAIBox / Passk_Training

basusourya / mirostat

MoonshotAI / Kimi-K2

LeapLabTHU / Absolute-Zero-Reasoner

huggingface / Math-Verify

allenai / OLMoE

HW-whistleblower / True-Story-of-Pangu

open-compass / opencompass