Stars
Official PyTorch implementation for "Large Language Diffusion Models"
Towards a Unified View of Large Language Model Post-Training
The official repository of the paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models"
Code for the paper "Mirostat: A Perplexity-Controlled Neural Text Decoding Algorithm" (https://arxiv.org/abs/2007.14966).
Kimi K2 is the large language model series developed by the Moonshot AI team
Official Repository of Absolute Zero Reasoner
OLMoE: Open Mixture-of-Experts Language Models
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Extrapolating RLVR to General Domains without Verifiers
Muon is an optimizer for hidden layers in neural networks
aider is AI pair programming in your terminal
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.
[ACL-2024] Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training
Scalable toolkit for efficient model reinforcement