Skip to content
View ZefanW's full-sized avatar

Block or report ZefanW

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,019 201 Updated Sep 30, 2025

Towards a Unified View of Large Language Model Post-Training

Python 153 8 Updated Sep 8, 2025

The official repository of paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models''

Python 91 4 Updated Aug 15, 2025

Code for the paper-"Mirostat: A Perplexity-Controlled Neural Text Decoding Algorithm" (https://arxiv.org/abs/2007.14966).

Jupyter Notebook 61 3 Updated Feb 7, 2022

Kimi K2 is the large language model series developed by Moonshot AI team

8,317 548 Updated Sep 11, 2025

Official Repository of Absolute Zero Reasoner

Python 1,708 282 Updated Aug 24, 2025
Python 960 45 Updated Jul 2, 2025

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 879 80 Updated Sep 23, 2025

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,391 1,372 Updated Jul 9, 2025

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 6,143 672 Updated Oct 8, 2025

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 2,924 164 Updated Jul 9, 2025

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,441 173 Updated Oct 10, 2025

Extrapolating RLVR to General Domains without Verifiers

Python 172 8 Updated Aug 12, 2025

Muon is an optimizer for hidden layers in neural networks

Python 1,824 84 Updated Jul 12, 2025

Muon is Scalable for LLM Training

1,323 69 Updated Aug 3, 2025

aider is AI pair programming in your terminal

Python 37,878 3,561 Updated Oct 5, 2025
Python 6 Updated Feb 17, 2025
Python 333 19 Updated Jul 29, 2025

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,175 95 Updated Oct 6, 2025

Official implementation of BLIP3o-Series

Python 1,499 65 Updated Oct 3, 2025

The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning

Python 318 16 Updated May 31, 2025

A PyTorch Native LLM Training Framework

Python 874 51 Updated Sep 12, 2025
Python 318 24 Updated Aug 29, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,057 1,649 Updated Sep 24, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,450 59 Updated Jun 14, 2025

[ACL-2024]Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training

Python 37 3 Updated Oct 28, 2024

Scalable toolkit for efficient model reinforcement

Python 921 152 Updated Oct 10, 2025

Dream 7B, a large diffusion language model

Python 1,001 55 Updated Sep 26, 2025
Jupyter Notebook 169 7 Updated May 16, 2025
Next