Skip to content
View chizhu's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report chizhu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is the official repo for the paper "General365: Benchmarking General Reasoning in LLMs under High Difficulty and Diversity".

Python 79 3 Updated Apr 14, 2026

This is the official repo for the paper "AMO-Bench: Large Language Models Still Struggle in High School Math Competitions".

Python 128 3 Updated Feb 6, 2026

Reinforcement Learning via Self-Distillation (SDPO)

Python 947 107 Updated Feb 18, 2026

The absolute trainer to light up AI agents.

Python 17,306 1,512 Updated Apr 29, 2026

A version of verl to support diverse tool use [TMLR 2026]

Python 997 83 Updated Jun 8, 2026
Python 77 7 Updated Apr 20, 2026

Train your Agent model via our easy and efficient framework

Python 1,763 163 Updated Dec 5, 2025

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 1,921 391 Updated Jun 12, 2026

Scalable toolkit for efficient model reinforcement

Python 1,726 422 Updated Jun 13, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,949 4,072 Updated Jun 13, 2026

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 3,392 545 Updated Jun 12, 2026

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Python 1,293 120 Updated Aug 16, 2025

ChatWiki 微信公众号的AI知识库工作流Agent平台,RAG大模型AI客服机器人,致力于成为垂直领域的coze、n8n。

Vue 1,975 310 Updated Jun 12, 2026

A PyTorch native platform for training generative AI models

Python 5,436 859 Updated Jun 13, 2026

一个很小很小的RAG系统

Python 383 37 Updated Apr 29, 2025

Synerise RecSys Challenge 2025

Python 99 45 Updated Jul 31, 2025

TrendPublish: 全自动 AI 内容生成与发布系统 | 微信公众号自动化 | 多源数据抓取 (Twitter/X、网站) | DeepseekAI、千问、讯飞模型 | 智能内容分析排序 | 定时发布 | 多模板支持 | Node.js | TypeScript | AI 技术趋势跟踪工具

TypeScript 2,989 416 Updated Jun 4, 2026

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 5,006 375 Updated Apr 6, 2026

Machine Learning Engineering Open Book

Python 18,107 1,150 Updated May 18, 2026

OpenMMLab Detection Toolbox and Benchmark

Python 32,754 9,826 Updated Aug 21, 2024

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 28,963 6,512 Updated Jun 13, 2026

Simple RL training for reasoning

Python 3,864 288 Updated Dec 23, 2025

Fully open reproduction of DeepSeek-R1

Python 26,302 2,438 Updated Apr 2, 2026

Reproduce R1 Zero on Logic Puzzle

Python 2,451 164 Updated Mar 20, 2025

Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB), with ultra-fast inference speed. 基于ppocr-v5-onnx模型推理,中英文OCR开源SOTA,推理速度超快。

Python 131 21 Updated Apr 11, 2026

A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.

Python 391 25 Updated Jul 8, 2025

基于知识图谱的《红楼梦》人物关系可视化及问答系统

HTML 1,334 316 Updated Apr 23, 2019

An Open Large Reasoning Model for Real-World Solutions

Python 1,540 81 Updated Feb 13, 2026
Next