View BingguangHao's full-sized avatar


slime is an LLM post-training framework for RL scaling.

Python 6,151 897 Updated Jun 16, 2026

A user-friendly & efficient knowledge distillation framework for LLMs, supporting off-policy, on-policy (OPD), cross-tokenizer, multimodal, and on-policy self-distillation.

Python 199 15 Updated Jun 5, 2026

A collection of general-purpose, high-quality Skills 🔥

JavaScript 1,940 143 Updated May 28, 2026

Search, understand, reproduce, and improve an idea with ease

Python 1,203 124 Updated Jun 16, 2026

A project implementing various agentic RL based on the Slime post-training framework

Python 468 32 Updated Apr 11, 2026

My learning notes for ML SYS.

Python 6,532 443 Updated Jun 8, 2026

Agentic Learning Powered by AWorld

Python 111 10 Updated Apr 16, 2026

Laos_System provides a configurable end-to-end pipeline that converts clinical speech/text notes into structured JSON documents for: admission, surgery and discharge.

Python 21 2 Updated Jan 7, 2026

The KCORES Agent benchmarking project is designed to evaluate the tool-call capabilities of single-modal/multi-modal models.

TypeScript 75 10 Updated Dec 16, 2025
Python 228 12 Updated Jun 2, 2025

This is a scaled agent data synthesis system for tool-use learning, similar to Kimi K2.

5 Updated Oct 26, 2025

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,004 4,085 Updated Jun 16, 2026

This is the official repository of the paper Exploring Superior Function Calls via Reinforcement Learning.

34 Updated Aug 11, 2025

This is the official repository of the paper "BalanceSFT: Improving LLM Function Calling with Balanced Training Signals and Data Hardness"

56 Updated Nov 24, 2025

Config files for my GitHub profile.

3 Updated Oct 16, 2025

🍧🍮🍒

Java 2 Updated Dec 13, 2022
C++ 3 Updated Dec 15, 2023

Personal daily shell scripting practice

Shell 2 Updated Jan 16, 2024
Go 2 Updated Jan 25, 2024

A message queue based on Redis pub/sub

C++ 6 Updated May 27, 2024
HTML 2 Updated Oct 20, 2025

Notes on learning Java multithreading

Java 2 Updated Apr 30, 2020
C++ 4 Updated Feb 29, 2024
JavaScript 4 Updated Feb 24, 2024

My hexo blog

HTML 3 Updated Dec 4, 2020

A simple, easy-to-use, high-performance system for managing, scheduling, and load-balancing remote calls between services

C++ 29 Updated Jun 6, 2024

A ThreadPool Based On C++20

C++ 27 Updated Sep 1, 2024

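"A Guide to Using Open-Source Large Models": a tutorial tailored for beginners in China on quickly fine-tuning (full-parameter/LoRA) and deploying domestic and international open-source large language models (LLMs) and multimodal large language models (MLLMs) in a Linux environment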

Jupyter Notebook 30,943 3,026 Updated Jun 3, 2026

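This project aims to share the technical principles behind large models along with hands-on experience (large-model engineering and deploying large-model applications in production)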

HTML 24,537 2,812 Updated May 25, 2026