BingguangHao

Harry BingguangHao

8 followers · 4 following

Stars

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 6,151 897 Updated Jun 16, 2026

songmzhang / KDFlow

A user-friendly & efficient knowledge distillation framework for LLMs, supporting off-policy, on-policy (OPD), cross-tokenizer, multimodal, and on-policy self-distillation.

Python 199 15 Updated Jun 5, 2026

xstongxue / best-skills

A collection of general-purpose, high-quality Skills 🔥

JavaScript 1,940 143 Updated May 28, 2026

inclusionAI / AWorld

Search, understand, reproduce, and improve an idea with ease

Python 1,203 124 Updated Jun 16, 2026

LMIS-ORG / slime-agentic

A project implementing various agentic RL based on the Slime post-training framework

Python 468 32 Updated Apr 11, 2026

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes for ML SYS.

Python 6,532 443 Updated Jun 8, 2026

inclusionAI / AWorld-RL

Agentic Learning Powered by AWorld

Python 111 10 Updated Apr 16, 2026

Applied-Machine-Learning-Lab / Awesome-Function-Callings

72 2 Updated Apr 8, 2026

AQ-MedAI / Laos_System

Laos_System provides a configurable end-to-end pipeline that converts clinical speech/text notes into structured JSON documents for: admission, surgery and discharge.

Python 21 2 Updated Jan 7, 2026

KCORES / silicon-rider-bench

The KCORES Agent benchmarking project is designed to evaluate the tool-call capabilities of single-modal/multi-modal models.

TypeScript 75 10 Updated Dec 16, 2025

NVlabs / Tool-N1

Python 228 12 Updated Jun 2, 2025

BingguangHao / Open-Agentic-tool-use

A scaled agent data synthesis system for tool-use learning, similar to Kimi K2.

5 Updated Oct 26, 2025

verl-project / verl

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,004 4,085 Updated Jun 16, 2026

BingguangHao / RLFC

This is the official repository of the paper "Exploring Superior Function Calls via Reinforcement Learning".

34 Updated Aug 11, 2025

BingguangHao / BalanceSFT

This is the official repository of the paper "BalanceSFT: Improving LLM Function Calling with Balanced Training Signals and Data Hardness"

56 Updated Nov 24, 2025

JoyTsing / JoyTsing

Config files for my GitHub profile.

3 Updated Oct 16, 2025

JoyTsing / SeeleNote

🍧🍮🍒

Java 2 Updated Dec 13, 2022

JoyTsing / glade-backup

C++ 3 Updated Dec 15, 2023

JoyTsing / shell-practice

Personal daily shell scripting practice

Shell 2 Updated Jan 16, 2024

JoyTsing / bookstore

Go 2 Updated Jan 25, 2024

JoyTsing / redis-mq

based on redis pub/sub mq

C++ 6 Updated May 27, 2024

JoyTsing / JoyTsing.github.io

HTML 2 Updated Oct 20, 2025

JoyTsing / Thread_JAVA

Notes on my progress learning Java multithreading

Java 2 Updated Apr 30, 2020

JoyTsing / rtcserver

C++ 4 Updated Feb 29, 2024

JoyTsing / signaling

JavaScript 4 Updated Feb 24, 2024

JoyTsing / oldBlog

My hexo blog

HTML 3 Updated Dec 4, 2020

JoyTsing / Lazarus

A simple, easy-to-use, high-performance system for managing, scheduling, and load-balancing remote calls between services

C++ 29 Updated Jun 6, 2024

JoyTsing / ThreadPool

A ThreadPool Based On C++20

C++ 27 Updated Sep 1, 2024

datawhalechina / self-llm

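"A Guide to Using Open-Source Large Models": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large language models (MLLMs) in a Linux environment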

Jupyter Notebook 30,943 3,026 Updated Jun 3, 2026

liguodongiot / llm-action

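This project shares the technical principles behind large models along with hands-on experience (large-model engineering and putting large-model applications into production)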

HTML 24,537 2,812 Updated May 25, 2026