Skip to content
View ji-huazhong's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Zhejiang University
  • Hangzhou
  • 20:33 (UTC +08:00)

Block or report ji-huazhong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Allow torch tensor memory to be released and resumed later

Python 251 58 Updated May 16, 2026

Ongoing research training transformer models at scale

Python 5 Updated Jun 15, 2026

UniRL is a Framework for Unified Multimodal Model Reinforcement Learning

Python 614 33 Updated Jun 15, 2026

An LLM post-training framework with vLLM for RL Scaling

Python 251 19 Updated Jun 15, 2026

Compact and Agent-Native MoE Training System

Python 194 15 Updated Jun 13, 2026

📰 Must-read papers and blogs on Speculative Decoding ⚡️

1,255 80 Updated Jun 2, 2026

how to optimize some algorithm in cuda.

Cuda 3,084 279 Updated Jun 9, 2026

FlashInfer: Kernel Library for LLM Serving

Python 5,795 1,050 Updated Jun 15, 2026

[Experimental] Miles-diffusion is an post-training framework for large-scale diffusion model training and production workloads, forked from and co-evolving with miles.

Python 17 5 Updated Jun 12, 2026

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 593 38 Updated Nov 26, 2025

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 850 58 Updated Jun 15, 2026

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Python 2,396 285 Updated Feb 20, 2026

A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

Python 516 104 Updated Jun 13, 2026

A unified framework for building, running, and training general agents at scale.

Python 340 44 Updated Jun 15, 2026

Open MinT training runtime on veRL

Python 235 14 Updated May 18, 2026

TokenSpeed is a speed-of-light LLM inference engine.

Python 1,434 156 Updated Jun 15, 2026

🚀 An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems.

Python 2,898 178 Updated Jun 12, 2026

verl Zero-Mismatch Dense/MoE HuggingFace Rollout

Python 53 5 Updated Jun 11, 2026

Multimodal RL training framework for diffusion & omni models

Python 359 55 Updated Jun 15, 2026

On demand communication

Python 34 2 Updated Apr 16, 2026

A kernel library written in tilelang

Python 1,587 138 Updated Apr 23, 2026

A minimal and fully-customizable CV template for Typst.

Typst 715 51 Updated Apr 6, 2025

Resume template for Typst. Mirror to https://typst.app/project/rVVa3y9vXemUKyvNKnabKV

Typst 152 13 Updated Jul 15, 2025

A simple, elegant, academic style CV template for typst. Support for English and Chinese (and more).

96 7 Updated Apr 9, 2023

An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale

Python 423 47 Updated Jun 13, 2026

A project implementing various agentic RL based on the Slime post-training framework

Python 465 32 Updated Apr 11, 2026

Learning and Debugging for FSDP/FSDP2 Training

Python 17 Updated Feb 7, 2026
Python 633 66 Updated Aug 28, 2025

[ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter

Python 173 18 Updated Feb 27, 2026
Next