Skip to content
View ZiHAO-LI-cmd's full-sized avatar
😅
😅

Highlights

  • Pro

Organizations

@Global-CS-application @MaLA-LM @OpenEuroLLM

Block or report ZiHAO-LI-cmd

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🔥 A minimal training framework for scaling FLA models

Python 394 64 Updated Apr 22, 2026

We’re OpenLLM Europe 🇪🇺, an Open Source community committed to empower LLM projects in all European languages, specifically medium and low-resource languages.

66 10 Updated Jun 18, 2025
Python 610 60 Updated May 21, 2026

A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular configuration across SFT, RLVR, and evaluation workflows.

Python 575 43 Updated May 18, 2026

天朝禁书 - 亲自整理,亲自传播

TypeScript 268 25 Updated Jan 21, 2026

The LUMI AI Guide is designed to assist users in migrating their machine learning applications from smaller-scale computing environments to the LUMI supercomputer.

Python 77 20 Updated May 22, 2026

PyTorch building blocks for the OLMo ecosystem

Python 1,311 264 Updated Jun 22, 2026

A PyTorch native platform for training generative AI models

Python 17 1 Updated Apr 21, 2026

A PyTorch native platform for training generative AI models

Python 5,454 868 Updated Jun 22, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …

Python 14,575 1,487 Updated Jun 22, 2026

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 5,023 372 Updated Apr 6, 2026

Go ahead and axolotl questions

Python 12,071 1,373 Updated Jun 22, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 72,346 8,855 Updated Jun 21, 2026

📰 Must-read papers and blogs on Speculative Decoding ⚡️

1,259 80 Updated Jun 2, 2026

Fully open reproduction of DeepSeek-R1

Python 26,339 2,442 Updated Apr 2, 2026
Python 1,292 133 Updated May 20, 2026

slime is an LLM post-training framework for RL Scaling.

Python 6,649 958 Updated Jun 21, 2026

Python bindings for llama.cpp

Python 10,423 1,418 Updated Jun 22, 2026

An Extensible Deep Learning Library

Python 2,367 406 Updated May 16, 2026

Making large AI models cheaper, faster and more accessible

Python 41,402 4,506 Updated May 25, 2026

Minimalistic large language model 3D-parallelism training

Python 2,721 318 Updated May 26, 2026

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 4,297 334 Updated Jun 13, 2026

Nano vLLM

Python 14,130 2,240 Updated Apr 26, 2026

s1: Simple test-time scaling

Python 6,655 757 Updated Jun 25, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,253 290 Updated Jun 22, 2026

Train transformer language models with reinforcement learning.

Python 18,686 2,799 Updated Jun 22, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,075 4,107 Updated Jun 22, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,668 972 Updated Jun 17, 2026
Next