- University of Helsinki
- Helsinki, Finland
- https://www.zihao.cool/
- in/zihao-li-968170212
Highlights
- Pro
Stars
🔥 A minimal training framework for scaling FLA models
We’re OpenLLM Europe 🇪🇺, an open-source community committed to empowering LLM projects in all European languages, especially medium- and low-resource languages.
A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular configuration across SFT, RLVR, and evaluation workflows.
The LUMI AI Guide is designed to assist users in migrating their machine learning applications from smaller-scale computing environments to the LUMI supercomputer.
PyTorch building blocks for the OLMo ecosystem
AMD-AGI / torchtitan-amd
Forked from pytorch/torchtitan. A PyTorch native platform for training generative AI models
A PyTorch native platform for training generative AI models
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
📰 Must-read papers and blogs on Speculative Decoding ⚡️
Fully open reproduction of DeepSeek-R1
slime is an LLM post-training framework for RL Scaling.
Making large AI models cheaper, faster and more accessible
Minimalistic large language model 3D-parallelism training
A course on LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
Train transformer language models with reinforcement learning.
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)