🧠
Updating my brain ... ... zzZ
Student @ USTC BDAA-BASE,
Currently interested in LLM, specifically post-training.
-
University of Science and Technology of China
- Hefei, Anhui, China
-
05:08
(UTC -12:00)
Highlights
- Pro
-
-
-
rllm Public
Forked from rllm-org/rllmDemocratizing Reinforcement Learning for LLMs
Jupyter Notebook MIT License UpdatedApr 11, 2025 -
verl Public
Forked from volcengine/verlverl: Volcano Engine Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedMar 26, 2025 -
OpenRLHF Public
Forked from OpenRLHF/OpenRLHFAn Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Python Apache License 2.0 UpdatedMar 24, 2025 -
LLaMA-Factory Public
Forked from hiyouga/LLaMA-FactoryUnified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Python Apache License 2.0 UpdatedMar 23, 2025 -
evalhub Public
Forked from ysy-phoenix/evalhubAll-in-one benchmarking platform for evaluating LLM.
Python MIT License UpdatedMar 15, 2025 -
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedDec 15, 2024 -
examples Public
Forked from pytorch/examplesA set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Python BSD 3-Clause "New" or "Revised" License UpdatedFeb 19, 2024 -
-
-
-
-
-
-