🫡
艰难困苦,玉汝于成
Student @ USTC BDAA-BASE,
Currently interested in LLM, specifically post-training/reasoning(math/code)/agent4se.
-
University of Science and Technology of China
- Hefei, Anhui, China
-
12:03
(UTC -12:00)
Highlights
- Pro
Stars
3
results
for forked starred repositories
Clear filter
Qihoo360 / 360-LLaMA-Factory
Forked from hiyouga/LlamaFactoryadds Sequence Parallelism into LLaMA-Factory
YoYiL / llama2-chinese
Forked from DLLXW/baby-llama2-chinese用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.
deepspeedai / Megatron-DeepSpeed
Forked from NVIDIA/Megatron-LMOngoing research training transformer language models at scale, including: BERT & GPT-2