Starred repositories
verl: Volcano Engine Reinforcement Learning for LLMs
Fully open reproduction of DeepSeek-R1
An easy-to-use, scalable, and high-performance agentic RL framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
DeepEP: an efficient expert-parallel communication library
Accessible large language models via k-bit quantization for PyTorch. (A hedged 4-bit loading sketch follows this list.)
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…
Zero Bubble Pipeline Parallelism
Use PEFT or full-parameter training for CPT/SFT/DPO/GRPO of 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…
tiktoken is a fast BPE tokeniser for use with OpenAI's models. (A minimal usage sketch follows this list.)
Prometheus exporter that mines /proc to report on selected processes
The official repo of Pai-Megatron-Patch for large-scale LLM & VLM training, developed by Alibaba Cloud.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Beijing Telecom IPTV playlist: bj-telecom-iptv.m3u
Official implementation of "Towards Efficient Visual Adaption via Structural Re-parameterization".
A tool for bandwidth measurements on NVIDIA GPUs.
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models". (A plain-PyTorch sketch of the low-rank update follows this list.)
Optimized primitives for collective multi-GPU communication. (An all-reduce sketch follows this list.)
A GPU performance profiling tool for PyTorch models
Example models using DeepSpeed
Chinese-LLaMA 1 & 2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training / instruction fine-tuning datasets
Code and documentation to train Stanford's Alpaca models, and generate the data.
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Ongoing research training transformer language models at scale, including: BERT & GPT-2
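
For the bitsandbytes entry above, a minimal sketch of loading a causal LM in 4-bit NF4 through the Hugging Face transformers integration. The model id is a placeholder, and the flags shown are the commonly documented ones, not an exhaustive configuration.

```python
# Hedged sketch: 4-bit NF4 quantized loading via transformers' bitsandbytes
# integration. "some/model-id" is a placeholder, not a real checkpoint.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # quantize linear layers to 4 bits
    bnb_4bit_quant_type="nf4",              # NormalFloat4 data type
    bnb_4bit_compute_dtype=torch.bfloat16,  # dtype used for matmuls
)

model = AutoModelForCausalLM.from_pretrained(
    "some/model-id",                        # placeholder model id
    quantization_config=bnb_config,
    device_map="auto",
)
```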
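For the tiktoken entry, an encode/decode round-trip with a named BPE encoding; the choice of "cl100k_base" is an assumption and should match the target model.

```python
# Minimal tiktoken usage: encode a string to token ids and decode it back.
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")  # assumed encoding; model-dependent
tokens = enc.encode("tiktoken is a fast BPE tokeniser.")
print(tokens)              # list of integer token ids
print(enc.decode(tokens))  # round-trips to the original string
```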
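For the loralib entry, a plain-PyTorch sketch of the core LoRA idea: freeze the pretrained weight W and learn a low-rank update scaled by alpha/r. This is not loralib's API, just the technique it implements.

```python
# LoRA sketch (not loralib's API): y = W x + (alpha / r) * B A x,
# with W frozen and only the low-rank factors A and B trained.
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, in_features, out_features, r=8, alpha=16):
        super().__init__()
        self.base = nn.Linear(in_features, out_features)
        for p in self.base.parameters():          # freeze pretrained weight/bias
            p.requires_grad_(False)
        self.A = nn.Parameter(torch.randn(r, in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(out_features, r))  # zero init: starts as a no-op
        self.scaling = alpha / r

    def forward(self, x):
        return self.base(x) + self.scaling * (x @ self.A.T @ self.B.T)

layer = LoRALinear(768, 768, r=8)
out = layer(torch.randn(2, 768))  # gradients flow only through A and B
```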
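For the NCCL entry, rather than the raw C API, a short torch.distributed sketch that exercises NCCL's all-reduce collective; it assumes a torchrun launch so the RANK/WORLD_SIZE/LOCAL_RANK environment variables are set.

```python
# NCCL all-reduce via torch.distributed; launch with e.g.
#   torchrun --nproc_per_node=2 allreduce_demo.py
import os
import torch
import torch.distributed as dist

dist.init_process_group(backend="nccl")           # NCCL backend for GPU collectives
torch.cuda.set_device(int(os.environ["LOCAL_RANK"]))

t = torch.ones(4, device="cuda") * dist.get_rank()
dist.all_reduce(t, op=dist.ReduceOp.SUM)          # every rank now holds the sum
print(f"rank {dist.get_rank()}: {t}")
dist.destroy_process_group()
```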