Stars
xhx1022 / vllm
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
NiklasFreymuth / troll
Forked from verl-project/verl
TROLL: Trust Region Optimization for Large Language models
meituan-search / verl
Forked from verl-project/verl
verl: Volcano Engine Reinforcement Learning for LLMs
moojink / openvla-oft
Forked from openvla/openvla
Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success
yuanzhoulvpi2017 / nano_rl
Forked from verl-project/verl
Customized reward development on top of verl
sail-sg / VocabularyParallelism
Forked from NVIDIA/Megatron-LM
Vocabulary Parallelism
Zero Bubble Pipeline Parallelism
thunlp / Seq1F1B
Forked from NVIDIA/Megatron-LM
Sequence-level 1F1B schedule for LLMs.
MayDomine / Seq1F1B
Forked from NVIDIA/Megatron-LM
Sequence-level 1F1B schedule for LLMs.
fanshiqing / grouped_gemm
Forked from tgale96/grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM.
deepspeedai / Megatron-DeepSpeed
Forked from NVIDIA/Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Adlik / smoothquantplus
Forked from mit-han-lab/smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
AniZpZ / smoothquant
Forked from mit-han-lab/smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
alibaba / Megatron-LLaMA
Forked from NVIDIA/Megatron-LM
Best practice for training LLaMA models in Megatron-LM