popomen

🎯

Focusing

Nan Zhe popomen

🎯

Focusing

8 followers · 34 following

Hangzhou, China

Achievements

Stars

huggingface / trl

Train transformer language models with reinforcement learning.

Python 16,736 2,371 Updated Dec 22, 2025

ByteDance-Seed / ByteCheckpoint

ByteCheckpoint: An Unified Checkpointing Library for LFMs

Python 256 18 Updated Dec 8, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,636 838 Updated Dec 18, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,696 2,866 Updated Dec 22, 2025

linux-rdma / perftest

Infiniband Verbs Performance Tests

C 889 363 Updated Dec 14, 2025

linux-rdma / rdma-core

RDMA core userspace libraries and daemons

C 2,079 805 Updated Dec 21, 2025

haoliuhl / ringattention

Large Context Attention

Python 754 52 Updated Oct 13, 2025

huggingface / accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,402 1,248 Updated Dec 17, 2025

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,167 6,626 Updated Dec 22, 2025

xlite-dev / Awesome-LLM-Inference

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4,853 329 Updated Nov 28, 2025

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 4,863 648 Updated Dec 21, 2025

deepseek-ai / DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,987 530 Updated Sep 25, 2024

ZhuiyiTechnology / roformer

Rotary Transformer

Python 1,064 59 Updated Mar 21, 2022

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,830 1,815 Updated Oct 13, 2025

volcengine / veScale

Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs

Python 909 53 Updated Nov 27, 2025

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 154,129 31,504 Updated Dec 22, 2025

mli / paper-reading

深度学习经典、新论文逐段精读

32,215 2,763 Updated Mar 22, 2025

deepspeedai / Megatron-DeepSpeed

Forked from NVIDIA/Megatron-LM

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,207 364 Updated Aug 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Nan Zhe popomen

Achievements

Achievements

Block or report popomen

Stars

huggingface / trl

ByteDance-Seed / ByteCheckpoint

OpenRLHF / OpenRLHF

volcengine / verl

linux-rdma / perftest

linux-rdma / rdma-core

haoliuhl / ringattention

huggingface / accelerate

huggingface / diffusers

xlite-dev / Awesome-LLM-Inference

pytorch / torchtitan

deepseek-ai / DeepSeek-V2

ZhuiyiTechnology / roformer

QwenLM / Qwen3

volcengine / veScale

huggingface / transformers

mli / paper-reading

deepspeedai / Megatron-DeepSpeed

HumanAIGC / EMO

koordinator-sh / koordinator

alibaba / x-deeplearning

sql-machine-learning / elasticdl

intelligent-machine-learning / dlrover

projectcalico / canal

flannel-io / flannel

grpc-ecosystem / grpc-gateway

facebookresearch / fairscale

hpcaitech / ColossalAI

Infrasys-AI / AISystem

k8sgpt-ai / k8sgpt