Train transformer language models with reinforcement learning.

Python 16,736 2,371 Updated Dec 22, 2025

ByteCheckpoint: A Unified Checkpointing Library for LFMs

Python 256 18 Updated Dec 8, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,636 838 Updated Dec 18, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,696 2,866 Updated Dec 22, 2025

Infiniband Verbs Performance Tests

C 889 363 Updated Dec 14, 2025

RDMA core userspace libraries and daemons

C 2,079 805 Updated Dec 21, 2025

Large Context Attention

Python 754 52 Updated Oct 13, 2025

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,402 1,248 Updated Dec 17, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,167 6,626 Updated Dec 22, 2025

📚 A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc. 🎉

Python 4,853 329 Updated Nov 28, 2025

A PyTorch native platform for training generative AI models

Python 4,863 648 Updated Dec 21, 2025

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,987 530 Updated Sep 25, 2024

Rotary Transformer

Python 1,064 59 Updated Mar 21, 2022

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,830 1,815 Updated Oct 13, 2025

Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs

Python 909 53 Updated Nov 27, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 154,129 31,504 Updated Dec 22, 2025

Paragraph-by-paragraph close readings of classic and new deep learning papers

32,215 2,763 Updated Mar 22, 2025

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,207 364 Updated Aug 14, 2025

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,654 935 Updated Aug 21, 2024

A QoS-based scheduling system that brings optimal layout and status to workloads such as microservices, web services, big data jobs, and AI jobs.

Go 1,632 396 Updated Dec 22, 2025

An industrial deep learning framework for high-dimension sparse data

PureBasic 4,305 1,029 Updated Sep 25, 2024

Kubernetes-native Deep Learning Framework

Python 744 116 Updated Jan 26, 2024

DLRover: An Automatic Distributed Deep Learning System

Python 1,611 203 Updated Dec 18, 2025

Policy based networking for cloud native applications

721 97 Updated Apr 3, 2020

flannel is a network fabric for containers, designed for Kubernetes

Go 9,360 2,901 Updated Dec 22, 2025

gRPC to JSON proxy generator following the gRPC HTTP spec

Go 19,736 2,360 Updated Dec 20, 2025

PyTorch extensions for high performance and large scale training.

Python 3,391 294 Updated Apr 26, 2025

Making large AI models cheaper, faster and more accessible

Python 41,298 4,546 Updated Dec 8, 2025

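AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks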

Jupyter Notebook 15,900 2,275 Updated Sep 3, 2025

Giving Kubernetes Superpowers to everyone

Go 7,245 911 Updated Dec 22, 2025