Skip to content
View pandengyao's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report pandengyao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,143 628 Updated Feb 6, 2026

The best ChatGPT that $100 can buy.

Python 42,405 5,473 Updated Feb 6, 2026

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

Python 28,122 4,954 Updated Aug 18, 2024

大模型基础: 一文了解大模型基础知识

6,720 564 Updated Dec 18, 2025

Minimalist developer portfolio using Next.js 14, React, TailwindCSS, Shadcn UI and Magic UI

TypeScript 1,257 343 Updated Jan 13, 2026

Material for gpu-mode lectures

Jupyter Notebook 5,688 571 Updated Feb 1, 2026

how to optimize some algorithm in cuda.

Cuda 2,819 256 Updated Jan 31, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,079 500 Updated Feb 6, 2026

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,962 340 Updated Nov 13, 2025

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 14,579 1,369 Updated Jan 31, 2026

🐶 Kubernetes CLI To Manage Your Clusters In Style!

Go 32,704 2,075 Updated Feb 5, 2026

Ongoing research training transformer models at scale

Python 15,150 3,570 Updated Feb 6, 2026

slime is an LLM post-training framework for RL Scaling.

Python 3,698 500 Updated Feb 5, 2026

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,318 129 Updated Nov 9, 2025

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,505 283 Updated Feb 6, 2026

KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)

Jupyter Notebook 792 133 Updated Jan 20, 2026
Python 130 7 Updated Aug 18, 2025

Fast and memory-efficient exact attention

Python 22,126 2,355 Updated Feb 5, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,736 2,031 Updated Jan 13, 2026

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 1,947 258 Updated Feb 6, 2026

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

Python 2,581 296 Updated Feb 5, 2026

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python 2,378 274 Updated Feb 5, 2026

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,704 388 Updated Feb 6, 2026

[MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

C++ 810 58 Updated Mar 6, 2025

A framework for few-shot evaluation of language models.

Python 11,374 3,020 Updated Feb 6, 2026

PyTorch native quantization and sparsity for training and inference

Python 2,667 421 Updated Feb 6, 2026

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 5,996 821 Updated Dec 22, 2025

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 16,242 2,327 Updated Sep 3, 2025

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 23,474 3,094 Updated Aug 15, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 52,680 8,903 Updated Nov 12, 2025
Next