Stars
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Minimalist developer portfolio using Next.js 14, React, TailwindCSS, Shadcn UI and Magic UI
How to optimize various algorithms in CUDA.
Search-R1: an efficient, scalable RL training framework for LLMs that interleave reasoning with search-engine calls, built on veRL
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
🐶 Kubernetes CLI To Manage Your Clusters In Style!
Ongoing research training transformer models at scale
slime is an LLM post-training framework for RL scaling.
A Survey of Reinforcement Learning for Large Reasoning Models
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)
Fast and memory-efficient exact attention
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …
SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
[MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention
A framework for few-shot evaluation of language models.
PyTorch native quantization and sparsity for training and inference (see the dynamic-quantization sketch after this list)
AIInfra (AI infrastructure) refers to the AI system stack that supports training and inference of large AI models, from low-level hardware such as chips up through the upper software layers.
AISystem mainly refers to AI systems, covering the full stack of low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.
A minimal PyTorch re-implementation of OpenAI GPT (Generative Pretrained Transformer) training.
The simplest, fastest repository for training/finetuning medium-sized GPTs (a minimal training-loop sketch follows this list).
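
Several of the entries above (the minimal GPT re-implementation and the medium-sized GPT trainer) center on the same core loop: next-token prediction with a causal decoder. The sketch below is a toy illustration of that loop in plain PyTorch, not code from either repository; the TinyGPT class, the sizes, and the random-byte "corpus" are all made up for the example.

import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyGPT(nn.Module):
    # Toy decoder-only LM: token + position embeddings, causal
    # Transformer blocks, and a linear head over the vocabulary.
    def __init__(self, vocab_size=256, block_size=64, n_embd=128,
                 n_head=4, n_layer=2):
        super().__init__()
        self.block_size = block_size
        self.tok_emb = nn.Embedding(vocab_size, n_embd)
        self.pos_emb = nn.Embedding(block_size, n_embd)
        layer = nn.TransformerEncoderLayer(
            d_model=n_embd, nhead=n_head, dim_feedforward=4 * n_embd,
            batch_first=True, norm_first=True)
        self.blocks = nn.TransformerEncoder(layer, num_layers=n_layer)
        self.head = nn.Linear(n_embd, vocab_size, bias=False)

    def forward(self, idx):
        t = idx.size(1)
        pos = torch.arange(t, device=idx.device)
        x = self.tok_emb(idx) + self.pos_emb(pos)
        # Causal mask: each position may only attend to earlier positions.
        mask = nn.Transformer.generate_square_subsequent_mask(t).to(idx.device)
        return self.head(self.blocks(x, mask=mask))

model = TinyGPT()
opt = torch.optim.AdamW(model.parameters(), lr=3e-4)
for step in range(100):
    # Random bytes stand in for a real tokenized corpus.
    batch = torch.randint(0, 256, (8, model.block_size + 1))
    x, y = batch[:, :-1], batch[:, 1:]  # shift by one: next-token targets
    logits = model(x)
    loss = F.cross_entropy(logits.reshape(-1, logits.size(-1)), y.reshape(-1))
    opt.zero_grad(set_to_none=True)
    loss.backward()
    opt.step()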
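
Likewise, the quantization entries above all revolve around replacing full-precision weights with low-bit ones. As one concrete, runnable instance of that idea, stock PyTorch ships dynamic INT8 quantization; the snippet below uses that built-in API rather than any of the listed libraries, and the toy model is made up for the example.

import torch
import torch.nn as nn

# Stand-in model; real use targets an LLM's Linear layers the same way.
model = nn.Sequential(nn.Linear(128, 128), nn.ReLU(), nn.Linear(128, 10))

# Swap every nn.Linear for a dynamically quantized version: weights are
# stored as int8, activations are quantized on the fly (CPU inference only).
qmodel = torch.ao.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8)

x = torch.randn(1, 128)
print(qmodel(x).shape)  # same interface as the fp32 model, smaller weights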