A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.

55,092 4,673 Updated Apr 15, 2026

Public repository for Agent Skills

Python 119,651 13,844 Updated Apr 16, 2026

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,291 100 Updated Aug 28, 2025

High Performance LLM Inference Operator Library

C++ 828 82 Updated Apr 13, 2026

[SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference

Python 85 25 Updated Dec 7, 2025

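This project aims to share the technical principles behind large models along with hands-on experience (large-model engineering and real-world application deployment)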

HTML 24,031 2,768 Updated Mar 12, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,779 1,360 Updated Apr 17, 2026

Zero Bubble Pipeline Parallelism

Python 453 34 Updated May 7, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,088 272 Updated Apr 18, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,236 8,599 Updated Apr 12, 2026

cluster data collected from production clusters in Alibaba for cluster management research

Jupyter Notebook 2,026 461 Updated Mar 12, 2026

Ring attention implementation with flash attention

Python 1,012 97 Updated Sep 10, 2025

Stateful LLM Serving

Python 98 15 Updated Mar 11, 2025

Large Language Model (LLM) Systems Paper List

1,927 99 Updated Apr 17, 2026

Official implementation of MASS: Multi-Agent Simulation Scaling for Portfolio Construction

Python 173 24 Updated Feb 9, 2026

Material for gpu-mode lectures

Jupyter Notebook 5,964 601 Updated Feb 1, 2026

🔥 The API to search, scrape, and interact with the web for AI

TypeScript 110,416 7,046 Updated Apr 18, 2026

best way to save what you love

Svelte 39,611 3,332 Updated Apr 6, 2026

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 343,151 55,438 Updated Mar 20, 2026

Curated list of project-based tutorials

263,482 34,268 Updated Aug 15, 2024

Interactive roadmaps, guides and other educational content to help developers grow in their careers.

TypeScript 353,128 43,944 Updated Apr 16, 2026

#1 PDF Application on GitHub that lets you edit PDFs on any device anywhere

TypeScript 76,916 6,670 Updated Apr 17, 2026

Quantization of Convolutional Neural networks.

Python 249 60 Updated Aug 5, 2024

[ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs

Python 125 6 Updated Jul 4, 2025

A collection of the high-frequency LeetCode problems most often asked by major internet companies 🔥

19,884 2,755 Updated Mar 13, 2024
Python 355 45 Updated Apr 2, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 77,126 15,766 Updated Apr 18, 2026

🌟 Wiki of OI / ICPC for everyone. (An online strategy guide for a certain large-scale game, featuring cool arithmetic magic)

TypeScript 25,861 4,617 Updated Apr 17, 2026