A Claude Code plugin that automatically captures everything Claude does during your coding sessions, compresses it with AI (using Claude's agent-sdk), and injects relevant context back into future …

TypeScript 68,225 5,811 Updated Apr 27, 2026

A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.

92,475 8,885 Updated Apr 20, 2026

Public repository for Agent Skills

Python 124,626 14,588 Updated Apr 23, 2026

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,293 100 Updated Aug 28, 2025

High Performance LLM Inference Operator Library

C++ 836 82 Updated Apr 13, 2026

[SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference

Python 87 25 Updated Dec 7, 2025

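This project aims to share the technical principles behind large models along with hands-on experience (large-model engineering and real-world application deployment)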

HTML 24,147 2,776 Updated Mar 12, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …

Python 13,924 1,382 Updated Apr 27, 2026

Zero Bubble Pipeline Parallelism

Python 452 34 Updated May 7, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,112 275 Updated Apr 27, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,669 8,634 Updated Apr 27, 2026

Cluster data collected from production clusters in Alibaba for cluster management research

Jupyter Notebook 2,032 462 Updated Apr 27, 2026

Ring attention implementation with flash attention

Python 1,013 98 Updated Sep 10, 2025

Stateful LLM Serving

Python 99 15 Updated Mar 11, 2025

Large Language Model (LLM) Systems Paper List

1,942 101 Updated Apr 17, 2026

Official implementation of MASS: Multi-Agent Simulation Scaling for Portfolio Construction

Python 173 26 Updated Feb 9, 2026

Material for gpu-mode lectures

Jupyter Notebook 6,015 604 Updated Apr 22, 2026

🔥 The API to search, scrape, and interact with the web for AI

TypeScript 112,570 7,173 Updated Apr 27, 2026

best way to save what you love

Svelte 39,802 3,350 Updated Apr 6, 2026

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 344,799 55,718 Updated Mar 20, 2026

Curated list of project-based tutorials

264,180 34,351 Updated Aug 15, 2024

Interactive roadmaps, guides and other educational content to help developers grow in their careers.

TypeScript 353,740 43,973 Updated Apr 27, 2026

#1 PDF Application on GitHub that lets you edit PDFs on any device anywhere

TypeScript 77,867 6,803 Updated Apr 27, 2026

Quantization of Convolutional Neural networks.

Python 249 60 Updated Aug 5, 2024

[ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs

Python 125 6 Updated Jul 4, 2025

A collection of high-frequency LeetCode problems commonly asked by major internet companies 🔥

19,897 2,756 Updated Mar 13, 2024
Python 355 46 Updated Apr 2, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 78,293 16,145 Updated Apr 27, 2026