qipengwang

12 followers · 13 following

Peking University
http://qipengwang.github.io/

Stars

thedotmack / claude-mem

A Claude Code plugin that automatically captures everything Claude does during your coding sessions, compresses it with AI (using Claude's agent-sdk), and injects relevant context back into future …

TypeScript 69,741 5,956 Updated Apr 29, 2026

forrestchang / andrej-karpathy-skills

A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.

99,468 9,690 Updated Apr 20, 2026

anthropics / skills

Public repository for Agent Skills

Python 126,080 14,776 Updated Apr 23, 2026

bytedance / flux

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,297 101 Updated Aug 28, 2025

Tencent / hpc-ops

High Performance LLM Inference Operator Library

C++ 837 83 Updated Apr 13, 2026

HugoZHL / PQCache

[SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference

Python 87 25 Updated Dec 7, 2025

zbezj / HEU_KMS_Activator

41,208 3,847 Updated Apr 23, 2026

liguodongiot / llm-action

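This project aims to share the technical principles behind large models along with hands-on experience (large-model engineering and real-world application deployment)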

HTML 24,171 2,782 Updated Mar 12, 2026

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …

Python 13,961 1,389 Updated Apr 29, 2026

sail-sg / zero-bubble-pipeline-parallelism

Forked from NVIDIA/Megatron-LM

Zero Bubble Pipeline Parallelism

Python 452 34 Updated May 7, 2025

Efficient-ML / Qwen3-Quantization

Python 75 8 Updated Sep 19, 2025

alibaba / ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,114 275 Updated Apr 29, 2026

hiyouga / LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,769 8,643 Updated Apr 29, 2026

alibaba / clusterdata

cluster data collected from production clusters in Alibaba for cluster management research

Jupyter Notebook 2,033 462 Updated Apr 27, 2026

zhuzilin / ring-flash-attention

Ring attention implementation with flash attention

Python 1,015 98 Updated Sep 10, 2025

WukLab / preble

Stateful LLM Serving

Python 99 16 Updated Mar 11, 2025

AmberLJC / LLMSys-PaperList

Large Language Model (LLM) Systems Paper List

1,955 101 Updated Apr 17, 2026

gta0804 / MASS

Official implementation of MASS: Multi-Agent Simulation Scaling for Portfolio Construction

Python 173 26 Updated Feb 9, 2026

gpu-mode / lectures

Material for gpu-mode lectures

Jupyter Notebook 6,029 607 Updated Apr 22, 2026

firecrawl / firecrawl

🔥 The API to search, scrape, and interact with the web for AI

TypeScript 113,105 7,200 Updated Apr 29, 2026

imputnet / cobalt

best way to save what you love

Svelte 39,951 3,369 Updated Apr 6, 2026

donnemartin / system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 346,237 55,868 Updated Mar 20, 2026

practical-tutorials / project-based-learning

Curated list of project-based tutorials

264,416 34,370 Updated Aug 15, 2024

nilbuild / developer-roadmap

Interactive roadmaps, guides and other educational content to help developers grow in their careers.

TypeScript 353,891 43,982 Updated Apr 29, 2026

Stirling-Tools / Stirling-PDF

#1 PDF Application on GitHub that lets you edit PDFs on any device anywhere

TypeScript 77,988 6,821 Updated Apr 29, 2026

submission2019 / cnn-quantization

Quantization of Convolutional Neural networks.

Python 249 60 Updated Aug 5, 2024

SNU-ARC / any-precision-llm

[ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs

Python 125 6 Updated Jul 4, 2025

afatcoder / LeetcodeTop

A collection of the high-frequency LeetCode problems most often asked by major internet companies 🔥

19,898 2,759 Updated Mar 13, 2024

FMInference / DejaVu

Python 355 45 Updated Apr 2, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 78,613 16,255 Updated Apr 29, 2026