Skip to content
View qipengwang's full-sized avatar

Block or report qipengwang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference

Python 81 17 Updated Dec 7, 2025

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 22,404 2,621 Updated Dec 3, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,760 1,071 Updated Dec 21, 2025

Zero Bubble Pipeline Parallelism

Python 442 31 Updated May 7, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,507 180 Updated Dec 21, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,284 7,791 Updated Dec 21, 2025

cluster data collected from production clusters in Alibaba for cluster management research

Jupyter Notebook 1,921 450 Updated Nov 22, 2025

Ring attention implementation with flash attention

Python 950 91 Updated Sep 10, 2025

Stateful LLM Serving

Python 90 15 Updated Mar 11, 2025

Large Language Model (LLM) Systems Paper List

1,694 89 Updated Dec 21, 2025

Official implementation of MASS: Multi-Agent Simulation Scaling for Portfolio Construction

Python 155 20 Updated Nov 17, 2025

Material for gpu-mode lectures

Jupyter Notebook 5,438 552 Updated Dec 8, 2025

🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data

TypeScript 70,477 5,537 Updated Dec 19, 2025

best way to save what you love

Svelte 37,682 3,101 Updated Dec 20, 2025

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 329,867 53,702 Updated Nov 3, 2025

Curated list of project-based tutorials

253,130 33,075 Updated Aug 15, 2024

Interactive roadmaps, guides and other educational content to help developers grow in their careers.

TypeScript 345,843 43,532 Updated Dec 21, 2025

#1 PDF Application on GitHub that lets you edit PDFs on any device anywhere

TypeScript 71,468 6,066 Updated Dec 21, 2025

Quantization of Convolutional Neural networks.

Python 248 60 Updated Aug 5, 2024

[ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs

Python 121 7 Updated Jul 4, 2025

汇总各大互联网公司容易考察的高频leetcode题🔥

19,690 2,756 Updated Mar 13, 2024
Python 350 45 Updated Apr 2, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,886 12,104 Updated Dec 21, 2025

🌟 Wiki of OI / ICPC for everyone. (某大型游戏线上攻略,内含炫酷算术魔法)

TypeScript 25,085 4,517 Updated Dec 21, 2025

A GPipe implementation in PyTorch

Python 858 98 Updated Jul 25, 2024

Lingvo

Python 2,856 453 Updated Dec 5, 2025

ShortcutsBench: A Large-Scale Real-World Benchmark for API-Based Agents

Python 107 4 Updated Jun 24, 2025

tensorflow in C++

C++ 42 12 Updated Aug 21, 2019
Next