Skip to content
View qzl164's full-sized avatar

Block or report qzl164

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Context engineering is the new vibe coding - it's the way to actually make AI coding assistants work. Claude Code is the best for this so that's what this repo is centered around, but you can apply…

Python 13,479 2,711 Updated Mar 16, 2026

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 3,426 547 Updated Jun 22, 2026

Community maintained hardware plugin for vLLM on Ascend

C++ 2,270 1,423 Updated Jun 22, 2026

A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

Python 5,193 549 Updated Jun 21, 2026

坚持分享 GitHub 上高质量、有趣实用的开源技术教程、开发者工具、编程网站、技术资讯。A list cool, interesting projects of GitHub.

46,844 4,647 Updated Dec 31, 2025

大模型算法岗面试题(含答案):常见问题和概念解析 "大模型面试题"、"算法岗面试"、"面试常见问题"、"大模型算法面试"、"大模型应用基础"

Jupyter Notebook 1,940 133 Updated May 7, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,709 1,063 Updated Apr 30, 2026

This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov

Jupyter Notebook 2,157 364 Updated Feb 8, 2026

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 6,563 383 Updated Jun 22, 2026

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Python 83,334 9,648 Updated Jun 22, 2026

动手学数据分析以项目为主线,知识点孕育其中,通过边学、边做、边引导来得到更好的学习效果

Jupyter Notebook 1,471 377 Updated May 29, 2024

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,577 229 Updated Dec 15, 2025

《机器学习理论导引》(宝箱书)的证明、案例、概念补充与参考文献讲解。

Jupyter Notebook 1,704 187 Updated Apr 26, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,625 867 Updated Jun 22, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,668 972 Updated Jun 17, 2026

AllenAI's post-training codebase

Python 3,759 548 Updated Jun 20, 2026

仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理

Jupyter Notebook 4,211 581 Updated Mar 26, 2026

A curated list of Large Language Model resources, covering model training, serving, fine-tuning, and building LLM applications.

4,969 705 Updated Aug 18, 2025

The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices

Python 5,122 1,230 Updated Apr 22, 2026

Kubernetes Handbook (Kubernetes指南) https://kubernetes.feisky.xyz

Makefile 5,537 1,381 Updated Nov 25, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 11,297 1,163 Updated Jun 21, 2026

Free and Open Source, Distributed, RESTful Search Engine

Java 77,104 25,846 Updated Jun 22, 2026

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,759 273 Updated Jul 18, 2025

算法工程师(人工智能CV方向)面试问题及相关资料

3,018 453 Updated Aug 18, 2024

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,400 753 Updated Jun 17, 2026

Simple, safe way to store and distribute tensors

Rust 3,783 330 Updated Jun 19, 2026

Robust recipes to align language models with human and AI preferences

Python 5,614 492 Updated May 26, 2026

《大模型白盒子构建指南》:一个全手搓的Tiny-Universe

Jupyter Notebook 4,921 468 Updated Feb 12, 2026

A tutorial for CUDA&PyTorch

Cuda 458 59 Updated Mar 23, 2026
Next