Skip to content
View ZailiWang's full-sized avatar
  • Intel
  • Beijing

Block or report ZailiWang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 2,327 203 Updated Dec 23, 2025

Source code and demo for memory bank and SiliconFriend

Python 380 52 Updated May 24, 2023

Advanced Matrix Extensions (AMX) Guide

C++ 108 8 Updated Jan 11, 2022

Open-source implementation of AlphaEvolve

Python 4,968 765 Updated Dec 24, 2025
Python 125 6 Updated Aug 18, 2025

SGLang wheels for multiple platforms

11 1 Updated Oct 13, 2025

Nano vLLM

Python 10,114 1,267 Updated Nov 3, 2025

SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs

C++ 59 70 Updated Dec 25, 2025

🎨ComfyUI standalone pack with 40+ custom nodes. | ComfyUI 大号整合包,预装大量自定义节点(不含SD模型)

Python 424 80 Updated Dec 22, 2025

12 Lessons to Get Started Building AI Agents

Jupyter Notebook 47,523 16,358 Updated Dec 25, 2025

A programming framework for agentic AI

Python 52,849 8,030 Updated Oct 8, 2025

所有小初高、大学PDF教材。

Roff 63,268 14,042 Updated Oct 18, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 12,769 1,179 Updated Sep 26, 2025

🤗 smolagents: a barebones library for agents that think in code.

Python 24,569 2,210 Updated Dec 23, 2025

Library for building powerful interactive command line applications in Python

Python 10,167 760 Updated Nov 17, 2025

Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.

Jupyter Notebook 613 51 Updated Feb 24, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,141 12,178 Updated Dec 25, 2025

这是一个简单的技术科普教程项目,主要聚焦于解释一些有趣的,前沿的技术概念和原理。每篇文章都力求在 5 分钟内阅读完成。

6,443 583 Updated Nov 10, 2025

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 8,562 1,391 Updated Oct 14, 2025

Machine Learning Engineering Open Book

Python 16,091 988 Updated Dec 20, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,958 3,863 Updated Dec 25, 2025

PyTorch media decoding and encoding

Python 882 80 Updated Dec 24, 2025

PyTorch native post-training library

Python 5,629 693 Updated Dec 24, 2025

PyTorch native quantization and sparsity for training and inference

Python 2,592 389 Updated Dec 25, 2025

Markdown语法支持添加 emoji表情,输入不同的符号码(两个冒号包围的字符)可以显示出不同的表情

278 75 Updated Aug 5, 2018

A markdown version emoji cheat sheet

TypeScript 13,515 4,595 Updated Dec 25, 2025

Datasets, Transforms and Models specific to Computer Vision

Python 17,397 7,189 Updated Dec 24, 2025

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 3,806 290 Updated Dec 25, 2025

Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.

LLVM 1,404 803 Updated Dec 25, 2025

Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver

C++ 1,324 262 Updated Dec 25, 2025
Next