shanshanpt

Organizations

@AlibabaPAI @DeepRec-AI

Starred repositories

20 results for sponsorable starred repositories

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 2,037 340 Updated Dec 17, 2025

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,472 498 Updated Dec 13, 2025

Community maintained hardware plugin for vLLM on Ascend

Python 1,471 665 Updated Dec 17, 2025

Fast and memory-efficient exact attention

Python 105 111 Updated Dec 17, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,411 319 Updated Dec 17, 2025

The definitive Web UI for local AI, with powerful features and easy setup.

Python 45,628 5,856 Updated Dec 15, 2025

Fast & memory efficient hashtable based on robin hood hashing for C++11/14/17/20

C++ 1,595 157 Updated May 1, 2023

experiments with inference on llama

Python 103 16 Updated Jun 6, 2024

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 30,881 3,776 Updated Dec 17, 2025

⚡ A Fast, Extensible Progress Bar for Python and CLI

Python 30,782 1,416 Updated May 22, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,614 12,027 Updated Dec 17, 2025
Python 6,802 1,149 Updated Nov 3, 2025

The official gpt4free repository | various collection of powerful language models | o4, o3 and deepseek r1, gpt-4.1, gemini 2.5

Python 65,635 13,686 Updated Dec 12, 2025

JSON for Modern C++

C++ 48,199 7,258 Updated Dec 15, 2025

Frame profiler

C++ 14,822 967 Updated Dec 16, 2025

The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.

C++ 1,695 627 Updated Dec 16, 2025

🐶 Kubernetes CLI To Manage Your Clusters In Style!

Go 32,195 2,025 Updated Dec 15, 2025

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 35,935 15,508 Updated Dec 17, 2025

Tapir extension to LLVM for optimizing Parallel Programs

LLVM 131 23 Updated Apr 20, 2020

A Go microservices framework

Go 22,640 2,397 Updated Nov 26, 2025