zsk97

Shunkang ZHANG zsk97

Ph.D. from HKUST. Interested in ML for systems, GPUs, and database acceleration.

10 followers · 0 following

Hong Kong

MoE-Offload Public

Offload MoE expert with predicted pattern

Python 1 1 Updated Oct 5, 2024
flashinfer Public
Forked from flashinfer-ai/flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda Apache License 2.0 Updated Jul 30, 2024
vattention Public
Forked from microsoft/vattention

Dynamic Memory Management for Serving LLMs without PagedAttention

C MIT License Updated Jul 29, 2024
InfiniGen Public
Forked from snu-comparch/InfiniGen

InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)

Python Apache License 2.0 Updated Jul 10, 2024
streambox Public
Forked from CGCL-codes/streambox

Python Apache License 2.0 Updated May 28, 2024
MoECache Public

Implement the expert cache for MoE

Python 1 1 Updated May 18, 2024
Parcae Public
Forked from JF-D/Parcae

Python MIT License Updated Apr 22, 2024
orion Public
Forked from eth-easl/orion

An interference-aware scheduler for fine-grained GPU sharing

Python MIT License Updated Apr 15, 2024
LLaMA2-Accessory Public
Forked from marsggbo/LLaMA2-Accessory

Python Other Updated Apr 15, 2024
SpotServe Public
Forked from Hsword/SpotServe

SpotServe: Serving Generative Large Language Models on Preemptible Instances

Apache License 2.0 Updated Feb 22, 2024
streaming-llm Public
Forked from mit-han-lab/streaming-llm

Efficient Streaming Language Models with Attention Sinks

Python MIT License Updated Oct 5, 2023
chroma Public
Forked from chroma-core/chroma

the AI-native open-source embedding database

Python Apache License 2.0 Updated Aug 23, 2023
cutlass Public
Forked from NVIDIA/cutlass

CUDA Templates for Linear Algebra Subroutines

C++ Other Updated Aug 14, 2023
faiss Public
Forked from facebookresearch/faiss

A library for efficient similarity search and clustering of dense vectors.

C++ MIT License Updated Aug 11, 2023
hnswlib Public
Forked from nmslib/hnswlib

Header-only C++/python library for fast approximate nearest neighbors

C++ Apache License 2.0 Updated Aug 11, 2023
GPU-practice Public

The GPU kernel implementation practice

C++ Updated Aug 10, 2023
milvus Public
Forked from milvus-io/milvus

A cloud-native vector database, storage for next generation AI applications

Go Apache License 2.0 Updated Aug 5, 2023
vllm Public
Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python Apache License 2.0 Updated Jul 20, 2023
perftest Public
Forked from linux-rdma/perftest

Infiniband Verbs Performance Tests

C Other Updated Jul 17, 2023
linux Public
Forked from torvalds/linux

Linux kernel source tree

C Other Updated Jul 17, 2023
rdma-core Public
Forked from linux-rdma/rdma-core

RDMA core userspace libraries and daemons

C Other Updated Jul 16, 2023
gpgpu-sim_distribution Public
Forked from gpgpu-sim/gpgpu-sim_distribution

GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as…

C++ Other Updated Jul 6, 2023
cuda-samples Public
Forked from NVIDIA/cuda-samples

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C Other Updated Jun 30, 2023
README_tools Public
Forked from guodongxiaren/README

An explanation of README file syntax, i.e., an introduction to GitHub Flavored Markdown syntax

The Unlicense Updated Mar 8, 2023
kvm-hello-world Public
Forked from dpw/kvm-hello-world

A minimal kvm example

C MIT License Updated Jul 30, 2022
