kiraadven

kiraadven

1 follower · 11 following

Stars

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 7,265 983 Updated May 13, 2026

guaguaupup / cpp_interview

c++后台服务器开发面经或八股总结！(有深度有广度，和仅有概念的总结文章不同！)

2,198 276 Updated Sep 9, 2024

ovg-project / kvcached

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 1,040 111 Updated May 16, 2026

NVIDIA / nvshmem

NVIDIA NVSHMEM is a parallel programming interface for NVIDIA GPUs based on OpenSHMEM. NVSHMEM can significantly reduce multi-process communication and coordination overheads by allowing programmer…

C++ 534 79 Updated May 5, 2026

aeron-io / aeron

Efficient reliable UDP unicast, UDP multicast, and IPC message transport

Java 8,638 1,038 Updated May 17, 2026

mansoor-mamnoon / limit-order-book

High-performance limit order book engine with C++ core and Python SDK. Processes 20M+ msgs/sec with µs latency. Supports real crypto/equity data replay, spread/imbalance/impact analytics, and backt…

C++ 47 20 Updated Aug 30, 2025

nkaz001 / hftbacktest

Free, open source, a high frequency trading and market making backtesting and trading bot, which accounts for limit orders, queue positions, and latencies, utilizing full tick data for trades and o…

Rust 4,074 790 Updated Dec 23, 2025

CalvinXKY / InfraTech

分享AI Infra知识&代码练习：PyTorch/vLLM/SGLang框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等

Jupyter Notebook 2,281 193 Updated May 8, 2026

ai-dynamo / nixl

NVIDIA Inference Xfer Library (NIXL)

C++ 1,035 319 Updated May 17, 2026

Levitate-Qian / latex-resume-template

保研/求职latex简历模版

TeX 34 4 Updated Mar 23, 2025

gpu-mode / lectures

Material for gpu-mode lectures

Jupyter Notebook 6,082 611 Updated May 9, 2026

LMCache / LMCache

Supercharge Your LLM with the Fastest KV Cache Layer

Python 8,283 1,179 Updated May 18, 2026

ultraworkers / claw-code

The repo is finally unlocked. enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.

Rust 191,813 109,919 Updated May 16, 2026

Tencent / hpc-ops

High Performance LLM Inference Operator Library

C++ 849 84 Updated Apr 13, 2026

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 9,633 1,245 Updated May 13, 2026

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 42,572 7,583 Updated May 17, 2026

FerranAgulloLopez / vLLMBatchingMemoryGap

Forked from vllm-project/vllm

Fork of vLLM for developing the paper "Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference"

Python 8 2 Updated Mar 5, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kiraadven

Block or report kiraadven

Stars

deepseek-ai / DeepGEMM

guaguaupup / cpp_interview

ovg-project / kvcached

NVIDIA / nvshmem

aeron-io / aeron

mansoor-mamnoon / limit-order-book

nkaz001 / hftbacktest

CalvinXKY / InfraTech

ai-dynamo / nixl

Levitate-Qian / latex-resume-template

gpu-mode / lectures

LMCache / LMCache

ultraworkers / claw-code

Tencent / hpc-ops

deepseek-ai / DeepEP

ray-project / ray

FerranAgulloLopez / vLLMBatchingMemoryGap

llumnix-project / llumnix-ray

vllm-project / vllm

Hackl0us / SS-Rule-Snippet

mybearyZhang / ipr-1

GeeeekExplorer / nano-vllm

CMU-SAFARI / ramulator2

facebookresearch / Replica-Dataset

krrish94 / chamferdist

sucong426 / VPN

Comfy-Org / ComfyUI

concept-graphs / concept-graphs

IDEA-Research / GroundingDINO

openai / CLIP