botbw

61 followers · 269 following

Nanyang Technological University
Singapore
16:55 (UTC +08:00)

Stars

vllm-project / guidellm

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

Python 1,270 168 Updated Jun 12, 2026

lucifer1004 / VeloQ

Agent-friendly GPU profile-query CLI

Rust 82 2 Updated Jun 12, 2026

DarkSharpness / dark-trace

JavaScript 3 Updated Jun 5, 2026

pie-project / pie

Pie: Programmable LLM Serving

Rust 175 22 Updated Jun 16, 2026

mit-han-lab / KernelWiki

Python 252 27 Updated Jun 9, 2026

gau-nernst / learn-cuda

Learn CUDA with PyTorch

Cuda 333 50 Updated Jun 1, 2026

deepseek-ai / DualPipe

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,967 326 Updated Jan 14, 2026

microsoft / tokenweave

Accepted to MLSys 2026

Python 87 7 Updated Apr 19, 2026

LoongServe / LoongServe

Jupyter Notebook 134 15 Updated Nov 11, 2024

SaladDay / cc-switch-cli

⭐️ A cross-platform CLI All-in-One assistant tool for Claude Code, Codex & Gemini CLI.

Rust 3,570 205 Updated Jun 15, 2026

uccl-project / mKernel

mKernel: fast multi-node, multi-GPU fused kernels

Cuda 233 22 Updated Jun 8, 2026

microsoft / vattention

Dynamic Memory Management for Serving LLMs without PagedAttention

C 493 42 Updated Jun 10, 2026

gty111 / gLLM

An Efficient and Versatile Inference Engine for Distributed LLM Serving

Python 60 4 Updated Jun 16, 2026

openai / parameter-golf

Train the smallest LM you can that fits in 16MB. Best model wins!

Python 5,131 3,338 Updated May 4, 2026

xiehuanyi / LP_Bench

LP_Bench

Python 14 4 Updated Feb 27, 2026

HPMLL / BurstGPT

A ChatGPT (GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems

Python 270 15 Updated Mar 19, 2026

SJTU-IPADS / MetaAttention

MetaAttention: A Unified and Performant Attention Framework Across Hardware Backends (PPoPP'26)

C++ 15 3 Updated Dec 31, 2025

HydraQYH / swizzled_layout_gemm

Using a swizzled hierarchical layout for GEMM

Python 4 Updated Jun 9, 2026

Imbad0202 / academic-research-skills

Academic Research Skills for Claude Code: research → write → review → revise → finalize

Python 31,899 2,627 Updated Jun 15, 2026

hao-ai-lab / DistCA

Efficient Long-context Language Model Training by Core Attention Disaggregation

Python 105 7 Updated Apr 7, 2026

LeNPaul / academic

A Jekyll theme for academia

HTML 232 228 Updated Jul 8, 2024

marin-community / marin

Open-source framework for the research and development of foundation models.

Python 1,115 132 Updated Jun 16, 2026

BBuf / KDA-Pilot

Python 183 29 Updated Jun 15, 2026

Azure / AzurePublicDataset

Microsoft Azure Traces

Jupyter Notebook 1,147 182 Updated Jun 3, 2026

nex-agi / NexVenusCL

Nex Venus Communication Library

C++ 76 7 Updated Nov 17, 2025

LLMServe / dLoRA-artifact

Jupyter Notebook 32 8 Updated May 28, 2024

S-LoRA / S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,913 124 Updated Jan 21, 2024

wilicc / gpu-burn

Multi-GPU CUDA stress test

C++ 2,229 407 Updated May 31, 2026

openinfer-project / openinfer

Pure Rust + CUDA LLM inference engine

Rust 412 53 Updated Jun 16, 2026

foundry-org / foundry

Foundry materializes CUDA graphs along with their execution context to disk to support fast cold start of serving engines.

C++ 36 4 Updated Jun 15, 2026
