Skip to content
View lbh2001's full-sized avatar
🎣
Fishing
🎣
Fishing

Organizations

@bullfrog-store

Block or report lbh2001

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 2,338 205 Updated Dec 23, 2025

Header-only C++ binding for libzmq

C++ 2,245 797 Updated Dec 19, 2025

Flash Attention from Scratch on CUDA Ampere

Assembly 99 13 Updated Sep 1, 2025

[ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.

Python 104 7 Updated Dec 20, 2024

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 4,303 360 Updated Dec 25, 2025

Nano vLLM

Python 10,128 1,270 Updated Nov 3, 2025

This repository is responsible for the LLVM-related parts of Jeandle.

LLVM 143 28 Updated Dec 16, 2025

Jeandle is a Just-in-Time compiler for Java. It is built on OpenJDK and leverages the LLVM compiler infrastructure to generate machine code, aiming to provide powerful compilation optimizations and…

Java 381 50 Updated Dec 24, 2025

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python 2,171 248 Updated Dec 25, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 40,474 7,039 Updated Dec 24, 2025

Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

Python 249 43 Updated Dec 10, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,480 480 Updated Dec 25, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,430 638 Updated Dec 25, 2025

how to optimize some algorithm in cuda.

Cuda 2,715 244 Updated Dec 23, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,163 12,181 Updated Dec 25, 2025

从无名小卒到大模型(LLM)大英雄~ 欢迎关注后续!!!

Jupyter Notebook 1,897 130 Updated Nov 22, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,967 3,866 Updated Dec 25, 2025

FlashInfer: Kernel Library for LLM Serving

Python 4,358 616 Updated Dec 25, 2025

My learning notes for ML SYS.

Python 4,805 309 Updated Dec 24, 2025

Optimized primitives for collective multi-GPU communication

C++ 4,332 1,099 Updated Dec 25, 2025

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

C++ 23,524 5,913 Updated Dec 25, 2025

brpc is an Industrial-grade RPC framework using C++ Language, which is often used in high performance system such as Search, Storage, Machine learning, Advertisement, Recommendation etc. "brpc" mea…

C++ 17,423 4,081 Updated Dec 22, 2025

📝A simple and elegant markdown editor, available for Linux, macOS and Windows.

JavaScript 52,913 3,901 Updated Nov 19, 2025

科技爱好者周刊,每周五发布

81,381 3,801 Updated Dec 19, 2025

Scalable NameNode RPC Proxy for HDFS Federation

Java 86 16 Updated Apr 19, 2016

A Vector Database Tutorial (over CMU-DB's BusTub system)

C++ 740 22 Updated Jan 19, 2025

The official home of the Presto distributed SQL query engine for big data

Java 16,601 5,512 Updated Dec 25, 2025

贺师俊与360的劳动争议诉讼

2,441 160 Updated Mar 19, 2024

A light-weight RPC implement of google protobuf RPC framework.

C++ 2,148 653 Updated Aug 24, 2023

An industrial-grade C++ implementation of RAFT consensus algorithm based on brpc, widely used inside Baidu to build highly-available distributed systems.

C++ 4,196 912 Updated Oct 25, 2024
Next