Skip to content
View lbh2001's full-sized avatar
🎣
Fishing
🎣
Fishing

Organizations

@bullfrog-store

Block or report lbh2001

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

Python 4,739 488 Updated Mar 27, 2026

This repo release the detailed benchmark code and results of Sea Labs AI.

Python 17 1 Updated Jan 3, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,796 525 Updated Mar 13, 2026

Header-only C++ binding for libzmq

C++ 2,281 797 Updated Mar 23, 2026

Flash Attention from Scratch on CUDA Ampere

Assembly 157 22 Updated Sep 1, 2025

[ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.

Python 109 11 Updated Dec 20, 2024

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 5,432 487 Updated Mar 27, 2026

Nano vLLM

Python 12,468 1,793 Updated Nov 3, 2025

This repository is responsible for the LLVM-related parts of Jeandle.

LLVM 155 34 Updated Mar 27, 2026

Jeandle is a Just-in-Time compiler for Java. It is built on OpenJDK and leverages the LLVM compiler infrastructure to generate machine code, aiming to provide powerful compilation optimizations and…

Java 423 56 Updated Mar 27, 2026

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python 2,562 292 Updated Mar 27, 2026

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 41,880 7,391 Updated Mar 27, 2026

Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

Python 285 50 Updated Mar 25, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,989 632 Updated Mar 27, 2026

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,725 674 Updated Mar 27, 2026

how to optimize some algorithm in cuda.

Cuda 2,891 265 Updated Mar 24, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 74,499 14,843 Updated Mar 27, 2026

从无名小卒到大模型(LLM)大英雄~ 欢迎关注后续!!!

Jupyter Notebook 2,105 144 Updated Nov 22, 2025

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,101 5,031 Updated Mar 27, 2026

FlashInfer: Kernel Library for LLM Serving

Python 5,227 829 Updated Mar 27, 2026

My learning notes for ML SYS.

Python 5,789 374 Updated Mar 19, 2026

Optimized primitives for collective multi-GPU communication

C++ 4,565 1,186 Updated Mar 25, 2026

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

C++ 23,784 5,979 Updated Mar 27, 2026

brpc is an Industrial-grade RPC framework using C++ Language, which is often used in high performance system such as Search, Storage, Machine learning, Advertisement, Recommendation etc. "brpc" mea…

C++ 17,472 4,100 Updated Mar 26, 2026

📝A simple and elegant markdown editor, available for Linux, macOS and Windows.

JavaScript 54,749 4,062 Updated Mar 4, 2026

科技爱好者周刊,每周五发布

86,798 3,921 Updated Mar 27, 2026

Scalable NameNode RPC Proxy for HDFS Federation

Java 87 16 Updated Apr 19, 2016

A Vector Database Tutorial (over CMU-DB's BusTub system)

C++ 757 23 Updated Jan 19, 2025

The official home of the Presto distributed SQL query engine for big data

Java 16,668 5,534 Updated Mar 27, 2026

贺师俊与360的劳动争议诉讼

2,431 157 Updated Mar 19, 2024
Next