Skip to content
View lbh2001's full-sized avatar
🎣
Fishing
🎣
Fishing

Organizations

@bullfrog-store

Block or report lbh2001

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Header-only C++ binding for libzmq

C++ 2,240 795 Updated Dec 19, 2025

Flash Attention from Scratch on CUDA Ampere

Assembly 96 12 Updated Sep 1, 2025

[ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.

Python 102 7 Updated Dec 20, 2024

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 4,259 349 Updated Dec 19, 2025

Nano vLLM

Python 9,796 1,232 Updated Nov 3, 2025

This repository is responsible for the LLVM-related parts of Jeandle.

LLVM 141 26 Updated Dec 16, 2025

Jeandle is a Just-in-Time compiler for Java. It is built on OpenJDK and leverages the LLVM compiler infrastructure to generate machine code, aiming to provide powerful compilation optimizations and…

Java 377 50 Updated Dec 19, 2025

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python 2,136 243 Updated Dec 18, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 40,407 7,019 Updated Dec 19, 2025

Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

Python 249 42 Updated Dec 10, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,451 475 Updated Dec 19, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,404 635 Updated Dec 19, 2025

how to optimize some algorithm in cuda.

Cuda 2,696 244 Updated Dec 6, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,780 12,068 Updated Dec 19, 2025

从无名小卒到大模型(LLM)大英雄~ 欢迎关注后续!!!

Jupyter Notebook 1,887 128 Updated Nov 22, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,748 3,810 Updated Dec 19, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 4,306 606 Updated Dec 19, 2025

My learning notes for ML SYS.

Python 4,701 298 Updated Dec 19, 2025

Optimized primitives for collective multi-GPU communication

C++ 4,323 1,093 Updated Dec 2, 2025

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

C++ 23,512 5,911 Updated Dec 19, 2025

brpc is an Industrial-grade RPC framework using C++ Language, which is often used in high performance system such as Search, Storage, Machine learning, Advertisement, Recommendation etc. "brpc" mea…

C++ 17,415 4,083 Updated Dec 18, 2025

📝A simple and elegant markdown editor, available for Linux, macOS and Windows.

JavaScript 52,839 3,890 Updated Nov 19, 2025

科技爱好者周刊,每周五发布

80,922 3,776 Updated Dec 19, 2025

Scalable NameNode RPC Proxy for HDFS Federation

Java 86 16 Updated Apr 19, 2016

A Vector Database Tutorial (over CMU-DB's BusTub system)

C++ 740 22 Updated Jan 19, 2025

The official home of the Presto distributed SQL query engine for big data

Java 16,594 5,510 Updated Dec 19, 2025

贺师俊与360的劳动争议诉讼

2,441 160 Updated Mar 19, 2024

A light-weight RPC implement of google protobuf RPC framework.

C++ 2,148 654 Updated Aug 24, 2023

An industrial-grade C++ implementation of RAFT consensus algorithm based on brpc, widely used inside Baidu to build highly-available distributed systems.

C++ 4,195 912 Updated Oct 25, 2024

🚧 Build a SQL optimizer in 1000 lines of Rust using egg.

Rust 85 13 Updated Feb 6, 2023
Next