-
Beijing Jiaotong University
- Hangzhou, China
-
05:34
(UTC +08:00) - https://www.iamhlbx.xyz
Lists (4)
Sort Name ascending (A-Z)
Starred repositories
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
An industrial-grade C++ implementation of RAFT consensus algorithm based on brpc, widely used inside Baidu to build highly-available distributed systems.
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
TiDB operator creates and manages TiDB clusters running in Kubernetes.
Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, code-like database branching, and scale to zero.
Real-time analytics on Postgres tables
A Datacenter Scale Distributed Inference Serving Framework
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
A high-throughput and memory-efficient inference and serving engine for LLMs
SGLang is a high-performance serving framework for large language models and multimodal models.
A General-purpose Task-parallel Programming System in C++
The official GitHub page for the survey paper "A Survey of Large Language Models".
Probably the fastest coroutine lib in the world!
FlatBuffers: Memory Efficient Serialization Library
Open source transactional distributed database. Linear scalability and proven fault-tolerance on commodity hardware or cloud infrastructure without compromising performance.
LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
NoSQL data store using the Seastar framework, compatible with Apache Cassandra and Amazon DynamoDB
MS-Agent: a lightweight framework to empower agentic execution of complex tasks
KubeBlocks is a Kubernetes Operator designed to manage a variety of databases and streaming systems, including MySQL, PostgreSQL, MongoDB, Redis, RabbitMQ, RocketMQ, and more, within Kubernetes env…
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
A simple and easy-to-use Go mocking library derived from ByteDance's internal best practices
Curve is a sandbox project hosted by the CNCF Foundation. It's cloud-native, high-performance, and easy to operate. Curve is an open-source distributed storage system for block and shared file stor…