Starred repositories
CockroachDB — the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placement.
SGLang is a fast serving framework for large language models and vision language models.
🦜🔗 The platform for reliable agents.
You like pytorch? You like micrograd? You love tinygrad! ❤️
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
A high-throughput and memory-efficient inference and serving engine for LLMs
Tensors and dynamic neural networks in Python with strong GPU acceleration
Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust
Development repository for the Triton language and compiler
Bear is a tool that generates a compilation database for clang tooling.
GoogleTest - Google Testing and Mocking Framework
A retargetable MLIR-based machine learning compiler and runtime toolkit.
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
A minimal, resource efficient unikernel for cloud services
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
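📚 Distributed infrastructure explained in depth yet accessibly: Linux and operating systems | distributed systems | distributed computing | databases | networking | virtualization and orchestration | big data and cloud computing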
Large Language Model (LLM) Systems Paper List
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).
FlashInfer: Kernel Library for LLM Serving
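PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training, and cross-platform deployment, for deep learning and machine learning)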
Ceph is a distributed object, block, and file storage platform