Highlights
- Pro
Stars
Protocol Buffers - Google's data interchange format
Tensors and Dynamic neural networks in Python with strong GPU acceleration
This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/python/azure/ or our v…
A Datacenter Scale Distributed Inference Serving Framework
SGLang is a fast serving framework for large language models and vision language models.
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, xDC replica…
A library for efficient similarity search and clustering of dense vectors.
TPU inference for vLLM, with unified JAX and PyTorch support.
A high-throughput and memory-efficient inference and serving engine for LLMs
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Open source documentation of Microsoft Azure
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
High accuracy RAG for answering questions from scientific documents with citations
Scalable toolkit for efficient model reinforcement
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Cloud-native high-performance edge/middle/service proxy
A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search sc…
A Rust framework for correct and performant distributed systems
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
🦜🔗 The platform for reliable agents.
UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)
SkyRL: A Modular Full-stack RL Library for LLMs
The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.