Stars
Protocol Buffers - Google's data interchange format
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).
A Datacenter Scale Distributed Inference Serving Framework
Emscripten: An LLVM-to-WebAssembly Compiler
Production-Grade Container Scheduling and Management
🍻 Default formulae for the missing package manager for macOS (or Linux)
Tensors and Dynamic neural networks in Python with strong GPU acceleration
The Moby Project - a collaborative project for the container ecosystem to assemble container-based systems
Open standard for machine learning interoperability
Durable Task Framework extension for Azure Functions
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
Slim(toolkit): Don't change anything in your container image and minify it by up to 30x (and for compiled languages even more) making it secure too! (free and open source)
Train transformer language models with reinforcement learning.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
Ongoing research training transformer models at scale
Command line tools for Azure Functions
The JavaScript / Wasm runtime that powers Cloudflare Workers
General-purpose programming language and toolchain for maintaining robust, optimal, and reusable software.
This repository is for active development of the Azure SDK for .NET. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/dotnet/azure/ or our ver…
NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, con…
Pluggable in-process caching engine to build and scale high performance services
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
A PyTorch native platform for training generative AI models