Stars
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
SGLang is a fast serving framework for large language models and vision language models.
A query and indexing engine for Redis, providing secondary indexing, full-text search, vector similarity search and aggregations.
Ongoing research training transformer models at scale
PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily write your own.
ncnn is a high-performance neural network inference framework optimized for the mobile platform
Simple, scalable AI model deployment on GPU clusters
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
Open standard for machine learning interoperability
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
MariaDB server is a community developed fork of MySQL server. Started by core members of the original MySQL team, MariaDB actively works with outside developers to deliver the most featureful, stab…
An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.
An open-source AI agent that brings the power of Gemini directly into your terminal.
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
On-device AI across mobile, embedded and edge for PyTorch
Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.
A concise but complete full-attention transformer with a set of promising experimental features from various papers
You like pytorch? You like micrograd? You love tinygrad! ❤️
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Development repository for the Triton language and compiler
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.