Stars
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
An enterprise-class UI design language and React UI library
A high-throughput and memory-efficient inference and serving engine for LLMs
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
Development repository for the Triton language and compiler
On-device AI across mobile, embedded and edge for PyTorch
Slint is an open-source declarative GUI toolkit to build native user interfaces for Rust, C++, JavaScript, or Python apps.
Supercharge Your LLM Application Evaluations 🚀
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
SGLang is a fast serving framework for large language models and vision language models.
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Simple, safe way to store and distribute tensors
Cloud-native high-performance edge/middle/service proxy
Delivers efficient, stable, and secure data distribution and acceleration powered by P2P technology, with an optional content‑addressable filesystem that accelerates OCI container launch.
For developers, who are building real-time data-driven applications, Redis is the preferred, fastest, and most feature-rich cache, data structure server, and document and vector query engine.
✌️ A spring physics based React animation library
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
An extremely fast Python package and project manager, written in Rust.
Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.
Visualizer for neural network, deep learning and machine learning models
A concise but complete full-attention transformer with a set of promising experimental features from various papers