Stars
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Make websites accessible for AI agents. Automate tasks online with ease.
A high-throughput and memory-efficient inference and serving engine for LLMs
Interact with your documents using the power of GPT, 100% privately, no data leaks
A collection of learning resources for curious software engineers
The definitive Web UI for local AI, with powerful features and easy setup.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).
Count the MACs / FLOPs of your PyTorch model.
Training and serving large-scale neural networks with auto parallelization.
A Python framework to write Kubernetes operators in just a few lines of code
Extended pickling support for Python objects
Blazing fast bulk data transfers between any cloud
Distribute and run AI workloads on Kubernetes magically in Python, like PyTorch for ML infra.
UI tool for fine-tuning and testing your own LoRA models based on LLaMA, GPT-J and more. One-click run on Google Colab. + A Gradio ChatGPT-like Chat UI to demonstrate your language models.
A collection of incremental learning paper implementations including PODNet (ECCV20) and Ghost (CVPR-W21).
Module, Model, and Tensor Serialization/Deserialization
Balsa is a learned SQL query optimizer. It tailor-optimizes your SQL queries to find the best execution plans for your hardware and engine.
Simple Waymo Open Dataset Reader
CATS: the Climate-Aware Task Scheduler
Deadline-based hyperparameter tuning on RayTune.
Train with big data on any cloud
Code and instructions used in the EuroSys '25 paper: SpotHedge: Serving AI Models on Spot Instances.
Create tables of contents (TOCs) for your Markdown files!