Highlights

  • Pro

Organizations

@amplab @amplab-extras @ucbrise @neurocard @skypilot-org

High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

Rust 5,045 367 Updated Dec 22, 2025

Lightweight Durable Golang Workflows

Go 541 42 Updated Dec 19, 2025

Utilities for SkyPilot development

Shell 1 Updated Nov 21, 2025

A collection of reproducible inference engine benchmarks

Shell 38 1 Updated Apr 22, 2025

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, or on-prem).

Python 9,133 882 Updated Dec 23, 2025

Releasing the spot availability traces used in the "Can't Be Late" paper.

24 Updated Mar 31, 2024

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 3,570 288 Updated May 21, 2025

⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW.

Python 146 12 Updated Jun 8, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,021 12,144 Updated Dec 23, 2025

UI tool for fine-tuning and testing your own LoRA models based on LLaMA, GPT-J and more. One-click run on Google Colab. Also includes a Gradio ChatGPT-like chat UI to demonstrate your language models.

Python 474 99 Updated May 29, 2023

Self-hosted AI coding assistant

Rust 32,625 1,661 Updated Dec 15, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,329 4,779 Updated Jun 2, 2025

Python 28 7 Updated May 2, 2023

Examples and instructions on using LLMs (especially ChatGPT) for a PhD

107 1 Updated Mar 18, 2023

Distribute and run AI workloads on Kubernetes magically in Python, like PyTorch for ML infra.

Python 1,136 48 Updated Dec 23, 2025

Docker abstraction for SkyPilot

Python 4 Updated Jan 2, 2023

Tutorial to get started with SkyPilot!

Jupyter Notebook 58 9 Updated May 15, 2024

Launch jobs on sky

Python 5 1 Updated Dec 8, 2025

Training and serving large-scale neural networks with auto parallelization.

Python 3,172 356 Updated Dec 9, 2023

🔥 Blazing fast bulk data transfers between any cloud 🔥

Python 1,204 71 Updated May 11, 2024

A Domain-Agnostic Benchmark for Self-Supervised Learning

Python 108 12 Updated May 10, 2023

Run-time data-access policy enforcement for web applications.

Java 7 Updated May 27, 2022

Aqueduct is no longer being maintained. Aqueduct allows you to run LLM and ML workloads on any cloud infrastructure.

Go 521 20 Updated Jun 7, 2023

Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two lines of code.

Jupyter Notebook 670 57 Updated Feb 22, 2025

Balsa is a learned SQL query optimizer. It tailor-optimizes your SQL queries to find the best execution plans for your hardware and engine.

Python 143 34 Updated Jun 13, 2022

Source code and datasets for Ekya, a system for continuous learning on the edge.

Jupyter Notebook 111 22 Updated Mar 10, 2022

A library that translates Python and NumPy to optimized distributed systems code.

Python 131 26 Updated Sep 17, 2022

State-of-the-art neural cardinality estimators for join queries

Python 80 29 Updated Oct 6, 2020