Stars
A community trust management system based on explicit vouches to participate.
Supporting code for Welch Labs AI Book
OxiBonsai is a zero-FFI, zero-C/C++ inference engine for PrismML's sub-2-bit Bonsai family — both the 1-bit line (Q1_0_g128) and the ternary line (TQ2_0_g128). It runs on CPU (SIMD), Apple Silicon …
Run agents like Hermes and OpenClaw more securely inside NVIDIA OpenShell with managed inference
Hundreds of models & providers. One command to find what runs on your hardware.
Multiple Instance Learning for Spatial Transcriptomics
Fully local, private and cross platform Speech-to-Text with LLM Post-processing
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).
Robyn is a Super Fast Async Python Web Framework with a Rust runtime.
HTTP routing and request-handling library for Rust that focuses on ergonomics and modularity
A tool for creating and running Linux containers using lightweight virtual machines on a Mac. It is written in Swift, and optimized for Apple silicon.
Low-latency AI engine for mobile devices & wearables
An extremely fast Python type checker and language server, written in Rust.
Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.
On-device AI across mobile, embedded and edge for PyTorch
Inference server benchmarking tool
A curated list of materials on AI efficiency
A language server for Zig supporting developers with features like autocomplete and goto definition