Lambda, Inc.
Stars
FlashInfer: Kernel Library for LLM Serving
An extremely fast Python package and project manager, written in Rust.
An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Linux Foundation.
pizlonator / fil-c
Forked from llvm/llvm-project
Fil-C: completely compatible memory safety for C and C++
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves a speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.
verl: Volcano Engine Reinforcement Learning for LLMs
SkyRL: A Modular Full-stack RL Library for LLMs
CUDA Templates and Python DSLs for High-Performance Linear Algebra
A lightweight, local-first, and free experiment tracking library from Hugging Face 🤗
A high-throughput and memory-efficient inference and serving engine for LLMs
SGLang is a fast serving framework for large language models and vision language models.
Model Compression Toolbox for Large Language Models and Diffusion Models
DeepEP: an efficient expert-parallel communication library
Deep learning in Rust, with shape checked tensors and neural networks
[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models
LLaMa 7b with CUDA acceleration implemented in Rust. Minimal GPU memory needed!
Build full-stack apps on your own infrastructure.
A cross-platform GUI library for Rust, inspired by Elm
Build smaller, faster, and more secure desktop and mobile applications with a web frontend.
Terraform enables you to safely and predictably create, change, and improve infrastructure. It is a source-available tool that codifies APIs into declarative configuration files that can be shared …
Stockfish NNUE (chess evaluation) trainer in PyTorch
A free and strong UCI chess engine