FlashInfer: Kernel Library for LLM Serving

Cuda 3,872 534 Updated Oct 9, 2025

kernels, of the mega variety

Python 579 25 Updated Sep 28, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 69,516 2,096 Updated Oct 9, 2025

An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Linux Foundation.

Rust 1,797 71 Updated Oct 10, 2025

Fil-C: completely compatible memory safety for C and C++

C 1,419 43 Updated Oct 7, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,784 710 Updated Oct 9, 2025

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 2,496 238 Updated Oct 8, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,125 2,519 Updated Oct 9, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 994 129 Updated Oct 10, 2025

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 8,555 1,474 Updated Sep 25, 2025

A Quirky Assortment of CuTe Kernels

Python 613 48 Updated Oct 9, 2025

A lightweight, local-first, and free experiment tracking library from Hugging Face 🤗

Python 931 61 Updated Oct 9, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 59,719 10,588 Updated Oct 9, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 18,711 3,099 Updated Oct 10, 2025

Model Compression Toolbox for Large Language Models and Diffusion Models

Python 662 58 Updated Aug 14, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,587 947 Updated Oct 9, 2025

learn from your favorite tech companies

TypeScript 165 16 Updated Aug 1, 2025
Rust 11 Updated Jul 25, 2024

Deep learning in Rust, with shape checked tensors and neural networks

Rust 1,852 104 Updated Jul 23, 2024

[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models

Rust 6,136 375 Updated Jun 24, 2024
Rust 1 Updated May 14, 2023

LLaMA 7B with CUDA acceleration, implemented in Rust. Minimal GPU memory needed!

Rust 109 6 Updated Jul 27, 2023
Rust 93 17 Updated Jan 9, 2025

Build full-stack apps on your own infrastructure.

TypeScript 24,648 1,927 Updated Oct 6, 2025

A cross-platform GUI library for Rust, inspired by Elm

Rust 27,836 1,381 Updated Oct 8, 2025

Build smaller, faster, and more secure desktop and mobile applications with a web frontend.

Rust 97,196 3,093 Updated Oct 9, 2025

Terraform enables you to safely and predictably create, change, and improve infrastructure. It is a source-available tool that codifies APIs into declarative configuration files that can be shared …

Go 46,735 10,057 Updated Oct 9, 2025

Stockfish NNUE (Chess evaluation) trainer in PyTorch

Python 418 115 Updated Oct 4, 2025

A free and strong UCI chess engine

C++ 13,917 2,638 Updated Oct 7, 2025

SC2 API for Rust

Rust 45 22 Updated Feb 5, 2025