@epwalsh
Stars
🚀 Efficient implementations of state-of-the-art linear attention models
Ship correct and fast LLM kernels to PyTorch
A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.
Simple and efficient DeepSeek V3 SFT using pipeline parallelism and expert parallelism, with both FP8 and BF16 training
Lightweight yet powerful formatter plugin for Neovim
Primary and community-submitted packages for webinstall.dev
fanshiqing / grouped_gemm
Forked from tgale96/grouped_gemm. PyTorch bindings for CUTLASS grouped GEMM.
Configuration with dataclasses + YAML + argparse. Fork of Pyrallis
Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/
A simple, performant and scalable Jax LLM!
PyTorch emulation library for Microscaling (MX)-compatible data formats
PyTorch building blocks for the OLMo ecosystem
Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)
GPU programming related news and material links
Ring attention implementation with flash attention
Efficient Triton Kernels for LLM Training
PyTorch implementation of models from the Zamba2 series.
For optimization algorithm research and development.
Tips for Writing a Research Paper using LaTeX
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…
Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI
PyTorch native quantization and sparsity for training and inference
Simple, safe way to store and distribute tensors