- Central Oregon
(UTC -07:00) - @epwalsh
Stars
Context window optimization for AI coding agents. Sandboxes tool output, 98% reduction. 12 platforms
An agentic skills framework & software development methodology that works.
A lightweight Model Context Protocol (MCP) server for safe Obsidian vault access
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Accelerating MoE with IO and Tile-aware Optimizations
🚀 Efficient implementations for emerging model architectures
Ship correct and fast LLM kernels to PyTorch
A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.
Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 training
Lightweight yet powerful formatter plugin for Neovim
Primary and community-submitted packages for webinstall.dev
fanshiqing / grouped_gemm
Forked from tgale96/grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM.
Configuration with Dataclasses+YAML+Argparse. Fork of Pyrallis
Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/
A simple, performant and scalable Jax LLM!
PyTorch emulation library for Microscaling (MX)-compatible data formats
PyTorch building blocks for the OLMo ecosystem
Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)
GPU programming related news and material links
Ring attention implementation with flash attention
Efficient Triton Kernels for LLM Training
PyTorch implementation of models from the Zamba2 series.