abatilo

Aaron Batilo abatilo

If I don't have to do it, I won't. If I have to do it, I'll do it as quickly as possible.

61 followers · 1 following

Sponsoring

Achievements

x3 x4 x2

Achievements

x3 x4 x2

Highlights

Organizations

Lists (1)

Sort

Training stack

3 repositories

Starred repositories

lightseekorg / TorchSpec

A PyTorch native library for training speculative decoding models

Python 89 14 Updated Apr 28, 2026

ai-dynamo / aitune

NVIDIA AITune is an inference toolkit designed for tuning and deploying Deep Learning models with a focus on NVIDIA GPUs.

Python 260 29 Updated Mar 13, 2026

NVIDIA-NeMo / ProRL-Agent-Server

Python 128 21 Updated Apr 26, 2026

googleworkspace / cli

Google Workspace CLI — one command-line tool for Drive, Gmail, Calendar, Sheets, Docs, Chat, Admin, and more. Dynamically built from Google Discovery Service. Includes AI agent skills.

Rust 25,495 1,310 Updated Apr 28, 2026

stablyai / agent-slack

Slack automation CLI for AI agents

TypeScript 398 35 Updated Apr 28, 2026

simonw / go-to-wheel

Wrap Go binaries in Python wheels

Python 93 4 Updated Feb 10, 2026

alvarobartt / hf-mem

A CLI to estimate inference memory requirements for Hugging Face models, written in Python.

Python 913 83 Updated Apr 27, 2026

joelreymont / dots

Connect the dots - minimal task tracker in Zig

Zig 115 6 Updated Jan 19, 2026

gastownhall / gastown

Gas Town - multi-agent workspace manager

Go 14,720 1,340 Updated Apr 25, 2026

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,073 599 Updated Mar 13, 2026

zai-org / GLM-ASR

GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters

Python 797 72 Updated Mar 6, 2026

NVIDIA / NVSentinel

NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments

Go 265 72 Updated Apr 27, 2026

MoonshotAI / checkpoint-engine

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 951 83 Updated Feb 28, 2026

NVIDIA / nsight-python

Nsight Python is a Python kernel profiling interface based on NVIDIA Nsight Tools

Python 197 13 Updated Apr 24, 2026

meta-pytorch / KernelAgent

Autonomous GPU Kernel Generation & Optimization via Deep Agents

Python 387 65 Updated Apr 23, 2026

AMD-AGI / Primus-SaFE

Primus-SaFE(Stability and Fault Endurance)

Go 56 Updated Apr 28, 2026

huggingface / picotron

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 2,163 181 Updated Aug 26, 2025

alibaba / ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,110 275 Updated Apr 28, 2026

awslabs / git-remote-s3

Python 754 33 Updated Mar 14, 2026

cfregly / ai-performance-engineering

Python 1,345 192 Updated Mar 31, 2026

run-house / kubetorch

Distribute and run AI workloads on Kubernetes magically in Python, like PyTorch for ML infra.

Python 1,192 56 Updated Apr 13, 2026

ovg-project / kvcached

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 897 105 Updated Apr 26, 2026

kubernetes-sigs / agent-sandbox

agent-sandbox enables easy management of isolated, stateful, singleton workloads, ideal for use cases like AI agent runtimes.

Go 1,954 224 Updated Apr 28, 2026

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 52,616 7,038 Updated Apr 14, 2026

gastownhall / beads

Beads - A memory upgrade for your coding agent

Go 22,366 1,465 Updated Apr 27, 2026

obra / superpowers

An agentic skills framework & software development methodology that works.

Shell 170,496 15,053 Updated Apr 28, 2026

SemiAnalysisAI / InferenceX

Open Source Continuous Inference Benchmarking Qwen3.5, DeepSeek, GPTOSS - GB200 NVL72 vs MI355X vs B200 vs GB300 NVL72 vs H100 & soon™ TPUv6e/v7/Trainium2/3

Python 891 152 Updated Apr 28, 2026