Sponsoring

@neovim
@jart

Highlights

  • Pro

Organizations

@cloudwaste

Starred repositories

A PyTorch native library for training speculative decoding models

Python 89 13 Updated Apr 27, 2026

NVIDIA AITune is an inference toolkit designed for tuning and deploying Deep Learning models with a focus on NVIDIA GPUs.

Python 260 29 Updated Mar 13, 2026

Google Workspace CLI — one command-line tool for Drive, Gmail, Calendar, Sheets, Docs, Chat, Admin, and more. Dynamically built from Google Discovery Service. Includes AI agent skills.

Rust 25,463 1,309 Updated Apr 24, 2026

Slack automation CLI for AI agents

TypeScript 395 35 Updated Apr 20, 2026

Wrap Go binaries in Python wheels

Python 93 4 Updated Feb 10, 2026

A CLI to estimate inference memory requirements for Hugging Face models, written in Python.

Python 912 83 Updated Apr 27, 2026

Connect the dots - minimal task tracker in Zig

Zig 115 6 Updated Jan 19, 2026

Gas Town - multi-agent workspace manager

Go 14,697 1,336 Updated Apr 25, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,071 599 Updated Mar 13, 2026

GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters

Python 797 72 Updated Mar 6, 2026

NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments

Go 264 72 Updated Apr 27, 2026

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 949 83 Updated Feb 28, 2026

Nsight Python is a Python kernel profiling interface based on NVIDIA Nsight Tools

Python 197 13 Updated Apr 24, 2026

Autonomous GPU Kernel Generation & Optimization via Deep Agents

Python 387 65 Updated Apr 23, 2026

Primus-SaFE (Stability and Fault Endurance)

Go 56 Updated Apr 27, 2026

Minimalistic 4D-parallelism distributed training framework for educational purposes

Python 2,162 181 Updated Aug 26, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,112 275 Updated Apr 27, 2026
Python 754 33 Updated Mar 14, 2026

Distribute and run AI workloads on Kubernetes magically in Python, like PyTorch for ML infra.

Python 1,192 56 Updated Apr 13, 2026

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 895 105 Updated Apr 26, 2026

agent-sandbox enables easy management of isolated, stateful, singleton workloads, ideal for use cases like AI agent runtimes.

Go 1,950 221 Updated Apr 25, 2026

The best ChatGPT that $100 can buy.

Python 52,592 7,026 Updated Apr 14, 2026

Beads - A memory upgrade for your coding agent

Go 22,068 1,451 Updated Apr 27, 2026

An agentic skills framework & software development methodology that works.

Shell 169,644 14,978 Updated Apr 27, 2026

Open Source Continuous Inference Benchmarking Qwen3.5, DeepSeek, GPTOSS - GB200 NVL72 vs MI355X vs B200 vs GB300 NVL72 vs H100 & soon™ TPUv6e/v7/Trainium2/3

Python 888 149 Updated Apr 27, 2026

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 403 41 Updated Apr 27, 2026

GenAI inference performance benchmarking tool

Python 178 87 Updated Apr 24, 2026

A storage solution for PyTorch tensors with distributed tensor support.

Python 73 10 Updated Apr 26, 2026