Skip to content
View abatilo's full-sized avatar

Sponsoring

@neovim
@jart

Highlights

  • Pro

Organizations

@cloudwaste

Block or report abatilo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

NVIDIA AITune is an inference toolkit designed for tuning and deploying Deep Learning models with a focus on NVIDIA GPUs.

Python 236 26 Updated Mar 13, 2026

Google Workspace CLI — one command-line tool for Drive, Gmail, Calendar, Sheets, Docs, Chat, Admin, and more. Dynamically built from Google Discovery Service. Includes AI agent skills.

Rust 24,818 1,264 Updated Apr 8, 2026

Slack automation CLI for AI agents

TypeScript 383 31 Updated Apr 15, 2026

Wrap Go binaries in Python wheels

Python 91 4 Updated Feb 10, 2026

A CLI to estimate inference memory requirements for Hugging Face models, written in Python.

Python 905 82 Updated Apr 7, 2026

Connect the dots - minimal task tracker in Zig

Zig 114 6 Updated Jan 19, 2026

Gas Town - multi-agent workspace manager

Go 14,176 1,284 Updated Apr 15, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,001 578 Updated Mar 13, 2026

GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters

Python 791 72 Updated Mar 6, 2026

NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments

Go 256 69 Updated Apr 16, 2026

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 940 82 Updated Feb 28, 2026

Nsight Python is a Python kernel profiling interface based on NVIDIA Nsight Tools

Python 191 13 Updated Apr 15, 2026

Autonomous GPU Kernel Generation & Optimization via Deep Agents

Python 369 61 Updated Apr 14, 2026

Primus-SaFE(Stability and Fault Endurance)

Go 56 Updated Apr 16, 2026

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 2,149 178 Updated Aug 26, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,083 269 Updated Apr 16, 2026
Python 740 33 Updated Mar 14, 2026

Distribute and run AI workloads on Kubernetes magically in Python, like PyTorch for ML infra.

Python 1,178 53 Updated Apr 13, 2026

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 858 100 Updated Apr 7, 2026

agent-sandbox enables easy management of isolated, stateful, singleton workloads, ideal for use cases like AI agent runtimes.

Go 1,825 199 Updated Apr 15, 2026

The best ChatGPT that $100 can buy.

Python 51,944 6,902 Updated Apr 14, 2026

Beads - A memory upgrade for your coding agent

Go 20,814 1,393 Updated Apr 15, 2026

An agentic skills framework & software development methodology that works.

Shell 155,517 13,498 Updated Apr 16, 2026

Open Source Continuous Inference Benchmarking Qwen3.5, DeepSeek, GPTOSS - GB200 NVL72 vs MI355X vs B200 vs GB300 NVL72 vs H100 & soon™ TPUv6e/v7/Trainium2/3

Python 806 134 Updated Apr 16, 2026

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 396 42 Updated Apr 16, 2026

GenAI inference performance benchmarking tool

Python 172 85 Updated Apr 15, 2026

A storage solution for PyTorch tensors with distributed tensor support.

Python 71 10 Updated Apr 15, 2026

PyTorch-native post-training at scale

Python 667 97 Updated Apr 16, 2026
Next