Skip to content
View abatilo's full-sized avatar

Sponsoring

@neovim
@jart

Highlights

  • Pro

Organizations

@cloudwaste

Block or report abatilo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 1,059 78 Updated Dec 18, 2025

GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters

Python 573 51 Updated Dec 12, 2025

NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments

Go 127 29 Updated Dec 18, 2025

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 865 71 Updated Dec 18, 2025

Nsight Python is a Python kernel profiling interface based on NVIDIA Nsight Tools

Python 75 6 Updated Dec 18, 2025

Autonomous GPU Kernel Generation via Deep Agents

Python 187 20 Updated Dec 18, 2025

Primus-SaFE(Stability and Fault Endurance)

Go 44 Updated Dec 18, 2025

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,923 149 Updated Aug 26, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,496 180 Updated Dec 18, 2025
Python 701 30 Updated Oct 31, 2025

Distribute and run AI workloads on Kubernetes magically in Python, like PyTorch for ML infra.

Python 1,132 47 Updated Dec 18, 2025

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 716 73 Updated Nov 30, 2025

agent-sandbox enables easy management of isolated, stateful, singleton workloads, ideal for use cases like AI agent runtimes.

Go 499 70 Updated Dec 18, 2025

The best ChatGPT that $100 can buy.

Python 38,858 4,903 Updated Dec 9, 2025

Beads - A memory upgrade for your coding agent

Go 5,746 355 Updated Dec 18, 2025

Claude Code superpowers: core skills library

Shell 10,261 868 Updated Dec 18, 2025

Open Source Continuous Inference Benchmarking - GB200 NVL72 vs MI355X vs B200 vs H200 vs MI325X & soon™ TPUv6e/v7/Trainium2/3/GB300 NVL72 - DeepSeek 670B MoE, GPTOSS

Python 399 65 Updated Dec 18, 2025

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 337 34 Updated Dec 18, 2025

GenAI inference performance benchmarking tool

Python 137 54 Updated Dec 18, 2025

A storage solution for PyTorch tensors with distributed tensor support.

Python 47 6 Updated Dec 18, 2025

PyTorch-native post-training at scale

Python 569 71 Updated Dec 18, 2025

Post-training with Tinker

Python 2,572 243 Updated Dec 18, 2025

GitHub Copilot CLI brings the power of Copilot coding agent directly to your terminal.

Shell 6,016 707 Updated Dec 18, 2025

nv-one-logger enables tracking of GPU application progress over time and can help to identify overhead from workload and cluster inefficiencies to provide efficiency metrics.

Python 21 3 Updated Nov 6, 2025

Nano vLLM

Python 9,726 1,226 Updated Nov 3, 2025

Agent Builder and Runtime by Docker Engineering

Go 1,753 201 Updated Dec 18, 2025

Intelligent Router for Mixture-of-Models

Go 2,478 326 Updated Dec 18, 2025

Environments for LLM Reinforcement Learning

Python 3,639 454 Updated Dec 18, 2025

Pipeline parallelism for the minimalist

Python 37 1 Updated Aug 6, 2025
Next