Skip to content
View AHEADer's full-sized avatar

Block or report AHEADer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AI agents running research on single-GPU nanochat training automatically

Python 60,363 8,394 Updated Mar 26, 2026

A PyTorch native library for training speculative decoding models

Python 58 9 Updated Mar 27, 2026

Give your agents the power of the Hugging Face ecosystem

Python 9,962 608 Updated Mar 25, 2026

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,783 1,023 Updated Mar 27, 2026

Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across SGLang, vLLM, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat histor…

Rust 133 38 Updated Mar 29, 2026

Automated GPU Kernel Generation via Co-Evolving Intrinsic World Model

Python 91 17 Updated Mar 2, 2026

Public repository for Agent Skills

Python 105,590 11,684 Updated Mar 25, 2026

An agentic skills framework & software development methodology that works.

Shell 122,435 9,960 Updated Mar 26, 2026
C++ 35 3 Updated Mar 5, 2026

Benchmark SGLang on SLURM

Python 23 37 Updated Mar 27, 2026

đź’« Toolkit to help you get started with Spec-Driven Development

Python 83,389 7,133 Updated Mar 27, 2026

Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.

Python 123 68 Updated Mar 27, 2026

Debug the intermediate outputs of two models.

HTML 3 Updated Aug 8, 2025

An Adaptive Pencil Decomposition Library for NVIDIA GPUs

C++ 82 13 Updated Mar 23, 2026

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Go 3,558 592 Updated Mar 29, 2026

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 15,821 1,517 Updated Mar 4, 2026

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning

Python 296 112 Updated Nov 3, 2025

A configuration framework that enhances Claude Code with specialized commands, cognitive personas, and development methodologies.

Python 22,024 1,863 Updated Mar 22, 2026

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more!

Python 9,079 775 Updated Mar 28, 2026

MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flexible speaker control, and multilingual support, while enablin…

Python 1,225 117 Updated Mar 23, 2026

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 18,875 2,040 Updated Mar 24, 2026

A generative speech model for daily dialogue.

Python 39,002 4,234 Updated Jan 18, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,215 2,223 Updated Mar 29, 2026

Kernels, of the mega variety :)

Python 696 49 Updated Mar 29, 2026

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…

133,622 33,685 Updated Mar 28, 2026

A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.

Python 125 15 Updated Dec 25, 2025

Distributed Compiler based on Triton for Parallel Systems

Python 1,398 135 Updated Mar 11, 2026

Invert scroll direction for physical scroll wheels while maintaining "Natural" scrolling for trackpads on MacOS

Swift 3,986 86 Updated Mar 29, 2026
Next