Skip to content
View AHEADer's full-sized avatar

Block or report AHEADer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[Notice] The repo temporarily locked while ownership transfer. in the meantime we maintain on here: https://github.com/ultraworkers/claw-code-parity. The fastest repo in history to surpass 100K sta…

Rust 139,364 101,594 Updated Apr 2, 2026

AI agents running research on single-GPU nanochat training automatically

Python 63,885 9,014 Updated Mar 26, 2026

A PyTorch native library for training speculative decoding models

Python 63 9 Updated Apr 2, 2026

Give your agents the power of the Hugging Face ecosystem

Python 10,016 609 Updated Apr 1, 2026

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,787 1,025 Updated Mar 30, 2026

Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across SGLang, vLLM, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat histor…

Rust 137 39 Updated Apr 2, 2026

Automated GPU Kernel Generation via Co-Evolving Intrinsic World Model

Python 92 17 Updated Mar 2, 2026

Public repository for Agent Skills

Python 108,948 12,183 Updated Mar 25, 2026

An agentic skills framework & software development methodology that works.

Shell 131,440 10,881 Updated Apr 2, 2026
C++ 35 3 Updated Mar 5, 2026

Benchmark SGLang on SLURM

Python 24 39 Updated Apr 1, 2026

đź’« Toolkit to help you get started with Spec-Driven Development

Python 84,624 7,244 Updated Apr 2, 2026

Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.

Python 125 72 Updated Apr 2, 2026

Debug the intermediate outputs of two models.

HTML 3 Updated Aug 8, 2025

An Adaptive Pencil Decomposition Library for NVIDIA GPUs

C++ 83 13 Updated Mar 23, 2026

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Go 3,580 593 Updated Apr 2, 2026

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 15,875 1,526 Updated Mar 4, 2026

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning

Python 296 114 Updated Nov 3, 2025

A configuration framework that enhances Claude Code with specialized commands, cognitive personas, and development methodologies.

Python 22,108 1,863 Updated Mar 22, 2026

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more!

Python 9,119 783 Updated Apr 2, 2026

MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flexible speaker control, and multilingual support, while enablin…

Python 1,229 118 Updated Mar 23, 2026

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 18,908 2,047 Updated Mar 30, 2026

A generative speech model for daily dialogue.

Python 39,015 4,233 Updated Jan 18, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,241 2,241 Updated Apr 2, 2026

Kernels, of the mega variety :)

Python 698 53 Updated Apr 1, 2026

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…

134,093 33,802 Updated Mar 28, 2026

A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.

Python 125 15 Updated Dec 25, 2025

Distributed Compiler based on Triton for Parallel Systems

Python 1,400 136 Updated Mar 11, 2026
Next