Skip to content
View a1trl9's full-sized avatar
🦊
🦊

Block or report a1trl9

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

TokenSpeed is a speed-of-light LLM inference engine.

Python 1,447 158 Updated Jun 17, 2026

Vera: a programming language designed for LLMs to write

Python 382 20 Updated Jun 16, 2026

Offline optimization of your disaggregated Dynamo graph

Python 336 127 Updated Jun 17, 2026

A PyTorch-native inference engine with cache, parallelism, quantization and cpu offload for DiTs.

Python 1,203 75 Updated Jun 16, 2026

Your Neovim AI sidekick

Lua 2,657 135 Updated Apr 22, 2026

Visualize CPython's specializing, adaptive interpreter. 🔥

Python 674 13 Updated May 19, 2024

TORCH_TRACE parser for PT2

Rust 86 28 Updated May 11, 2026

Our first fully AI generated deep learning system

Python 628 48 Updated Feb 2, 2026

The Modular Platform (includes MAX & Mojo)

Mojo 26,342 2,840 Updated Jun 16, 2026

FlagTree is a unified compiler supporting multiple AI chip backends for custom Deep Learning operations, which is forked from triton-lang/triton.

C++ 287 81 Updated Jun 17, 2026

JAX backend for SGL

Python 280 105 Updated Jun 17, 2026

Autonomous GPU Kernel Generation & Optimization via Deep Agents

Python 453 75 Updated Jun 6, 2026

A Lightweight LLM Post-Training Library

Python 2,346 310 Updated Jun 17, 2026

magic-trace collects and displays high-resolution traces of what a process is doing

OCaml 6,102 194 Updated Jun 15, 2026

🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide better code/research plans 🧰 OpenAI, Anthropic, Gemini, Ollam…

Python 1,562 106 Updated Jul 27, 2025

LMCache: Supercharge Your LLM with the Fastest KV Cache Layer

Python 9,207 1,337 Updated Jun 17, 2026

NVIDIA Inference Xfer Library (NIXL)

C++ 1,086 355 Updated Jun 17, 2026

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 885 152 Updated Jun 17, 2026

Wave: Python Domain-Specific Language for High Performance Machine Learning

Python 58 32 Updated Jun 8, 2026

jax-triton contains integrations between JAX and OpenAI Triton

Python 462 57 Updated Jun 1, 2026

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

Cuda 2,315 221 Updated Jun 16, 2026

Development repository for the Triton-Linalg conversion

C++ 221 31 Updated Feb 7, 2025

Efficient Triton Kernels for LLM Training

Python 6,441 541 Updated Jun 16, 2026

FlagGems is an operator library for large language models implemented in the Triton Language.

Python 1,029 414 Updated Jun 17, 2026

Maple Mono: Open source monospace font with round corner, ligatures and Nerd-Font icons for IDE and terminal, fine-grained customization options. 带连字和控制台图标的圆角等宽字体,中英文宽度完美2:1,细粒度的自定义选项

Python 26,598 1,083 Updated Jun 12, 2026

QwQ is the reasoning model series developed by Qwen team, Alibaba Cloud.

Python 520 26 Updated Mar 27, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 83,109 18,131 Updated Jun 17, 2026

IREE's PyTorch Frontend, based on Torch Dynamo.

Python 110 82 Updated Jun 8, 2026
Next