ai-hpc

🦙

Research Physical AI agent safety, performance, memory

NVIDIAN ai-hpc

I am a GPU computing expert who is always challenging myself to build better solutions. I love finding solutions with limited resources.

1.6k followers · 64 following

Achievements

x2 x3 x3

Stars

BBuf / tvm_mlir_learn

compiler learning resources collect.

Python 2,749 370 Updated May 20, 2026

mit-han-lab / KernelWiki

Python 265 32 Updated Jun 9, 2026

mit-han-lab / llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,568 316 Updated Jul 17, 2025

mit-han-lab / vlash

Real-Time VLAs via Future-state-aware Asynchronous Inference.

Python 418 33 Updated Apr 22, 2026

z-lab / dflash

DFlash: Block Diffusion for Flash Speculative Decoding

Python 5,193 373 Updated May 10, 2026

open-lm-engine / coda-kernels

CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs

Python 216 22 Updated Jun 22, 2026

NVIDIA / CompileIQ

An Optimizer for Nvidia Compilers.

Python 100 6 Updated Jun 15, 2026

zezhishao / DailyArXiv

Daily ArXiv Papers.

Python 448 100 Updated Jun 21, 2026

iree-org / iree

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,818 933 Updated Jun 22, 2026

triton-lang / triton

Development repository for the Triton language and compiler

MLIR 19,498 2,954 Updated Jun 22, 2026

lightseekorg / tokenspeed

TokenSpeed is a speed-of-light LLM inference engine.

Python 1,480 166 Updated Jun 22, 2026

ROCm / FlyDSL

FlyDSL is the Python front‑end of the project: Flexible LaYout DSL.

Python 205 67 Updated Jun 22, 2026

NVIDIA / TensorRT-Edge-LLM

High-performance, light-weight C++ LLM and VLM Inference Software for Physical AI

Python 445 79 Updated Jun 3, 2026

FasterDecoding / Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,749 201 Updated Jun 25, 2024

apache / tvm

Open Machine Learning Compiler Framework

Python 13,486 3,899 Updated Jun 22, 2026

tiiuae / Falcon-H1

All information and news with respect to Falcon-H1 series

119 15 Updated Oct 9, 2025

Tencent / AngelSlim

Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.

Python 1,325 153 Updated Jun 22, 2026

ai-hpc / llm-inference-viz

Interactive 3D visualization of dense decoder-only LLM inference. Companion to the AI Inference Engineer 2026 course.

TypeScript 1 Updated Jun 15, 2026

EleutherAI / cookbook

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python 844 44 Updated Mar 15, 2026

Light-Heart-Labs / DreamServer

Turn your PC, Mac, or Linux box into an AI server. LLM inference, chat UI, voice, agents, workflows, RAG, and image generation.

Shell 2,142 334 Updated Jun 22, 2026

harvard-edge / cs249r_book

Machine Learning Systems

Python 24,983 3,002 Updated Jun 22, 2026

GeniePod / genie-home-runtime

Rust home automation runtime for Genie: local device graph, deterministic actuation safety, audit logs, and AI-native home-control APIs.

Rust 1 Updated Apr 26, 2026

GeniePod / genie-ai-runtime

Jetson Orin-tuned LLM inference runtime for gemma 4, qwen 3.5 — memory-first, power-aware, zero-allocation. C++17 + CUDA.

Cuda 1 2 Updated Jun 22, 2026

vincentkoc / tokenjuice

🧃 Token weight loss. Lean output compaction for terminal-heavy agent workflows. Works as a native CLI tool or as an extension to popular coding and agent frameworks.

TypeScript 460 48 Updated Jun 18, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 379,951 79,547 Updated Jun 22, 2026

ai-hpc / jetson-esp-hosted

Forked from espressif/esp-hosted

Hosted Solution (Jetson Linux) with ESP32 (Wi-Fi + BT + BLE)

C 1 Updated Apr 20, 2026

Rust 1 Updated Jan 28, 2026

GeniePod / genie-hardware

GeniePod / genie-claw

jetsonhacks / NemoClaw-Orin

ai-hpc / triton-vm-prover