ai-hpc
🦙
Research Physical AI agent safety, performance, memory

Organizations

@openclaw @GeniePod @FastCrest


A collection of compiler learning resources.

Python 2,749 370 Updated May 20, 2026
Python 265 32 Updated Jun 9, 2026

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,568 316 Updated Jul 17, 2025

Real-Time VLAs via Future-state-aware Asynchronous Inference.

Python 418 33 Updated Apr 22, 2026

DFlash: Block Diffusion for Flash Speculative Decoding

Python 5,193 373 Updated May 10, 2026

CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs

Python 216 22 Updated Jun 22, 2026

An Optimizer for Nvidia Compilers.

Python 100 6 Updated Jun 15, 2026

Daily ArXiv Papers.

Python 448 100 Updated Jun 21, 2026

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,818 933 Updated Jun 22, 2026

Development repository for the Triton language and compiler

MLIR 19,498 2,954 Updated Jun 22, 2026

TokenSpeed is a speed-of-light LLM inference engine.

Python 1,480 166 Updated Jun 22, 2026

FlyDSL is the Python front‑end of the project: Flexible LaYout DSL.

Python 205 67 Updated Jun 22, 2026

High-performance, light-weight C++ LLM and VLM Inference Software for Physical AI

Python 445 79 Updated Jun 3, 2026

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,749 201 Updated Jun 25, 2024

Open Machine Learning Compiler Framework

Python 13,486 3,899 Updated Jun 22, 2026

All information and news about the Falcon-H1 series

119 15 Updated Oct 9, 2025

Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.

Python 1,325 153 Updated Jun 22, 2026

Interactive 3D visualization of dense decoder-only LLM inference. Companion to the AI Inference Engineer 2026 course.

TypeScript 1 Updated Jun 15, 2026

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python 844 44 Updated Mar 15, 2026

Turn your PC, Mac, or Linux box into an AI server. LLM inference, chat UI, voice, agents, workflows, RAG, and image generation.

Shell 2,142 334 Updated Jun 22, 2026

Machine Learning Systems

Python 24,983 3,002 Updated Jun 22, 2026

Rust home automation runtime for Genie: local device graph, deterministic actuation safety, audit logs, and AI-native home-control APIs.

Rust 1 Updated Apr 26, 2026

Jetson Orin-tuned LLM inference runtime for gemma 4, qwen 3.5 — memory-first, power-aware, zero-allocation. C++17 + CUDA.

Cuda 1 2 Updated Jun 22, 2026

🧃 Token weight loss. Lean output compaction for terminal-heavy agent workflows. Works as a native CLI tool or as an extension to popular coding and agent frameworks.

TypeScript 460 48 Updated Jun 18, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 379,951 79,547 Updated Jun 22, 2026

Hosted Solution (Jetson Linux) with ESP32 (Wi-Fi + BT + BLE)

C 1 Updated Apr 20, 2026

GeniePod Home V1 hardware: MVP testing build, wiring, BOM, and planned interface-board/enclosure docs.

2 Updated May 21, 2026

🦞 Low-latency, limited-context AI harness for private on-device homes.

Rust 50 40 Updated Jun 22, 2026

Install and run NemoClaw on NVIDIA Jetson Orin with a patched OpenShell cluster image and streamlined onboarding.

Shell 26 9 Updated Apr 17, 2026

High-performance C++/CUDA GPU-accelerated STARK prover for Triton VM

Rust 1 Updated Jan 28, 2026