Skip to content
View ai-hpc's full-sized avatar
πŸ¦™
Research Physical AI agent safety, performance, memory
πŸ¦™
Research Physical AI agent safety, performance, memory

Highlights

  • Pro

Organizations

@openclaw @GeniePod @FastCrest

Block or report ai-hpc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,804 927 Updated Jun 14, 2026

Development repository for the Triton language and compiler

MLIR 19,437 2,937 Updated Jun 14, 2026

TokenSpeed is a speed-of-light LLM inference engine.

Python 1,427 156 Updated Jun 14, 2026

FlyDSL is the Python front‑end of the project: Flexible LaYout DSL.

Python 202 63 Updated Jun 14, 2026

High-performance, light-weight C++ LLM and VLM Inference Software for Physical AI

Python 436 78 Updated Jun 3, 2026

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,751 201 Updated Jun 25, 2024

Open Machine Learning Compiler Framework

Python 13,465 3,891 Updated Jun 14, 2026

All information and news with respect to Falcon-H1 series

119 15 Updated Oct 9, 2025

Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.

Python 1,305 150 Updated Jun 12, 2026

Interactive 3D visualization of dense decoder-only LLM inference. Companion to the AI Inference Engineer 2026 course.

TypeScript 1 Updated Jun 2, 2026

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python 844 44 Updated Mar 15, 2026

Turn your PC, Mac, or Linux box into an AI server. LLM inference, chat UI, voice, agents, workflows, RAG, and image generation.

Shell 1,992 300 Updated Jun 14, 2026

Machine Learning Systems

Python 24,877 2,989 Updated Jun 14, 2026

Rust home automation runtime for Genie: local device graph, deterministic actuation safety, audit logs, and AI-native home-control APIs.

Rust 1 Updated Apr 26, 2026

Jetson Orin-tuned LLM inference runtime for GenieClaw β€” memory-first, power-aware, zero-allocation. C++17 + CUDA.

Cuda 1 2 Updated Jun 6, 2026

πŸ§ƒ Token weight loss. Lean output compaction for terminal-heavy agent workflows. Works as a native CLI tool or as an extension to popular coding and agent frameworks.

TypeScript 440 44 Updated Jun 12, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 378,643 79,182 Updated Jun 14, 2026

Hosted Solution (Jetson Linux) with ESP32 (Wi-Fi + BT + BLE)

C 1 Updated Apr 20, 2026

GeniePod Home V1 hardware: MVP testing build, wiring, BOM, and planned interface-board/enclosure docs.

2 Updated May 21, 2026

🦞 Low-latency, limited-context AI harness for private on-device homes.

Rust 46 40 Updated Jun 12, 2026

Install and run NemoClaw on NVIDIA Jetson Orin with a patched OpenShell cluster image and streamlined onboarding.

Shell 26 9 Updated Apr 17, 2026

High-performance C++/CUDA GPU-accelerated STARK prover for Triton VM

Rust 1 Updated Jan 28, 2026

Master AI inference, AI agent harness systems, and hardware engineering β€” then design a physical AI chip. That is the goal.

HTML 191 33 Updated Jun 13, 2026

openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.

Python 61,358 10,978 Updated Jun 14, 2026

High-performance C++/CUDA GPU-accelerated XNT Miner

C++ 2 Updated Feb 13, 2026

High-performance C++/CUDA GPU-accelerated STARK prover for Triton VM

Rust 4 1 Updated Jan 28, 2026
Rust 2 1 Updated Jun 13, 2026
Rust 12 8 Updated Jun 12, 2026

anonymous peer-to-peer cash

Rust 107 45 Updated Jun 12, 2026
Next