Skip to content
View tmm1's full-sized avatar

Sponsoring

@mattn
@jart
@mschoch
@joshdholtz
@BurntSushi
@matthuisman
@ziglang
@formkit

Highlights

  • Pro

Organizations

@rubinius @postrank-labs @graphite-project @fancybits

Block or report tmm1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Megakernel to match Apple Silicon Efficiency at 2x the Throughput on a RTX 3090

Cuda 156 14 Updated Apr 9, 2026

A subset of Go that translates to C

Go 484 11 Updated Apr 9, 2026
Rust 1,135 81 Updated Apr 9, 2026

Grapevine is a self-hostable realtime unified context system

Python 33 1 Updated Mar 3, 2026

⚡ Native MLX Swift LLM inference server for Apple Silicon. OpenAI-compatible API, SSD streaming for 100B+ MoE models, TurboQuant KV cache compression, + iOS iPhone app.

Swift 259 10 Updated Apr 10, 2026

A free and open immersive video player for the Apple Vision Pro.

Swift 126 18 Updated Feb 4, 2026

LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar

Python 9,298 785 Updated Apr 10, 2026

Native macOS Wayland Compositor written in Rust using Smithay. Experience seamless Linux app streaming on macOS without XQuartz.

Rust 801 8 Updated Mar 31, 2026
Go 166 7 Updated Mar 30, 2026

Zig INferenCe Engine — Local LLM inference on AMD GPUs and Apple Silicon

Zig 290 9 Updated Apr 9, 2026

The agent that grows with you

Python 45,476 5,877 Updated Apr 10, 2026

Use your cursor subscription in opencode

TypeScript 193 14 Updated Mar 22, 2026

Sub-millisecond VM sandboxes for AI agents via copy-on-write forking

Rust 2,117 93 Updated Mar 21, 2026

Official Codebase for "Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights"

Python 529 54 Updated Apr 2, 2026

A lightweight cli for running single-purpose AI agents. Define focused agents in TOML, trigger them from anywhere; pipes, git hooks, cron, or the terminal.

Go 787 24 Updated Apr 6, 2026

Metal Flash Attention for MLX

Python 12 Updated Apr 6, 2026

Native Mac OS GUI for Using mlx-lm-lora.

Swift 59 3 Updated Dec 19, 2025

Run AI agents isolated in a macOS user account and sandbox-exec. Configured to run Claude Code, OpenAI Codex, Cursor Agent, Google Gemini.

Shell 216 11 Updated Apr 3, 2026

Go bindings to QuickJS

Go 3 Updated Dec 6, 2023

Go bindings to QuickJS

Go 168 21 Updated Apr 4, 2026

Nightshift uses your leftover Claude / Codex budget to surprise you with useful PRs. Love them or leave them.

Go 959 39 Updated Apr 9, 2026

Hypernetworks that update LLMs to remember factual information

Python 665 71 Updated Mar 2, 2026

The production engine for directional ablation. Unalign / remove models censorship efficiently on any hardware.

Python 24 2 Updated Mar 21, 2026

Flux 2 image generation model pure C inference

C 1,918 132 Updated Feb 13, 2026

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

Java 13,916 1,162 Updated Apr 9, 2026

The inference engine the open-source world built for itself.

Python 145 1 Updated Mar 30, 2026

Apple HomeKit for ESPHome - Alpha WIP

C++ 280 37 Updated Feb 21, 2026
Go 13 2 Updated Apr 1, 2026
Next