Skip to content
View AlexCheema's full-sized avatar
  • University of Oxford

Sponsoring

@mudler

Block or report AlexCheema

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A repo of useful MLX skills.

Python 79 4 Updated Jan 25, 2026

Infer Ring is an iOS and macOS app that facilitates cross-device LLM inference using MLX

Swift 10 2 Updated Mar 22, 2026

mactop - Apple Silicon Monitor Top

Go 1,124 41 Updated Mar 23, 2026

Archive: Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 20 2 Updated Dec 17, 2025

⚙️ A macOS DMG package builder with a native GUI and CLI support

Go 12 Updated Jun 14, 2025

Pretraining and inference code for a large-scale depth-recurrent language model

Python 868 78 Updated Dec 29, 2025

Artificial Neural Engine Machine Learning Library

Python 1,548 69 Updated Mar 10, 2026

The CLI for GPUs

Python 400 9 Updated Mar 21, 2026

Vane is an AI-powered answering engine.

TypeScript 33,428 3,624 Updated Mar 10, 2026

A systematic reasoning MCP server implementation for Claude Desktop with beam search and thought evaluation.

TypeScript 277 33 Updated Jan 8, 2025

EXO Gym is an open-source Python toolkit that facilitates distributed AI research.

Python 103 21 Updated Dec 1, 2025

Private Web Search for Local LLMs

Rust 174 29 Updated Feb 27, 2025

Inference Llama models in one file of pure C for Windows 98 running on 25-year-old hardware

C 341 37 Updated Dec 28, 2024

AI wearables. Put it on, speak, transcribe, automatically

Dart 7,867 1,437 Updated Mar 26, 2026

Sidecar is the AI brains for the Aide editor and works alongside it, locally on your machine

Rust 601 99 Updated May 14, 2025

The open-source AI-native IDE

TypeScript 2,200 345 Updated Feb 25, 2025

The Modular Platform (includes MAX & Mojo)

Mojo 25,807 2,784 Updated Mar 26, 2026

Official inference framework for 1-bit LLMs

Python 36,704 3,182 Updated Mar 10, 2026

MLX: An array framework for Apple silicon

C++ 24,795 1,605 Updated Mar 25, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 74,401 14,805 Updated Mar 26, 2026

convert images, video to ascii!

Zig 516 23 Updated Sep 2, 2025

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.

C++ 4,450 416 Updated Mar 26, 2026

Galadriel TEE oracle configuration and verification code [Deprecated]

Python 16 6 Updated Oct 10, 2024

Generate accurate transcripts using Apple's MLX framework

Python 454 46 Updated Apr 26, 2025

Distributed Inference for mlx LLm

Python 100 10 Updated Aug 1, 2024

Run frontier AI locally.

Python 43,014 2,969 Updated Mar 26, 2026

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 32,001 4,006 Updated Mar 26, 2026

Action to enable running Vulkan apps on GitHub runners

C++ 29 7 Updated Dec 3, 2025

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 15,253 1,287 Updated May 23, 2024

Text-To-Speech, RAG, and LLMs. All local!

JavaScript 1,899 110 Updated Dec 9, 2024
Next