Starred repositories
Your Personal AI Assistant; easy to install, deploy on your own machine or on the cloud; supports multiple chat apps with easily extensible capabilities.
Python bindings for access to the on-device model at the core of Apple Intelligence through the Foundation Models framework
The batteries-included agent harness.
Claude support for Apple Foundation Models
Open-source AI coworker, with memory
TurboQuant: Near-optimal KV cache quantization for LLM inference (3-bit keys, 2-bit values) with Triton kernels + vLLM integration
LMCache: Supercharge Your LLM with the Fastest KV Cache Layer
MegaDetector is an AI model that helps conservation folks spend less time doing boring things with camera trap images.
AI models trained by Google to classify species in images from motion-triggered wildlife cameras.
Gemma open-weight LLM library, from Google DeepMind
A complete AI agency at your fingertips - From frontend wizards to Reddit community ninjas, from whimsy injectors to reality checkers. Each agent is a specialized expert with personality, processes…
A tool for creating and running Linux containers using lightweight virtual machines on a Mac. It is written in Swift, and optimized for Apple silicon.
AI Edge Quantizer: flexible post training quantization for LiteRT models.
Agent skills for Qdrant vector search: scaling, performance optimization, search quality, monitoring, deployment, model migration, version upgrades, and SDK usage across Python, TypeScript, Rust, G…
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
AI-generated apps, full stack + generative UI
A set of beautifully-designed, accessible components and a code distribution platform. Works with your favorite frameworks. Open Source. Open Code.
Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our discord: https://discord.gg/5xXzkMu8Zk
AtomicBot-ai / Atomic-Chat
Forked from janhq/jan
Local AI app and inference engine for agents. Run open-weight LLMs locally: private, 100% offline on your computer.
A lightweight sandboxing tool for enforcing filesystem and network restrictions on arbitrary processes at the OS level, without requiring a container.
Persistent Context Across Sessions for Every Agent – Captures everything your agent does during sessions, compresses it with AI, and injects relevant context back into future sessions. Works with C…
A benchmark built to evaluate and improve agent capabilities for supporting legal work.
A library for PyTorch model compression and optimizations for deployment via Core AI on Apple silicon.