Stars
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels
Run Claude Code 100% on-device with local AI on Apple Silicon. MLX-native Anthropic-API server, 65 tok/s Qwen 3.5 122B, Llama 3.3 70B, Gemma 4 31B. Private, offline, airgap-ready. Built for NDA / l…
Learn it. Build it. Ship it for others.
VIP cheatsheet for Stanford's CME 295 Transformers and Large Language Models
Anant08 / stanford-cme-295-transformers-large-language-models
Forked from afshinea/stanford-cme-295-transformers-large-language-models
VIP cheatsheet for Stanford's CME 295 Transformers and Large Language Models
[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
VS Code rebuilt on Tauri. Same architecture, 96% smaller. Early release.
Anant08 / claw-code
Forked from ultraworkers/claw-code
The fastest repo in history to surpass 50K stars ⭐, reaching the milestone in just 2 hours after publication. Better Harness Tools, not merely storing the archive of leaked Claude Code but make rea…
The repo is finally unlocked. Enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.
From a goal to a task DAG, automatically. TypeScript-native multi-agent orchestration with MCP and live tracing. Three runtime dependencies.
Claude Code source code
Fast, accurate & comprehensive text measurement & layout
Anant08 / txtai
Forked from neuml/txtai
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
Self-hosted AI accounting app. LLM analyzer for receipts, invoices, transactions with custom prompts and categories
Create and share 3D architectural projects.
Anant08 / flash-moe
Forked from danveloper/flash-moe
Running a big model on a small laptop
Fully local web research and report writing assistant
Zero-dependency GPT in pure Rust — a faithful port of Karpathy's microGPT.py that trains ~4,500x faster.
AirLLM 70B inference with a single 4GB GPU
Smart Model Routing for Agents. Cut Costs up to 70% 🦚
Open-Source Machine Translation Quality Estimation in PyTorch
A high-performance, OpenTelemetry-compliant tracing SDK designed specifically for Large Language Model (LLM) applications and AI workloads.
Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes