Stars
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Multi-agent OpenCode plugin for automated academic illustration generation
CLI/GUI for managing the battery charging status for Apple silicon (M1, M2, M3) Macs
Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
2026 AI/ML internship & new graduate job list updated daily
A high-throughput and memory-efficient inference and serving engine for LLMs
Achieve state of the art inference performance with modern accelerators on Kubernetes
An instrumentation tool to monitor queue depths in tokio channels
A metasearch library that aggregates results from diverse web search services
Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond
Cost-efficient and pluggable Infrastructure components for GenAI inference
[OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable
[ASPLOS'25] Towards End-to-End Optimization of LLM-based Applications with Ayo
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
Tempo is a system for declarative, efficient, end-to-end compiled dynamic deep learning
Large Language Model (LLM) Systems Paper List
Lightweight coding agent that runs in your terminal
Analyze computation-communication overlap in V3/R1.
Replace 'hub' with 'ingest' in any GitHub URL to get a prompt-friendly extract of a codebase
Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild
Watches files and records, or triggers actions, when they change.
Dynamic resources changes for multi-dimensional parallelism training
Fully open reproduction of DeepSeek-R1
Golang bindings for Nvidia Datacenter GPU Manager (DCGM)
NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs