127 results for source starred repositories

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 166,294 26,314 Updated Feb 5, 2026

2026 AI/ML internship & new graduate job list updated daily

4,650 187 Updated Feb 5, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 69,567 13,207 Updated Feb 5, 2026

Deploy the SC2 system on Kubernetes.

Python 10 5 Updated May 7, 2025

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,450 307 Updated Feb 5, 2026

An instrumentation tool to monitor queue depths in tokio channels

Rust 11 Updated Oct 29, 2025

DDGS | Dux Distributed Global Search. A metasearch library that aggregates results from diverse web search services

Python 2,143 212 Updated Dec 19, 2025

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 772 85 Updated Jan 10, 2026

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,601 525 Updated Feb 5, 2026

[OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable

Python 209 11 Updated Sep 21, 2024

[ASPLOS'25] Towards End-to-End Optimization of LLM-based Applications with Ayo

Python 63 8 Updated Aug 5, 2025

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 64,291 4,885 Updated Feb 4, 2026

Tempo is a system for declarative, efficient, end-to-end compiled dynamic deep learning

Python 28 3 Updated Oct 21, 2025

Large Language Model (LLM) Systems Paper List

1,802 95 Updated Jan 30, 2026

Lightweight coding agent that runs in your terminal

Rust 59,050 7,703 Updated Feb 5, 2026

Analyze computation-communication overlap in V3/R1.

1,141 146 Updated Mar 21, 2025

Replace 'hub' with 'ingest' in any GitHub URL to get a prompt-friendly extract of a codebase

Python 13,859 1,037 Updated Feb 5, 2026

A resilient distributed training framework

Python 96 9 Updated Apr 11, 2024

Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild

Zig 3,129 116 Updated Feb 5, 2026

Watches files and records, or triggers actions, when they change.

C++ 13,491 1,050 Updated Feb 5, 2026

Dynamic resources changes for multi-dimensional parallelism training

Go 30 4 Updated Aug 22, 2025

Fully open reproduction of DeepSeek-R1

Python 25,857 2,413 Updated Nov 24, 2025

Golang bindings for Nvidia Datacenter GPU Manager (DCGM)

C 147 42 Updated Feb 2, 2026

NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs

C++ 658 82 Updated Dec 4, 2025

Recipes to scale inference-time compute of open models

Python 1,124 130 Updated May 22, 2025

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

12,365 1,998 Updated Aug 31, 2023

Use your Neovim like using Cursor AI IDE!

Lua 17,308 794 Updated Feb 3, 2026

A low-latency & high-throughput serving engine for LLMs

Python 471 61 Updated Jan 8, 2026

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,213 367 Updated Dec 30, 2025