An instrumentation tool to monitor queue depths in tokio channels

Rust 10 Updated Oct 29, 2025

DDGS | Dux Distributed Global Search. A metasearch library that aggregates results from diverse web search services

Python 1,916 184 Updated Nov 5, 2025

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 606 52 Updated Nov 4, 2025

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,340 479 Updated Nov 4, 2025

[OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable

Python 190 9 Updated Sep 21, 2024

[ASPLOS'25] Towards End-to-End Optimization of LLM-based Applications with Ayo

Python 48 5 Updated Aug 5, 2025

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

TypeScript 41,431 2,710 Updated Nov 4, 2025

Tempo is a system for declarative, efficient, end-to-end compiled dynamic deep learning

Python 21 2 Updated Oct 21, 2025

Large Language Model (LLM) Systems Paper List

1,580 86 Updated Nov 4, 2025

Lightweight coding agent that runs in your terminal

Rust 49,834 6,152 Updated Nov 5, 2025

Analyze computation-communication overlap in V3/R1.

1,112 143 Updated Mar 21, 2025

Replace 'hub' with 'ingest' in any GitHub URL to get a prompt-friendly extract of a codebase

Python 13,015 976 Updated Nov 5, 2025

A resilient distributed training framework

Python 96 9 Updated Apr 11, 2024

Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild

Zig 2,847 111 Updated Nov 4, 2025

Watches files and records, or triggers actions, when they change.

C++ 13,336 1,039 Updated Nov 4, 2025

Dynamic resource changes for multi-dimensional parallelism training

Go 29 3 Updated Aug 22, 2025

Fully open reproduction of DeepSeek-R1

Python 25,613 2,401 Updated Sep 8, 2025

Golang bindings for Nvidia Datacenter GPU Manager (DCGM)

C 138 39 Updated Oct 24, 2025

NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs

C++ 609 73 Updated Oct 14, 2025

Recipes to scale inference-time compute of open models

Python 1,115 125 Updated May 22, 2025

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

11,570 1,893 Updated Aug 31, 2023

Use your Neovim like using Cursor AI IDE!

Lua 16,366 746 Updated Nov 4, 2025

A low-latency & high-throughput serving engine for LLMs

Python 436 58 Updated Oct 16, 2025

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,224 376 Updated Sep 11, 2025

Minimal, single page, smooth-scrolling theme for Hugo static site generator.

HTML 708 273 Updated Jan 30, 2025

Microsoft Azure Traces

Jupyter Notebook 1,019 167 Updated Oct 20, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,208 51 Updated Nov 16, 2024

Official inference library for Mistral models

Jupyter Notebook 10,531 981 Updated Mar 20, 2025

📺 Discover the latest machine learning / AI courses on YouTube.

17,006 2,076 Updated Jan 22, 2024