Skip to content
View arhsis's full-sized avatar
  • 02:53 (UTC -12:00)

Highlights

  • Pro

Block or report arhsis

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[HPCA'26] Towards Resource-Efficient Serverless LLM Inference with SLINFER

Python 2 2 Updated Mar 23, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,242 289 Updated Jun 18, 2026

Foundry materializes CUDA graphs along with its execution context to disk to support fast cold start of serving engines.

C++ 36 4 Updated Jun 15, 2026

A framework for generating realistic LLM serving workloads

Python 153 14 Updated May 11, 2026

Winner 🏆 (Agent-only) MLSys 2026 - FlashInfer AI Kernel Generation Contest for the DeepSeek Sparse Attention (DSA) track with an average speedup of 34.93x

Python 135 10 Updated Jun 10, 2026

Step-by-step GEMM optimization, one hardware feature at a time. High performance CuTeDSL kernels for H100, B200 and RTX 50s GPUs. This repo also includes my CuTeDSL solution to MLSys26 kernel compe…

Python 11 4 Updated May 14, 2026

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

Python 67,981 5,728 Updated Jun 18, 2026

tmux source code

C 46,680 2,696 Updated Jun 18, 2026

Book in preparation: introduction to theoretical computer science

TeX 1,038 204 Updated Mar 18, 2024

Lightweight agent multiplexer, all in one Web dashboard

Python 32 3 Updated Jun 2, 2026

DeepSeek-native AI coding agent for your terminal. Engineered around prefix-cache stability — leave it running.

Go 23,085 1,384 Updated Jun 18, 2026

OpenURMA: A Clean-Room Open Implementation of the Unified Bus Protocol

C++ 25 Updated Jun 12, 2026

Google's open source distributed agent runtime

Go 1,712 93 Updated Jun 18, 2026

AI agent toolkit: unified LLM API, agent loop, TUI, coding agent CLI

TypeScript 63,804 7,752 Updated Jun 18, 2026

From Automated Idea Factory to Realization

Shell 1,145 94 Updated Jun 13, 2026

Apache OpenDAL: One Layer, All Storage.

Rust 5,173 771 Updated Jun 18, 2026

AI-powered animated comic generator — transform scripts into fully animated videos with AI-driven character design, storyboarding, and video synthesis.

TypeScript 1,567 271 Updated Apr 27, 2026

Academic Research Skills for Claude Code: research → write → review → revise → finalize

Python 32,671 2,679 Updated Jun 18, 2026

RISC-V IOMMU Specification

C 164 33 Updated Jun 8, 2026

Lightweight coding agent that runs in your terminal

Rust 91,974 13,594 Updated Jun 18, 2026

Skills for Real Engineers. Straight from my .claude directory.

Shell 134,924 11,686 Updated Jun 18, 2026

A Rust crate for cooking up terminal user interfaces (TUIs) 👨‍🍳🐀 https://ratatui.rs

Rust 21,120 691 Updated Jun 18, 2026

An open-source CLI to manage your DJI Osmo device via BLE and without DJI MIMO

Go 21 4 Updated Jan 11, 2026

Engine-Agnostic Model Hot-Swapping for Cost-Effective LLM Inference

Go 11 1 Updated Nov 13, 2025

code repo for GCR [FAST'26]

C++ 15 3 Updated Mar 3, 2026

Aggregated File System (AGFS), a modern tribute to the spirit of Plan 9

Go 398 37 Updated Jun 18, 2026
Python 8 Updated Apr 10, 2026
Python 18 9 Updated May 28, 2024

Manage filesystem snapshots and allow undo of system modifications

C++ 1,128 152 Updated Jun 18, 2026
Next