Skip to content
View Hadigan's full-sized avatar

Highlights

  • Pro

Block or report Hadigan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems

Python 108 11 Updated Nov 3, 2025

My learning notes for ML SYS.

Python 4,783 306 Updated Dec 22, 2025

CUDA Kernel Benchmarking Library

Cuda 786 97 Updated Dec 10, 2025
MLIR 4 2 Updated Nov 25, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,721 1,358 Updated Dec 17, 2025

Artifact for "Marconi: Prefix Caching for the Era of Hybrid LLMs" [MLSys '25 Outstanding Paper Award, Honorable Mention]

Python 47 3 Updated Mar 5, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,543 979 Updated Dec 13, 2025

很多镜像都在国外。比如 gcr 。国内下载很慢,需要加速。致力于提供连接全世界的稳定可靠安全的容器镜像服务。

Shell 12,664 1,406 Updated Dec 23, 2025

2025年12月更新,目前国内可用Docker镜像源汇总,DockerHub国内镜像加速列表,🚀DockerHub镜像加速器

6,921 329 Updated Dec 16, 2025

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python 2,166 244 Updated Dec 24, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 6,423 814 Updated Dec 24, 2025

LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale

Python 165 29 Updated Jul 18, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,084 12,160 Updated Dec 24, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,471 479 Updated Dec 24, 2025

NVIDIA Linux open GPU kernel module source

C 16,518 1,549 Updated Dec 18, 2025

Infiniband Verbs Performance Tests

C 889 366 Updated Dec 14, 2025

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C++ 1,300 180 Updated Dec 17, 2025

Display and control your Android device

C 132,987 12,422 Updated Dec 22, 2025

Redpill Recovery (arpl-i18n)

Shell 7,730 1,235 Updated Dec 16, 2025

High performance self-hosted photo and video management solution.

TypeScript 87,275 4,601 Updated Dec 24, 2025
Python 21 11 Updated Jun 4, 2023

LiteIO is a cloud-native block device service that uses multiple storage engines, including SPDK and LVM, to achieve high performance. It is specifically designed for Kubernetes in a hyper-converge…

Go 318 54 Updated Feb 6, 2024

Development repository for Fetch Directed Instruction Prefetching (FDP) in gem5

C++ 29 5 Updated Dec 21, 2025

Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages.

C++ 53 16 Updated Jun 23, 2025

The DaCapo benchmark suite

Java 192 66 Updated Aug 11, 2025

BTB-X HPCA23 code

C++ 13 2 Updated Jan 6, 2023

An artifact for Berti: an Accurate and Timely Local-Delta Data Prefetcher

C++ 37 24 Updated Nov 9, 2022

This repository is meant to be a guide for building your own prefetcher for CPU caches and evaluating it, using ChampSim simulator

44 13 Updated Feb 2, 2022

JHipster is a development platform to quickly generate, develop, & deploy modern web applications & microservice architectures.

TypeScript 22,286 4,135 Updated Dec 24, 2025
Next