Skip to content
View hexfusion's full-sized avatar
🐀
scampering
🐀
scampering

Block or report hexfusion

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Go 132 78 Updated Jun 18, 2026

A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

Python 520 105 Updated Jun 18, 2026

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 3,394 531 Updated Jun 18, 2026

Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes

Go 5,582 1,532 Updated Jun 17, 2026

Low-overhead Kubernetes informer for sidecar controllers that don't need the full object

Go 8 Updated Apr 27, 2026

llm-d benchmark scripts and tooling

Python 62 94 Updated Jun 18, 2026

Concurrent ART (adaptive radix tree)

Rust 182 20 Updated Sep 26, 2025

llm-d Router: The intelligent entry point for inference requests

Go 222 240 Updated Jun 17, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 59,803 10,320 Updated Nov 12, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 83,215 18,172 Updated Jun 18, 2026

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Go 4,422 714 Updated Jun 18, 2026

A lightweight, configurable, and real-time simulator designed to mimic the behavior of vLLM without the need for GPUs or running actual heavy models.

Go 149 98 Updated Jun 17, 2026

Gateway API Inference Extension

Jupyter Notebook 694 292 Updated Jun 17, 2026
Python 3 Updated Apr 21, 2026

Pull-through caching proxy with resumable downloads for OCI images

Go 1 1 Updated Jan 18, 2026

Collective communications library with various primitives for multi-machine training.

C++ 1,430 359 Updated Jun 17, 2026

A collection of RPMs

HTML 1 2 Updated Jun 11, 2026

Red Hat Device Edge image construction

Shell 3 2 Updated Jan 16, 2026

a script to run docker-compose.yml using podman

Python 6,116 601 Updated Jun 14, 2026

QMK TrackBall with 3 Switches (TB3S)

C 158 8 Updated Nov 17, 2024
TypeScript 28 27 Updated Jun 15, 2026

GitHub Action self-hosted runner images for OpenShift.

Shell 50 52 Updated Aug 15, 2023

Low-level unprivileged sandboxing tool used by Flatpak and similar projects

C 7,648 349 Updated Jun 2, 2026

Podman: A tool for managing OCI containers and pods.

Go 32,049 3,144 Updated Jun 18, 2026
Go 659 182 Updated Jun 2, 2026

demos asible collection

Python 2 1 Updated May 29, 2025

Generic Control plane for creating Kubernetes like APIs

Go 37 7 Updated Apr 9, 2026

Random 3D models

Python 64 5 Updated Jun 8, 2026

FABoulous bootc build system

Python 2 3 Updated Aug 24, 2025
Next