Skip to content
View mhomidi's full-sized avatar
💔
💔
  • Vancouver, Canada
  • 06:27 (UTC -07:00)

Block or report mhomidi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Offline optimization of your disaggregated Dynamo graph

Python 255 96 Updated Apr 10, 2026

Buddy-alloc is a memory allocator for no-std Rust, used for embedded environments.

Rust 32 6 Updated Nov 8, 2024

Juice Community Version Public Release

Go 638 62 Updated May 8, 2025

Community maintained hardware plugin for vLLM on Ascend

Python 1,904 1,047 Updated Apr 11, 2026

Tensor Fusion is a state-of-the-art GPU virtualization and pooling solution designed to optimize GPU cluster utilization to its fullest potential.

Go 139 30 Updated Apr 12, 2026

Rust library for concurrent data access, using memory-mapped files, zero-copy deserialization, and wait-free synchronization.

Rust 620 45 Updated Apr 9, 2026

Enforce the output format (JSON Schema, Regex etc) of a language model

Python 2,003 89 Updated Apr 4, 2026

A Virtual Machine Monitor for modern Cloud workloads. Features include CPU, memory and device hotplug, support for running Windows and Linux guests, device offload with vhost-user and a minimal com…

Rust 5,488 621 Updated Apr 12, 2026

Spin-based synchronization primitives

Rust 588 107 Updated Mar 24, 2026

A highly configurable logging framework for Rust

Rust 1,129 165 Updated Nov 16, 2025

LLM inference in C/C++

C++ 103,253 16,737 Updated Apr 12, 2026

Official inference framework for 1-bit LLMs

Python 38,146 3,413 Updated Mar 10, 2026

Home of OpenVMM and OpenHCL

Rust 1,825 183 Updated Apr 11, 2026

Simple, safe way to store and distribute tensors

Python 3,701 307 Updated Apr 2, 2026

Serverless LLM Serving for Everyone.

Python 671 70 Updated Mar 6, 2026

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

C++ 10,065 3,175 Updated Apr 12, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,684 5,297 Updated Apr 12, 2026

QJL: 1-Bit Quantized JL transform for KV Cache Quantization with Zero Overhead

Python 91 16 Updated Jan 27, 2025

Abseil Common Libraries (C++)

C++ 17,189 3,001 Updated Apr 10, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 76,229 15,473 Updated Apr 12, 2026

A secure container runtime with CRI/OCI interface

Rust 362 54 Updated Jan 8, 2026

The Go programming language

Go 133,425 18,914 Updated Apr 11, 2026

Application Kernel for Containers

Go 18,077 1,565 Updated Apr 11, 2026

An open and reliable container runtime

Go 20,570 3,874 Updated Apr 11, 2026

Writing an OS in Rust

HTML 17,398 1,204 Updated Apr 9, 2026

A book-in-progress about the Linux kernel and its insides.

Python 32,469 3,520 Updated Apr 12, 2026

A hyperparameter optimization framework

Python 13,922 1,302 Updated Apr 8, 2026

Linux kernel source tree

C 228,185 61,545 Updated Apr 12, 2026

`std::execution`, the proposed C++ framework for asynchronous and parallel programming.

C++ 2,298 236 Updated Apr 12, 2026

Efficient RPCs for datacenter networks

C++ 901 152 Updated May 9, 2024
Next