Skip to content
View njhill's full-sized avatar

Organizations

@netty @kserve @vllm-project @llm-d @Inferact

Block or report njhill

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Early-stage Rust drop-in alternative frontend for vLLM

Rust 69 9 Updated May 22, 2026

Tools for Python coroutines and advanced scheduling for `asyncio`

Python 19 1 Updated Dec 29, 2025

TPU inference for vLLM, with unified JAX and PyTorch support.

Python 351 213 Updated Jun 15, 2026

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 3,362 529 Updated Jun 15, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 82,944 18,086 Updated Jun 15, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 161,603 33,514 Updated Jun 15, 2026

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 10,819 1,125 Updated Jun 15, 2026

High-performance netty and thrift-based microservice RPC library for Java

Java 4 4 Updated Sep 17, 2025

Alternative etcd3 java client

Java 163 42 Updated Sep 17, 2025

Distributed Model Serving Framework

Java 188 79 Updated Apr 14, 2026

Controller for ModelMesh

Go 243 135 Updated Apr 14, 2026

Abstracted helper classes providing consistent key-value store functionality, with zookeeper and etcd3 implementations

Java 6 3 Updated Sep 17, 2025

Fake XRandR configurations for multi-head setups with crappy video drivers, like fakexinerama but with xrandr

Python 274 38 Updated Apr 29, 2024

Java utilities for working with CompletionStages

Java 60 13 Updated Jan 17, 2019
Java 3,851 591 Updated Jun 15, 2026

Netty project - an event-driven asynchronous network application framework

Java 34,977 16,245 Updated Jun 15, 2026

The Java gRPC implementation. HTTP/2 based RPC

Java 12,031 3,990 Updated Jun 12, 2026