Skip to content
View njhill's full-sized avatar

Organizations

@netty @kserve @vllm-project @llm-d @Inferact

Block or report njhill

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Tools for Python coroutines and advanced scheduling for `asyncio`

Python 18 1 Updated Dec 29, 2025

TPU inference for vLLM, with unified JAX and PyTorch support.

Python 235 101 Updated Feb 16, 2026

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,494 319 Updated Feb 16, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 70,410 13,478 Updated Feb 16, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 156,540 32,090 Updated Feb 16, 2026

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 10,475 1,037 Updated Feb 11, 2026

High-performance netty and thrift-based microservice RPC library for Java

Java 4 4 Updated Sep 17, 2025

Alternative etcd3 java client

Java 162 42 Updated Sep 17, 2025

Distributed Model Serving Framework

Java 185 79 Updated Sep 30, 2025

Controller for ModelMesh

Go 242 134 Updated Jun 10, 2025

Abstracted helper classes providing consistent key-value store functionality, with zookeeper and etcd3 implementations

Java 5 2 Updated Sep 17, 2025

Fake XRandR configurations for multi-head setups with crappy video drivers, like fakexinerama but with xrandr

Python 274 38 Updated Apr 29, 2024

Java utilities for working with CompletionStages

Java 60 13 Updated Jan 17, 2019
Java 3,804 582 Updated Feb 8, 2026

Netty project - an event-driven asynchronous network application framework

Java 34,803 16,285 Updated Feb 16, 2026

The Java gRPC implementation. HTTP/2 based RPC

Java 11,981 3,969 Updated Feb 12, 2026