nerdalert
🐈
🦀 🐿

Sponsoring

@Homebrew

Organizations

@openshift @opendatahub-io @redhat-et @nexodus-io @llm-d @praxis-proxy

Rust 2 Updated Jun 8, 2026

Hardware testing for the software world. Real or virtual, local or remote, human, automated or agentic.

Python 184 31 Updated Jun 13, 2026

AI and cloud-native proxy server and framework

Rust 44 33 Updated Jun 13, 2026

MCP server for troubleshooting vLLM inference workloads on Red Hat OpenShift AI — queries Prometheus, Alertmanager, Loki, Grafana, and Kubernetes from AI assistants.

Python 4 2 Updated May 19, 2026

Model as a Service

Go 28 77 Updated Jun 12, 2026

llm-d helm charts and deployment examples

Go Template 58 57 Updated May 1, 2026

Demo integrating Kuadrant with llm-d

Go 4 Updated Jul 10, 2025

A simple GPU reservation tool for single host shared development systems

Go 27 7 Updated Jun 8, 2026

Extract SRT subtitles with timestamps from a video file with the Whisper voice model

Python 1 Updated May 28, 2025

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 3,350 528 Updated Jun 13, 2026

Helm charts for llm-d

Shell 52 56 Updated Jul 22, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 82,790 18,035 Updated Jun 14, 2026

Fun with benchmarks

Python 5 2 Updated Apr 23, 2025

UI Component for Chatbot

TypeScript 3 14 Updated Feb 4, 2025

Get your documents ready for gen AI

Python 61,506 4,301 Updated Jun 13, 2026

Running Docling as an API service

Python 1,601 309 Updated Jun 12, 2026

Place to hack on UI for InstructLab

TypeScript 38 57 Updated Feb 11, 2026

On-demand self-hosted AWS EC2 runner for GitHub Actions

JavaScript 850 386 Updated May 8, 2026

Interact with the Deep Search platform for new knowledge explorations and discoveries

Python 226 32 Updated Jan 24, 2025

Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.

C++ 59 7 Updated Jan 27, 2025

InstructLab community-wide collaboration space, including contributing, security, code of conduct, etc.

Python 94 50 Updated Feb 11, 2026

InstructLab Core package. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy data.

Python 1,417 455 Updated Mar 30, 2026

Taxonomy tree that will allow you to create models tuned with your data

Python 298 1,254 Updated Sep 8, 2025

GitHub bot to assist with the taxonomy contribution workflow

Go 17 18 Updated Nov 4, 2024
Swift 1 Updated Feb 21, 2024

Alfred workflow JWT decoder

Python 6 Updated Mar 14, 2026

Mesh network using QUIC CONNECT-IP tunnels

Go 4 3 Updated Sep 26, 2023

Simple example of a QUIC client/server

Go 1 1 Updated Jan 17, 2023
Swift 1 1 Updated Feb 21, 2024