Skip to content
View nerdalert's full-sized avatar
🐈
🦀 🐿
🐈
🦀 🐿

Organizations

@openshift @opendatahub-io @redhat-et @neuralmagic @nexodus-io @llm-d @praxis-proxy

Block or report nerdalert

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AI and cloud-native proxy server and framework

Rust 23 19 Updated May 15, 2026

MCP server for troubleshooting vLLM inference workloads on Red Hat OpenShift AI — queries Prometheus, Alertmanager, Loki, Grafana, and Kubernetes from AI assistants.

Python 4 2 Updated May 5, 2026

Model as a Service

Go 24 73 Updated May 17, 2026

llm-d helm charts and deployment examples

Go Template 57 56 Updated May 1, 2026

Demo integrating Kuadrant with llm-d

Go 4 Updated Jul 10, 2025

A simple GPU reservation tool for single host shared development systems

Go 26 7 Updated May 14, 2026

Extract SRT subtitles with timestamps from a video file with the Whisper voice model

Python 1 Updated May 28, 2025

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 3,199 477 Updated May 16, 2026

Helm charts for llm-d

Shell 52 57 Updated Jul 22, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 80,293 16,887 Updated May 18, 2026

Fun with benchmarks

Python 5 2 Updated Apr 23, 2025

UI Component for Chatbot

TypeScript 3 14 Updated Feb 4, 2025

Get your documents ready for gen AI

Python 59,895 4,151 Updated May 17, 2026

Running Docling as an API service

Python 1,524 302 Updated May 12, 2026

Place to hack on UI for InstructLab

TypeScript 37 57 Updated Feb 11, 2026

On-demand self-hosted AWS EC2 runner for GitHub Actions

JavaScript 849 387 Updated May 8, 2026

Interact with the Deep Search platform for new knowledge explorations and discoveries

Python 227 32 Updated Jan 24, 2025

Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.

C++ 58 7 Updated Jan 27, 2025

InstructLab Community wide collaboration space including contributing, security, code of conduct, etc

Python 94 50 Updated Feb 11, 2026

InstructLab Core package. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy data.

Python 1,416 455 Updated Mar 30, 2026

Taxonomy tree that will allow you to create models tuned with your data

Python 297 1,258 Updated Sep 8, 2025

GitHub bot to assist with the taxonomy contribution workflow

Go 17 18 Updated Nov 4, 2024
Swift 1 Updated Feb 21, 2024

alfred workflow jwt decoder

Python 6 Updated Mar 14, 2026

Mesh network using QUIC Connect-Ip Tunnels

Go 4 3 Updated Sep 26, 2023

Simple example of a Quic Client/Server

Go 1 1 Updated Jan 17, 2023
Swift 1 1 Updated Feb 21, 2024

LLMs for Project Nexodus documentation question-answering

Jupyter Notebook 3 Updated Aug 8, 2023

POC using OpenAI with chatbot docs

Python 2 1 Updated Jun 8, 2023
Next