Skip to content
View poussa's full-sized avatar

Organizations

@opea-project

Block or report poussa

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Intel GPU Base operator allows automatic deployment of GPU related components to enable use of Intel GPU hardware within the Kubernetes cluster.

Go 1 1 Updated Jun 18, 2026

Intel Network Operator allows automatic configuring and easier use of RDMA NICs with Intel AI accelerators in Kubernetes.

Go 6 4 Updated May 29, 2026

Community maintained hardware plugin for vLLM on Intel Gaudi

Python 42 139 Updated Jun 18, 2026

helm charts for deploying models with llm-d

Go Template 31 63 Updated Jun 11, 2026

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 3,403 533 Updated Jun 19, 2026

llm-d benchmark scripts and tooling

Python 62 94 Updated Jun 18, 2026

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

Python 1,284 171 Updated Jun 18, 2026

llm-d helm charts and deployment examples

Go Template 58 57 Updated May 1, 2026

Gateway API Inference Extension

Jupyter Notebook 694 293 Updated Jun 17, 2026

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

Python 2,662 309 Updated Jun 18, 2026

AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.

Go 1,210 125 Updated Jun 19, 2026

Intel® AI for Enterprise RAG converts enterprise data into actionable insights with excellent TCO. Utilizing Intel Gaudi AI accelerators and Intel Xeon processors ensuring streamlined deployment.

Python 61 28 Updated Jun 18, 2026

Workload Services Framework (WSF) is a benchmarking framework on Intel(R) Xeon(R) Platforms

Shell 59 55 Updated Jun 17, 2026

Terraform provider for Keycloak

Go 932 427 Updated Jun 18, 2026

GenAI Studio is a low code platform to enable users to construct, evaluate, and benchmark GenAI applications. The platform also provide capability to export developed application as a ready-to-depl…

JavaScript 63 27 Updated Jun 4, 2026

Containerization and cloud native suite for OPEA

Go 73 98 Updated Apr 6, 2026

A repository that deploys Coder OSS entirely from TF

HCL 179 52 Updated Feb 1, 2023

AWS EKS - kubernetes project and terraform module

HCL 337 170 Updated Feb 27, 2026

Karpenter is a Kubernetes Node Autoscaler built for flexibility, performance, and simplicity.

Go 1,957 500 Updated Jun 18, 2026

Collection of Intel device plugins for Kubernetes

Go 138 218 Updated Jun 17, 2026

EKS Node Viewer

Go 1,632 147 Updated Jun 15, 2026

A collection of community maintained NRI plugins

Go 108 41 Updated Jun 18, 2026

An End-to-End Distributed and Scalable Cloud KMS (Key Management System) built on top of Intel SGX enclave-based HSM (Hardware Security Module), aka eHSM.

C++ 168 54 Updated Jul 25, 2024
Go Template 24 33 Updated Jun 18, 2026

Node Resource Interface

Go 390 96 Updated Jun 5, 2026

Production-Grade Container Scheduling and Management

Go 123,114 43,242 Updated Jun 19, 2026

This repo follows the SDS extension standard of Envoy and implements an external SDS server via more secure solution which is known as Hardware Security Module(HSM). By using this repo, User can m…

Go 6 6 Updated Apr 2, 2024

Intel QuickAssist Technology( QAT) OpenSSL Engine (an OpenSSL Plug-In Engine) which provides cryptographic acceleration for both hardware and optimized software using Intel QuickAssist Technology e…

C 440 137 Updated Apr 10, 2026
Next