Skip to content
View pontostroy's full-sized avatar

Block or report pontostroy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Local-first MCP server that gives coding agents structured context packets, code/schema facts, and diagnostics - backed by a local SQLite store.

TypeScript 33 5 Updated Apr 30, 2026

Real-time network diagnostics in your terminal. One command, zero config, instant visibility.

Rust 1,482 49 Updated Apr 29, 2026

high-performance linear attention kernel library built on TileLang

Python 354 25 Updated Apr 30, 2026

comfyui optimizations for the dgx spark

Shell 3 1 Updated Apr 28, 2026

Ansible playbooks for a 3-node K3s cluster with NVIDIA DGX Spark nodes for distributed LLM inference

Python 1 Updated Apr 30, 2026

ComfyUI-QwenVL custom node: Integrates the Qwen-VL series, including Qwen2.5-VL and the latest Qwen3-VL, with GGUF support for advanced multimodal AI in text generation, image understanding, and vi…

Python 745 110 Updated Feb 10, 2026

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 2 1 Updated Apr 26, 2026

Cardamon is a cleanup tool for Prometheus that collects unused metrics from Grafana and Prometheus and generates drop statements for them.

HTML 93 Updated Apr 29, 2026

A lightweight schema-on-read analytics in a single binary

Go 95 2 Updated Apr 27, 2026

The only tool you need to know what is happening and how to fix it.

Go 95 8 Updated Apr 29, 2026
TypeScript 50 5 Updated Apr 25, 2026
Python 4 Updated Mar 25, 2026

STREAM benchmark

C 495 175 Updated Feb 17, 2025

Single-file web UI for NVIDIA DGX Spark — pull Ollama models, browse and download from HuggingFace, manage LiteLLM routing, and control SGLang, vLLM, llama.cpp, LocalAI, and ComfyUI. All from one b…

Python 11 2 Updated Apr 24, 2026

A coding agent optimized to smaller LLMs

TypeScript 836 56 Updated Apr 28, 2026

Export helm stats into the Prometheus format

Go 282 73 Updated Apr 30, 2026

Open WebUI Desktop 🌐

Svelte 1,621 142 Updated Apr 29, 2026

Linux-native "fake root" for implementing rootless containers

Go 1,253 116 Updated Apr 30, 2026

Self-hosted, semantically-connected personal knowledge base

Rust 1,376 94 Updated Apr 30, 2026

Lightweight OpenTelemetry viewer for local development. View traces, logs & metrics instantly.

TypeScript 79 11 Updated Apr 29, 2026

Files for youtube's videos

Mustache 420 208 Updated Mar 19, 2026

Tool-calling quality benchmark for LLM serving stacks. 65+ deterministic scenarios testing multi-turn orchestration, safety boundaries, and structured output. Supports vLLM, LiteLLM, and llama.cpp.

Python 33 5 Updated Apr 30, 2026

Port Kill helps you find and free ports and caches blocking your dev work.

Rust 2,016 54 Updated Apr 3, 2026

The Helm chart you can use to install any of your applications into Kubernetes/OpenShift

Go Template 562 88 Updated Apr 27, 2026

Turn any NVIDIA GPU into a local AI platform. Inference + fine-tuning in your browser. One command to start, automatic clustering.

Python 9 3 Updated Apr 25, 2026

High-performance uncensored Gemma-4-26B inference on NVIDIA DGX Spark using vLLM - 45+ tok/s

Python 7 1 Updated Apr 27, 2026

The Prometheus monitoring system and time series database.

Go 63,850 10,370 Updated Apr 29, 2026

nono - a capability-based, multiplexing sandbox tool, built for developers - lift'n'shift seamless path to prod. Run agents securely without needing any additional infra, zero setup, zero latency.

Rust 2,181 155 Updated Apr 30, 2026

A terminal-based system monitor (TUI) optimized for NVIDIA Grace Blackwell (GB10) and hybrid CPU architectures.

Python 7 Updated Apr 19, 2026

Check your Kubernetes changes before they hit the cluster

Go 593 42 Updated Apr 27, 2026
Next