gb10

Here are 54 public repositories matching this topic...

Avarok-Cybersecurity / atlas

Pure Rust Inference Engine

rust cuda transformers ssm mamba dgx openai-api llm-inference speculative-decoding gb10 nvfp4 dgx-spark

Updated Jun 13, 2026
Rust

eelbaz / dgx-spark-vllm-setup

One-command vLLM installation for NVIDIA DGX Spark with Blackwell GB10 GPUs (sm_121 architecture)

machine-learning ai deep-learning gpu cuda pytorch nvidia arm64 blackwell llm vllm llm-inference gb10 dgx-spark

Updated Oct 28, 2025
Shell

jdaln / dgx-spark-inference-stack

Serve the home! Inference stack for your NVIDIA DGX Spark, aka the Grace Blackwell AI supercomputer on your desk. Mostly vLLM-based for now and single-Spark. For the not-so-rich buddies. If you want latest/in-testing, look at the branches.

docker docker-compose cuda inference self-hosted llama model-serving mlops dgx generative-ai local-llm gb10 dgx-spark

Updated Jun 3, 2026
Shell

bjk110 / spark_vllm_docker

DGX Spark / GB10 vLLM Docker stack for large-model serving, presets, patches, and validation notes.

docker docker-compose cuda llm-serving vllm qwen deepseek gb10 dgx-spark

Updated Jun 11, 2026
Python

joeynyc / spark-doctor

Local diagnostic CLI for NVIDIA DGX Spark (GB10). Detects power caps, unified memory pressure, thermal risk, Docker/runtime issues, and validates vLLM/Ollama/llama.cpp/SGLang recipes.

cli nvidia diagnostics dgx llama-cpp vllm local-llm ollama sglang gb10 dgx-spark grace-blackwell nvidia-dgx-spark

Updated May 15, 2026
Python

seanGSISG / dgx-spark-sunshine-setup

Headless 4K remote desktop for the NVIDIA DGX Spark (GB10): one-command installer for Sunshine + Moonlight low-latency game streaming with NVENC hardware encoding, a software virtual display (no HDMI dummy plug), GDM autologin, and optional Tailscale.

Updated Jun 3, 2026
Shell

Entrpi / ds4-on-spark

antirez/ds4 (DwarfStar 4) on NVIDIA DGX Spark — install, benchmarks, and roofline analysis. Steady-state decode at ~95% of bandwidth ceiling; MTP and concurrency analyzed.

benchmark cuda inference moe llm gguf gb10 dgx-spark deepseek-v4-flash

Updated Jun 4, 2026
Shell

croll83 / llama.cpp-dgx

llama.cpp fork optimized for NVIDIA DGX Spark / GB10 (Blackwell, SM 12.1) — TurboQuant weights + KV, NVFP4, DFlash MTP

blackwell llama-cpp speculative-decoding gb10 nvfp4 dflash turboquant

Updated May 26, 2026
C++

parallelArchitect / sparkview

Operator-grade GPU monitor for NVIDIA GPUs with native GB10 / DGX Spark coherent UMA support — PSI pressure, clock detection, ConnectX-7 network layer

python monitoring gpu cuda tui nvidia psi unified-memory gb10 dgx-spark

Updated May 31, 2026
Python

calico88x / DGX-Model-Manager

Single-file web UI for NVIDIA DGX Spark — pull Ollama models, browse and download from HuggingFace, manage LiteLLM routing, and control SGLang, vLLM, llama.cpp, LocalAI, and ComfyUI. All from one browser tab.

web ai nvidia model-deployment fastapi ai-tools llm llm-tools gb10 dgx-spark dgxspark

Updated May 19, 2026
Python

getainode / ainode

Turn any NVIDIA GPU into a local AI platform. Inference + fine-tuning in your browser. One command to start, automatic clustering.

open-source gpu cuda inference self-hosted distributed nvidia fine-tuning ai-platform llm vllm local-ai gb10 dgx-spark grace-blackwell

Updated Apr 25, 2026
Python

scottgl9 / sglang-spark-gb10-optimizations

SGLang optimizations for NVIDIA Spark (GB10) — SM121 Grace Blackwell

optimization marlin sglang gb10

Updated Jun 12, 2026
Python

albond / DGX_Spark_Unsloth_Lossless_Speedup

7.67× LoRA / 8.35× Full FT speedup for Qwen3.5 (0.8B–27B) on NVIDIA DGX Spark — wall-clock parity with rented H100. Lossless within BF16. Three-command interactive wizard handles model picker, data validator, training, and merge.

cuda transformers pytorch nvidia triton lora fine-tuning peft multimodal blackwell qwen unsloth gb10 dgx-spark qwen3-5 sm121

Updated May 19, 2026
Python

Logos-Flux / optimized-CUDA-GB10

Optimized CUDA kernels for NVIDIA GB10 Blackwell (sm_121, DGX Spark). RMSNorm + GELU. First sm_121 kernel on HuggingFace Kernel Hub.

gpu cuda pytorch nvidia kernels gelu huggingface blackwell rmsnorm gb10 dgx-spark sm121

Updated May 3, 2026
Cuda

bidual / awesome-dgx-spark

A curated list of tools, guides, playbooks, and resources for the NVIDIA DGX Spark (GB10 Grace Blackwell personal AI supercomputer).

awesome nvidia awesome-list blackwell llm gb10 dgx-spark

Updated Jun 9, 2026
Shell

a1exus / sparky

NVIDIA DGX Spark workstation — self-hosted LLM stack (vLLM, llama.cpp, Ollama + Open WebUI) behind Traefik, with Cloudflare Tunnel + Tailscale ingress and Netdata observability.

Updated Jun 8, 2026
Makefile

parallelArchitect / spark-gpu-throttle-check

Enhanced GPU throttle diagnostic for DGX Spark (GB10): NVML direct telemetry, throttle cause decoder, PCIe link monitoring, baseline drift detection, timeline capture.

cuda cublas nvidia nvml pcie usb-pd gpu-monitoring power-delivery gb10 gpu-diagnostics dgx-spark throttle-detection clock-throttling

Updated Mar 22, 2026
Python

jxlarrea / homeassistant-voice-recipes

GPU/CUDA-accelerated voice control stack for Home Assistant. Runs on x86/x64 and ARM64 (including the NVIDIA DGX Spark). 100% Local - No Cloud, No Subscriptions.

text-to-speech x86-64 cuda gpu-acceleration home-assistant speech-to-text arm64 voice-assistant local-llm qwen3 gb10 dgx-spark

Updated Jun 13, 2026
Go

timothystewart6 / vllm-gb10

vLLM Docker image for the NVIDIA DGX Spark (GB10 / sm_121a).

docker cuda inference pytorch nvidia arm64 llm vllm gb10 dgx-spark

Updated Jun 2, 2026
Shell

AEON-7 / Qwen3.6-27B-AEON-Ultimate-Uncensored-DDTree

Experimental DDTree-on-vLLM research track for Qwen3.6 AEON Ultimate on DGX Spark / GB10.

blackwell vllm speculative-decoding gb10 nvfp4 dgx-spark dflash qwen36 ddtree

Updated May 15, 2026
Python
