🧭 Enhance navigation with VLN-YuanNav, a visual-language model using advanced memory and decision-making for effective exploration.
📊 Summarize merged PRs daily with vLLM, ensuring you stay updated on key changes and enhancements in your projects.
🔧 Fine-tune large language models efficiently on NVIDIA DGX Spark with LoRA adapters and optimized quantization for high performance.
🚀 Build and explore OpenAI's GPT-OSS model from scratch in Python, unlocking the mechanics of large language models.
A high-throughput and memory-efficient inference and serving engine for LLMs
An AI memory system that never forgets. Qdrant vectors + FalkorDB knowledge graph + neural reranking, self-hosted on 3 GPUs.
🚀 Deploy and manage vLLM with ready-made skills for modular automation, adhering to the Anthropic skills template for seamless integration.
High-performance LLM inference engine in C++/CUDA for NVIDIA Blackwell GPUs (RTX 5090)
SharQ: Bridging Activation Sparsity and FP4 Quantization for LLM Inference
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate inference execution in a performant way.
Production LLM deployment specs for NVIDIA Blackwell GPUs (RTX Pro 6000, DGX Spark). Includes vLLM configurations, benchmarks, load balancer, and throughput calculators for NVFP4/FP8/MoE models.
Parallax is a distributed model serving framework that lets you build your own AI cluster anywhere
Deploy Nemotron 3 Nano 30B on NVIDIA DGX Spark using TensorRT-LLM (Blackwell GB10, NVFP4 quantization, OpenAI-compatible API)
Deploy Nemotron 3 Nano 30B with 1M context window on NVIDIA DGX Spark using llama.cpp (Blackwell sm_121, Q4_0 KV cache quantization)
Production C++ FIX 4.4 execution gateway for YUCLAW. Compiles on ARM64 DGX Spark Blackwell. Graduated execution levels 0-4. Real risk controls at the hardware layer.
Technical insights from r/LocalLLaMA — vLLM, FP8, NVFP4, Blackwell GPU benchmarks, and more. Unverified community knowledge, generated by Nemotron 9B. Issues welcome.