DEV Community

# gpu

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Same eBPF, Different Vendor: Tracing libhip Calls on AMD ROCm

Same eBPF, Different Vendor: Tracing libhip Calls on AMD ROCm

Comments
3 min read
Best GPU for Llama 70B in 2026 (48GB+ VRAM Required)

Best GPU for Llama 70B in 2026 (48GB+ VRAM Required)

Comments
6 min read
From TCP Retransmits to MCP-Driven Cluster Investigations: An eBPF GPU Agent Retrospective

From TCP Retransmits to MCP-Driven Cluster Investigations: An eBPF GPU Agent Retrospective

1
Comments
8 min read
From Zero to Supercomputing: A Beginner-Friendly Guide to Using HPC Clusters Like CINECA

From Zero to Supercomputing: A Beginner-Friendly Guide to Using HPC Clusters Like CINECA

Comments
5 min read
What Inference-Platform Benchmark Posts Leave Out

What Inference-Platform Benchmark Posts Leave Out

Comments
8 min read
Every GPU Container Bug I've Hit on OKE (and How I Fixed Them)

Every GPU Container Bug I've Hit on OKE (and How I Fixed Them)

1
Comments
5 min read
Why CUDA kernels silently corrupt memory and how to catch the bug

Why CUDA kernels silently corrupt memory and how to catch the bug

Comments
5 min read
AMD RDNA 4 & AI PRO GPUs Launch, FSR 4.1 Benchmarks, DGX Water Cooling

AMD RDNA 4 & AI PRO GPUs Launch, FSR 4.1 Benchmarks, DGX Water Cooling

Comments
3 min read
RTX 5080 Launched, Rust for CUDA, & LLM GPU Scheduling Deep Dive

RTX 5080 Launched, Rust for CUDA, & LLM GPU Scheduling Deep Dive

Comments
3 min read
MCP Shows What the Agent Did. eBPF Shows Why the GPU Stalled.

MCP Shows What the Agent Did. eBPF Shows Why the GPU Stalled.

Comments
7 min read
Fractals and Non-Euclidean Geometry in the Browser

Fractals and Non-Euclidean Geometry in the Browser

Comments
3 min read
Which serverless GPU platforms actually have fast cold starts for AI inference — p99, not p50

Which serverless GPU platforms actually have fast cold starts for AI inference — p99, not p50

Comments
2 min read
DeepSeek-V4-Flash Benchmarks, FlashRT CUDA Runtime, & V100 LLM Performance

DeepSeek-V4-Flash Benchmarks, FlashRT CUDA Runtime, & V100 LLM Performance

Comments
3 min read
RTX 5090, LLaMA.cpp TurboQuant, & Blackwell CUDA Scheduling Boosts GPU Performance

RTX 5090, LLaMA.cpp TurboQuant, & Blackwell CUDA Scheduling Boosts GPU Performance

1
Comments
3 min read
CUDA-Oxide 0.1, RTX 5070 Launch, & BeeLlama.cpp Boost 3090 Inference

CUDA-Oxide 0.1, RTX 5070 Launch, & BeeLlama.cpp Boost 3090 Inference

Comments
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.