hbm
Here are 7 public repositories matching this topic...
DreamRAM: A Fine-Grained Configurable Design Space Modeling Tool for Custom 3D Die-Stacked DRAM
-
Updated
Apr 28, 2026 - Python
Block-sparse for v5e-1 eliminates the O(N²) HBM attention matrix via online fused softmax, crossing the 240.2 FLOPs/byte ridge point at N≥1024
-
Updated
Mar 11, 2026 - Python
Thermal-aware batch controller for vLLM/TensorRT-LLM. Prevents HBM thermal throttling from killing p99 latency on H100/H200. Monitors nvidia-smi, auto-cuts batch size at 85°C, migrates cold KV to DRAM. Prometheus + Grafana included. 4.2s -> 2.1s p99 at 128K context.
-
Updated
Apr 13, 2026 - Python
A.F.O artifact for bridge-sensitive bottleneck attribution and control in hierarchical LLM memory paths (gem5-based reproducibility bundle).
-
Updated
May 11, 2026 - Python
🔲 Chip substrate — 28-verb semiconductor stack (architecture / design / EDA / process / packaging / NPU / PIM / 3D / photonic / RTL-gen / yield / consciousness-chip).
-
Updated
May 14, 2026 - Python
Reference simulator + benchmark harness for HBM residency control and fragmentation metrics.
-
Updated
Apr 2, 2026 - Python
Improve this page
Add a description, image, and links to the hbm topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the hbm topic, visit your repo's landing page and select "manage topics."