blackwell

Here are 2 public repositories matching this topic...

Code for the paper "ARCQuant: Boosting NVFP4 Quantization with Augmented Residual Channels for LLMs"

quantization mixed-precision blackwell llm llm-inference microscaling nvfp4

High-performance LLM inference engine in C++/CUDA for NVIDIA Blackwell GPUs (RTX 5090)

cpp cuda inference nvidia transformer quantization mamba mixture-of-experts blackwell llm qwen gguf rtx-5090 gated-deltanet
