mxfp8

Star

Here are 5 public repositories matching this topic...

NVIDIA / cudnn-frontend

Star

cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it

Updated May 20, 2026
Python

waybarrios / dgx-spark-finetune-llm

Star

LLM fine-tuning with LoRA + NVFP4/MXFP8 on NVIDIA DGX Spark (Blackwell GB10)

deep-learning pytorch nvidia lora quantization fine-tuning blackwell llm nvfp4 dgx-spark transformer-engine mxfp8

Updated Dec 22, 2025
Python

sayakpaul / diffusers-blackwell-quants

Star

Easy recipes to speed up latency of Flux, QwenImage, and LTX-2 with NVFP4 and MXFP8 on Blackwell.

pytorch image-gen diffusers video-gen torchao blackwell-gpu nvfp4 mxfp8

Updated Apr 10, 2026
Python

MoHussein197 / dgx-spark-finetune-llm

Star

🔧 Fine-tune large language models efficiently on NVIDIA DGX Spark with LoRA adapters and optimized quantization for high performance.

deep-learning pytorch nvidia lora quantization fine-tuning blackwell llm nvfp4 dgx-spark transformer-engine mxfp8

Updated May 20, 2026
Python

idonati / spark-vllm-docker-festr2

Star

Patches + recipe to deploy festr2/MiMo-V2.5-Pro-NVFP4-MXFP8-attn-TP8 on 8-node DGX Spark sm_121 (Ray + vLLM, TP=8). Fixes the fused-qkv loader bug that mis-slotted Q values as K/V on 7 of 8 ranks.

moe ray quantization mimo huggingface vllm gb10 nvfp4 dgx-spark mxfp8 sm121 tensor-parallel

Updated May 19, 2026
Python

Improve this page

Add a description, image, and links to the mxfp8 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the mxfp8 topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mxfp8

Here are 5 public repositories matching this topic...

NVIDIA / cudnn-frontend

waybarrios / dgx-spark-finetune-llm

sayakpaul / diffusers-blackwell-quants

MoHussein197 / dgx-spark-finetune-llm

idonati / spark-vllm-docker-festr2

Improve this page

Add this topic to your repo