h100

Here are 5 public repositories matching this topic...

DeepSeek-V3, R1 671B on 8xH100 Throughput Benchmarks

Cog Single GPU Quantized Implementation of Step-Video-T2V

replicate single-gpu fp8 h100 step-video-t2v diffsynth

Production-grade GPU acceleration for robot learning. 10-20× faster training on NVIDIA H100/A100. Nsight validated.

One-offs.

bedrock resnet cifar mosaic faiss claude-instant h100 titan-embeddings claude-instant-1 langchain-document

LLaMA-Factory FP8 training environment for NVIDIA Hopper GPUs. Fixes common configuration issues causing 2x slowdown with FP8 mixed precision.

deep-learning pytorch nvidia hopper performance-optimization fp8 h100 llama-factory gh200 transformer-engine

Add a description, image, and links to the h100 topic page so that developers can more easily learn about it.

To associate your repository with the h100 topic, visit your repo's landing page and select "manage topics."