#
ampere
Here are 3 public repositories matching this topic...
AML's goal is to make benchmarking of various AI architectures on Ampere CPUs a pleasurable experience :)
machine-learning natural-language-processing computer-vision model-zoo tensorflow inference pytorch artificial-intelligence arm64 aarch64 ampere armv8-a onnxruntime mlperf-inference dlrm large-language-models yolov8 llama2
-
Updated
Nov 19, 2025 - Python
Cross-platform FlashAttention-2 Triton implementation for Turing+ with custom configuration mode
machine-learning deep-learning gpu optimization pytorch triton attention hopper implementation attention-mechanism turing ampere implementation-from-scratch blackwell transfromers large-language-models llm flash-attention flash-attention-2 flashattention
-
Updated
Dec 16, 2025 - Python
Improve this page
Add a description, image, and links to the ampere topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the ampere topic, visit your repo's landing page and select "manage topics."