Adaptive Block-Scaled Data Types

J Cook, HS Lee, K Le, J Guo, G Traverso… - arXiv preprint arXiv …, 2026 - arxiv.org
… IF4 (Int/Float 4) data type selects between FP4 and INT4 … The selected data type is denoted
using the scale factor’s … outperforms existing 4-bit block-scaled formats, achieving lower loss …

Adaptive block-scaled GeMMs on vector processors for DNN training at the edge

NS Murthy, N Laubeuf, D Bhattacharjee… - 2024 IFIP/IEEE 32nd …, 2024 - ieeexplore.ieee.org
block-scaled datatypes and implement GeMMs using them on ARM vector processor. • We
develop an adaptive block-scaled … “fine-adaptation” to allow accuracy-training time trade-off. …

Efficient DNN Training Using Vectorized Block-Scaled GeMMs with Adaptive Block Shapes

N Satya Murthy, N Laubeuf, D Bhattacharjee… - IFIP/IEEE International …, 2024 - Springer
… operations of our block-scaled kernels. We also provide more ablation studies for the
adaptive training setup to justify our approach. Our main contributions are as follows: …

Efficient DNN Training Using Vectorized Block-Scaled GeMMs with Adaptive Block

NS Murthy, N Laubeuf, D Bhattacharjee… - VLSI-SoC: Technology … - books.google.com
… vector shapes for block-scaled datatypes and implement … adaptive block-scaled DNN
training approach, by initially performing “coarse-commutative” training followed by “fine-adaptation” …

Joint and individual variation explained (JIVE) for integrated analysis of multiple data types

EF Lock, KA Hoadley, JS Marron… - The annals of applied …, 2013 - pmc.ncbi.nlm.nih.gov
… Table 1 gives very diverse examples of such data objects. In this context we refer to each
dataset as a data type to indicate that it comes from a distinct mode of measurement or domain. …

Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling

J Cook, J Guo, G Xiao, Y Lin, S Han - arXiv preprint arXiv:2512.02010, 2025 - arxiv.org
… While Four Over Six could theoretically be applied to other block-scaled low-precision FP4
… factors: a block scaled to 4 requires a scale factor that is 50% larger than a block scaled to 6. …

Block-Quantized and Data-Efficient Deep Learning at the Edge: Optimizing Deployments with General-Purpose and Domain-Specific Vector Instruction Sets

N Satya Murthy, M Verhelst, F Catthoor - 2025 - lirias.kuleuven.be
… Additionally, model adaptation at the edge is often required to effectively handle data drift, …
Block-scaled GeMM kernels are designed and optimized across multiple design parameters…

[PDF] Bit-Accurate Simulation of Convolution-Based Filtering on Reconfigurable Hardware

H Scherl, M Kowarschik, J Hornegger - … in Erlangen 2005 Erlangen 12.-15, 2005 - cs.fau.de
… We focus on convolutions of blockscaled 16 bit data both … chain using fixed-point data
types, we based our software … , and the block exponent is adapted appropriately. This results in a …

Error Diffusion: Post Training Quantization with Block-Scaled Number Formats for Neural Networks

A Khodamoradi, K Denolf, E Dellinger - arXiv preprint arXiv:2410.11203, 2024 - arxiv.org
… quantization with support for block-scaled data formats. Our … variety of number formats,
including the block-scaled ones, to aid the … Our experiments confirm that block-scaled data formats …

Efficient precision-scalable hardware for microscaling (MX) processing in robotics learning

S Cuyckens, X Yi, NS Murthy, C Fang… - 2025 IEEE/ACM …, 2025 - ieeexplore.ieee.org
… For this edge training, Microscaling (MX) data types offer a … unit that supports all six MX
data types by exploiting sub-word … [33] NS Murthy et al., “Adaptive block-scaled gemms on …