Adaptive Block-Scaled Data Types
… IF4 (Int/Float 4) data type selects between FP4 and INT4 … The selected data type is denoted
using the scale factor’s … outperforms existing 4-bit block-scaled formats, achieving lower loss …
using the scale factor’s … outperforms existing 4-bit block-scaled formats, achieving lower loss …
Adaptive block-scaled gemms on vector processors for dnn training at the edge
… block-scaled datatypes and implement GeMMs using them on ARM vector processor. • We
develop an adaptive block-scaled … “fine-adaptation” to allow accuracytraining time trade-off. …
develop an adaptive block-scaled … “fine-adaptation” to allow accuracytraining time trade-off. …
Efficient DNN Training Using Vectorized Block-Scaled GeMMs with Adaptive Block Shapes
N Satya Murthy, N Laubeuf, D Bhattacharjee… - IFIP/IEEE International …, 2024 - Springer
… operations of our block-scaled kernels. We also provide more ablation studies for the
adaptive training setup to justify our approach. Our main contributions are as follows: …
adaptive training setup to justify our approach. Our main contributions are as follows: …
Efficient DNN Training Using Vectorized Block-Scaled GeMMs with Adaptive Block
NS Murthy, N Laubeuf¹, D Bhattacharjee¹… - VLSI-SoC: Technology … - books.google.com
… vector shapes for blockscaled datatypes and implement … adaptive block-scaled DNN
training approach, by initially performing “coarse-commutative” training followed by “fine-adaptation…
training approach, by initially performing “coarse-commutative” training followed by “fine-adaptation…
Joint and individual variation explained (JIVE) for integrated analysis of multiple data types
… Table 1 gives very diverse examples of such data objects. In this context we refer to each
dataset as a data type to indicate that it comes from a distinct mode of measurement or domain. …
dataset as a data type to indicate that it comes from a distinct mode of measurement or domain. …
Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling
… While Four Over Six could theoretically be applied to other block-scaled low-precision FP4
… factors: a block scaled to 4 requires a scale factor that is 50% larger than a block scaled to 6. …
… factors: a block scaled to 4 requires a scale factor that is 50% larger than a block scaled to 6. …
Block-Quantized and Data-Efficient Deep Learning at the Edge: Optimizing Deployments with General-Purpose and Domain-Specific Vector Instruction Sets
N Satya Murthy, M Verhelst, F Catthoor - 2025 - lirias.kuleuven.be
… Additionally, model adaptation at the edge is often required to effectively handle data drift, …
Block-scaled GeMM kernels are designed and optimized across multiple design parameters…
Block-scaled GeMM kernels are designed and optimized across multiple design parameters…
[PDF][PDF] Bit-Accurate Simulation of Convolution-Based Filtering on Reconfigurable Hardware
H Scherl, M Kowarschik, J Hornegger - … in Erlangen 2005 Erlangen 12.-15, 2005 - cs.fau.de
… We focus on convolutions of block– scaled 16 bit data both … chain using fixed–point data
types, we based our software … , and the block exponent is adapted appropriately. This results in a …
types, we based our software … , and the block exponent is adapted appropriately. This results in a …
Error Diffusion: Post Training Quantization with Block-Scaled Number Formats for Neural Networks
A Khodamoradi, K Denolf, E Dellinger - arXiv preprint arXiv:2410.11203, 2024 - arxiv.org
… quantization with support for block-scaled data formats. Our … variety of number formats,
including the block-scaled ones, to aid the … Our experiments confirm that block-scaled data formats …
including the block-scaled ones, to aid the … Our experiments confirm that block-scaled data formats …
Efficient precision-scalable hardware for microscaling (MX) processing in robotics learning
… For this edge training, Microscaling (MX) data types offer a … unit that supports all six MX
data types by exploiting sub-word … [33] NS Murthy et al., “Adaptive block-scaled gemms on …
data types by exploiting sub-word … [33] NS Murthy et al., “Adaptive block-scaled gemms on …