-
Notifications
You must be signed in to change notification settings - Fork 55
Description
Hi all,
I'm in the process of researching Apple AMX as a potential way of speeding up IEEE FP BLAS kernels in OpenBLAS.
On the MacOS side, it seems that between this repository and other resources, I have all I need to be able to write the kernels.
The issue as of now is Linux. Speaking with the folks supporting/developing Asahi Linux (see mastodon thread here: https://mast.hpc.social/@fclc/109914828822965657) discussion came up that Asahi has no plans to support the EL0 CPU state required for AMX.
I'm of the opinion that it may be possible to implement a Linux kernel module to allow for the usage of AMX on M1, M2 and the various SKUs based on those SOCs.
This would probably require fairly tight understanding of AMX and its underlying operations.
I was hoping for insight from any of the folks working on this present project.