Skip to content
View TejashShah's full-sized avatar
  • https://github.com/ROCmSoftwarePlatform/MIOpen
  • San Diego, CA
  • X @_TejashShah_

Block or report TejashShah

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Fast and memory-efficient exact attention

Python 21,199 2,232 Updated Dec 20, 2025

PjRt plugin and Python APIs for MPMD workflows in Jax

C++ 7 Updated Aug 4, 2025

Experimental projects related to TensorRT

MLIR 116 22 Updated Dec 20, 2025

PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily write your own.

Python 1,430 111 Updated Dec 19, 2025

A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

C++ 367 74 Updated Dec 20, 2025

Home for OctoML PyTorch Profiler

114 9 Updated Apr 24, 2023

[DEPRECATED] Moved to ROCm/rocm-libraries repo

Assembly 1,185 268 Updated Dec 20, 2025