Skip to content
View erfanzar's full-sized avatar
:shipit:
:shipit:

Organizations

@Instinct-AI

Block or report erfanzar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 6,589 625 Updated Jul 3, 2026

This is ROCgdb, the ROCm source-level debugger for Linux, based on GDB, the GNU source-level debugger.

C 73 24 Updated Jul 1, 2026

Static array shape checking for JAX powered by eval_shape

Python 2 Updated Jun 22, 2026

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 4,360 854 Updated Jul 3, 2026

A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.

C++ 33 5 Updated May 26, 2026

Anki is a smart spaced repetition flashcard program

Rust 28,908 3,077 Updated Jul 2, 2026

Probabilistic Programming and Nested sampling in JAX

Python 241 19 Updated May 13, 2026

Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)

Cuda 3 Updated Aug 12, 2025
Jupyter Notebook 25 5 Updated Dec 16, 2025

Code accompanying the paper "Generalized Interpolating Discrete Diffusion"

Python 120 16 Updated Jun 9, 2025

[DEPRECATED] Moved to ROCm/rocm-libraries repo

C++ 117 66 Updated Jun 8, 2026

[DEPRECATED] Moved to ROCm/rocm-libraries repo

C++ 135 65 Updated Jun 17, 2026

A tool and a library for bi-directional translation between SPIR-V and LLVM IR

LLVM 616 269 Updated Jul 2, 2026
MLIR 183 56 Updated Jun 30, 2026

Shared Middle-Layer for Triton Compilation

MLIR 339 103 Updated Dec 5, 2025

Open-source framework for the research and development of foundation models.

Python 1,159 135 Updated Jul 3, 2026

Minimal yet performant LLM examples in pure JAX

Python 263 36 Updated Apr 10, 2026

An implementation of PSGD Kron in JAX for distributed training in JAX or Flax

Python 10 Updated Nov 6, 2025

Minimal but scalable implementation of large language models in JAX

Python 34 5 Updated Nov 28, 2025

[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

Python 3,566 289 Updated Jul 17, 2025

Train very large language models in Jax.

Python 208 17 Updated Oct 21, 2023

A simple library for scaling up JAX programs

Python 148 11 Updated Nov 4, 2025

jax-triton contains integrations between JAX and OpenAI Triton

Python 465 58 Updated Jun 24, 2026

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 101,264 28,228 Updated Jul 3, 2026

The Gosub browser engine

Rust 3,673 182 Updated Jul 2, 2026

Train transformer language models with reinforcement learning.

Python 18,749 2,817 Updated Jul 3, 2026
Python 1,428 102 Updated Jul 2, 2026

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 99,923 9,530 Updated Jul 1, 2026

Everything you want to know about Google Cloud TPU

Python 570 30 Updated Jul 16, 2024
Next