masahi

Organizations

@apache @dmlc @octoml


A TUI Git client inspired by Magit

Rust 2,806 149 Updated Jun 5, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 6,521 606 Updated Jun 19, 2026

CUDA/Metal accelerated language model inference

C 643 32 Updated May 29, 2025

RPyC (Remote Python Call) - A transparent and symmetric RPC library for python

Python 1,703 250 Updated Aug 14, 2025

📚 Jupyter notebook tutorials for OpenVINO™

Jupyter Notebook 3,162 1,020 Updated Jun 19, 2026

Embree ray tracing kernels repository.

C++ 2,706 430 Updated Jun 18, 2026

Universal LLM Deployment Engine with ML Compilation

Python 22,823 2,066 Updated May 11, 2026

Build system, successor to Buck

Rust 4,361 364 Updated Jun 19, 2026

MoonRay is an open-source, award-winning, state-of-the-art production path tracing renderer, initially developed at DreamWorks and an active member project of the Academy Software Foundation.

CMake 4,686 303 Updated Jun 16, 2026

optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052

C++ 479 36 Updated Mar 15, 2024

Language Modeling with the H3 State Space Model

Assembly 523 51 Updated Sep 29, 2023

An open-source efficient deep learning framework/compiler, written in python.

Python 743 69 Updated Sep 4, 2025

An efficient vector-graphics renderer

Rust 2,643 56 Updated May 16, 2023

A GPU compute-centric 2D renderer.

Rust 4,109 259 Updated Jun 19, 2026

A modern cross-platform low-level graphics library and rendering framework

Batchfile 4,331 384 Updated Jun 19, 2026

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Python 4,720 388 Updated Apr 9, 2026

Real-time GPU path tracing with an OpenUSD Hydra render delegate

C++ 607 50 Updated Aug 8, 2025

This is the development repository for the OpenFHE library. The current version is 1.5.1 (released on April 10, 2026).

C++ 1,139 295 Updated Jun 18, 2026

3D fluid simulation experiments in Rust, using WebGPU-rs (WIP)

Rust 484 18 Updated Dec 17, 2022

HLSL 518 89 Updated May 21, 2026

A STARK prover and verifier for arbitrary computations

Rust 892 232 Updated Jul 19, 2025

The Flutter engine

C++ 7,575 5,967 Updated Feb 25, 2025

A General-purpose Task-parallel Programming System in C++

C++ 12,017 1,395 Updated Jun 17, 2026

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

Python 2,662 309 Updated Jun 19, 2026

Single C file, Realtime CPU/GPU Profiler with Remote Web Viewer

C 3,304 284 Updated Aug 28, 2024

Vulkan and rust experiments, including a spectral path tracer using Vulkan ray tracing extensions

Rust 133 5 Updated Sep 13, 2025

Instant neural graphics primitives: lightning fast NeRF and more

Cuda 17,442 2,066 Updated Feb 2, 2026

magic-trace collects and displays high-resolution traces of what a process is doing

OCaml 6,105 194 Updated Jun 17, 2026

3D engine with modern graphics

C 7,098 754 Updated Jun 19, 2026