Skip to content
View leofang's full-sized avatar

Highlights

  • Pro

Organizations

@NVIDIA @mpi4py @conda-forge @cupy @rapidsai

Block or report leofang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 1,603 80 Updated Dec 17, 2025

Nsight Python is a Python kernel profiling interface based on NVIDIA Nsight Tools

Python 74 6 Updated Dec 16, 2025

A conda plugin which creates NVIDIA-specific virtual packages

Python 7 Updated Nov 12, 2025

NVIDIA NVSHMEM is a parallel programming interface for NVIDIA GPUs based on OpenSHMEM. NVSHMEM can significantly reduce multi-process communication and coordination overheads by allowing programmer…

C++ 415 47 Updated Nov 13, 2025

Schema validation just got Pythonic

Python 2,939 215 Updated Oct 26, 2025

Vector classes and utilities

Python 94 35 Updated Dec 15, 2025

Linter that finds portability issues in Python package distributions (wheels, sdists, conda packages).

Python 44 4 Updated Dec 8, 2025

NumPy & SciPy for GPU

Python 10,672 979 Updated Dec 17, 2025

Manipulating ragged arrays in an Array API compliant way.

Python 45 8 Updated Dec 14, 2025

A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).

C++ 567 73 Updated Sep 15, 2025

Reusable GitHub Actions workflows for RAPIDS CI

Shell 7 25 Updated Dec 15, 2025

Let your Claude able to think

TypeScript 16,611 1,963 Updated Nov 4, 2025

Experimental projects related to TensorRT

MLIR 117 22 Updated Dec 17, 2025

Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks

C++ 22,960 1,191 Updated Dec 17, 2025

Dr.Jit — A Just-In-Time-Compiler for Differentiable Rendering

C++ 730 55 Updated Dec 17, 2025

Python library for generating high-performance implementations of stencil kernels for weather and climate modeling from a domain-specific language (DSL).

Python 136 54 Updated Dec 17, 2025

The CUDA target for Numba

Python 229 50 Updated Dec 17, 2025

A Python module for decorators, wrappers and monkey patching.

Python 2,245 244 Updated Nov 7, 2025

Download Taiwan financial market data via FMD API.

Python 29 Updated Mar 6, 2025

DaCe - Data Centric Parallel Programming

Python 568 148 Updated Dec 17, 2025

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,509 810 Updated Dec 17, 2025

Python disk-backed cache (Django-compatible). Faster than Redis and Memcached. Pure-Python.

Python 2,744 155 Updated Aug 10, 2024

NVIDIA curated collection of educational resources related to general purpose GPU programming.

Jupyter Notebook 993 178 Updated Dec 12, 2025

NVIDIA Math Libraries for the Python Ecosystem

Cython 541 31 Updated Nov 17, 2025

芫荽,基於 Klee One 改造的學習用台灣繁體字型

Python 1,915 69 Updated May 30, 2025

GPU Development in Python 101 tutorial

Jupyter Notebook 278 68 Updated Oct 15, 2024

A library for detecting, labeling, and reasoning about microarchitectures

Python 123 32 Updated Dec 10, 2025

JupyterLite demo deployed to GitHub Pages 🚀

Jupyter Notebook 413 240 Updated Dec 16, 2025

A massively parallel, high-level programming language

Rust 19,115 468 Updated Jun 3, 2025

A cross-version Python bytecode decompiler

Python 4,177 448 Updated Nov 28, 2025
Next