bongole

Hirohisa Mitsuishi bongole

75 followers · 11 following

Tsukuba, Japan
http://twitter.com/bongole

Achievements

Stars

12 stars written in Cuda

Clear filter

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 29,232 3,441 Updated Jun 26, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 9,060 1,128 Updated Feb 9, 2026

baidu-research / warp-ctc

Fast parallel CTC.

Cuda 4,075 1,033 Updated Mar 4, 2024

thu-ml / SageAttention

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 3,239 376 Updated Jan 17, 2026

rapidsai / cugraph

cuGraph - RAPIDS Graph Analytics Library

Cuda 2,147 347 Updated Mar 20, 2026

CannyLab / tsne-cuda

GPU Accelerated t-SNE for CUDA with Python bindings

Cuda 1,926 137 Updated Oct 2, 2024

NVIDIA / nvbench

CUDA Kernel Benchmarking Library

Cuda 834 102 Updated Mar 20, 2026

rapidsai / cuspatial

CUDA-accelerated GIS and spatiotemporal algorithms

Cuda 699 164 Updated Jul 28, 2025

tensorflow / recommenders-addons

Additional utils and helpers to extend TensorFlow when build recommendation systems, contributed and maintained by SIG Recommenders.

Cuda 634 145 Updated Sep 4, 2025

bycloudai / instant-ngp-Windows

Forked from NVlabs/instant-ngp

Instant neural graphics primitives: lightning fast NeRF and more

Cuda 502 74 Updated Aug 14, 2022

enp1s0 / ozIMMU

FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme

Cuda 114 5 Updated Dec 2, 2025

loganwatchorn / notes-pmpp

Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.)

Cuda 53 Updated Aug 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hirohisa Mitsuishi bongole

Achievements

Achievements

Block or report bongole

Stars

karpathy / llm.c

deepseek-ai / DeepEP

baidu-research / warp-ctc

thu-ml / SageAttention

rapidsai / cugraph

CannyLab / tsne-cuda

NVIDIA / nvbench

rapidsai / cuspatial

tensorflow / recommenders-addons

bycloudai / instant-ngp-Windows

enp1s0 / ozIMMU

loganwatchorn / notes-pmpp