Starred repositories, filtered to those written in CUDA
A massively parallel, optimal functional runtime in Rust
[ICLR 2025, ICML 2025, NeurIPS 2025 Spotlight] Quantized attention that achieves a 2-5x speedup over FlashAttention without losing end-to-end metrics across language, image, and video models (see the quantized-attention sketch after this list).
Reference implementation of the Megalodon 7B model
[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference
Efficient GPU support for LLM inference with x-bit quantization (e.g., FP6, FP5).
Implementation of fused cosine-similarity attention in the same style as Flash Attention (see the cosine-attention sketch after this list)
Code for the paper "Cottention: Linear Transformers With Cosine Attention"
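
A minimal sketch of the general idea behind quantized attention, as referenced in the SageAttention entry above: quantize Q and K to INT8 with per-tensor scales, compute the Q·K^T matmul in integer arithmetic, and dequantize the logits before the softmax. This is an illustration only, not the repository's actual fused CUDA kernels; the function names and the NumPy reference below are assumptions of mine.

```python
# Illustrative sketch of quantized attention (not SageAttention's actual kernels).
import numpy as np

def quantize_int8(x):
    """Symmetric per-tensor INT8 quantization; returns int8 values and a scale."""
    scale = max(np.abs(x).max() / 127.0, 1e-8)
    return np.round(x / scale).astype(np.int8), scale

def quantized_attention(q, k, v):
    """q, k, v: (seq_len, head_dim) float arrays."""
    d = q.shape[-1]
    q_i8, q_scale = quantize_int8(q)
    k_i8, k_scale = quantize_int8(k)
    # Integer matmul (accumulated in int32), then dequantize the logits.
    logits = (q_i8.astype(np.int32) @ k_i8.astype(np.int32).T) * (q_scale * k_scale) / np.sqrt(d)
    logits -= logits.max(axis=-1, keepdims=True)   # numerically stable softmax
    p = np.exp(logits)
    p /= p.sum(axis=-1, keepdims=True)
    return p @ v

# Toy usage
rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((8, 16)) for _ in range(3))
out = quantized_attention(q, k, v)   # (8, 16)
```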
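
A similar sketch for the last two entries: cosine attention L2-normalizes queries and keys so the attention logits become cosine similarities, scaled by a temperature. This is hypothetical NumPy code, not taken from either repository, and the `scale` temperature is an assumed parameter; the Cottention paper further rearranges cosine attention into a linear-attention form, which is not shown here.

```python
# Illustrative sketch of cosine-similarity attention (softmax form).
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cosine_attention(q, k, v, scale=10.0):
    """q, k, v: (seq_len, head_dim); `scale` is a hypothetical temperature."""
    qn = q / np.linalg.norm(q, axis=-1, keepdims=True)
    kn = k / np.linalg.norm(k, axis=-1, keepdims=True)
    scores = scale * qn @ kn.T          # cosine similarities in [-1, 1], rescaled
    return softmax(scores, axis=-1) @ v

# Toy usage
rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((8, 16)) for _ in range(3))
out = cosine_attention(q, k, v)        # (8, 16)
```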