Distribute and run LLMs with a single file.
trholding / llama2.c
Forked from karpathy/llama2.c
Llama 2 Everywhere (L2E)
Tutel MoE: Optimized Mixture-of-Experts library, supporting GptOss/DeepSeek/Kimi-K2/Qwen3 with FP8/NVFP4/MXFP4
Library for specialized dense and sparse matrix operations, and deep learning primitives.
Official code for the NeurIPS 2022 paper "Shape, Light, and Material Decomposition from Images using Monte Carlo Rendering and Denoising".
Neural networks with low-bit weights on low-end 32-bit microcontrollers such as the CH32V003 RISC-V microcontroller and others
Fast Hadamard transform in CUDA, with a PyTorch interface
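The library above runs on the GPU via CUDA; as a hedged illustration of what the transform computes, here is a minimal pure-Python/NumPy sketch of the O(n log n) Walsh-Hadamard butterfly (the function name `fht` and the plain-Python loops are mine, not the library's API):

```python
import numpy as np

def fht(x):
    # Fast Walsh-Hadamard transform (unnormalized); length must be a power of two.
    # Illustrative CPU version of the butterfly the CUDA kernel parallelizes.
    x = np.asarray(x, dtype=np.float64).copy()
    n = x.size
    assert n > 0 and n & (n - 1) == 0, "length must be a power of two"
    h = 1
    while h < n:
        for i in range(0, n, h * 2):
            for j in range(i, i + h):
                a, b = x[j], x[j + h]
                x[j], x[j + h] = a + b, a - b  # butterfly: sum and difference
        h *= 2
    return x
```

For example, `fht([1, 0, 0, 0])` returns `[1, 1, 1, 1]`, the first column of the 4x4 Hadamard matrix.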
Step by step explanation/tutorial of llama2.c
Implementation of IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs (ICLR 2024).
Tiny example project to test bitwise vector cosine similarity
cgoxopx / llama2.gl
Forked from karpathy/llama2.c
Inference Llama 2 in OpenGL Compute Shader
The goal of this project is to use CPU and GPU hardware optimizations to accelerate FFT computation.