Starred repositories
Up to 100x faster strings for C, C++, CUDA, Python, Rust, Swift, JS, & Go, leveraging NEON, AVX2, AVX-512, SVE, GPGPU, & SWAR to accelerate search, hashing, sorting, edit distances, sketches, and m…
MPI programming lessons in C and executable code examples
FZF sorter for telescope, written in C
A simple yet fast user space network driver for Intel 10 Gbit/s NICs written from scratch
NJU EMUlator, a full system x86/mips32/riscv32/riscv64 emulator for teaching
[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning; [NeurIPS 2022] MCUNetV3: On-Device Training Under 2…
Bao, a Lightweight Static Partitioning Hypervisor
Pluto: An automatic polyhedral parallelizer and locality optimizer
Tiny Dream - An embedded, Header Only, Stable Diffusion C++ implementation
A prototype implementation of Bao for PostgreSQL
Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git)
An event-driven network library based on the reactor pattern, written in C.