Stars
SIMD-accelerated distances, dot products, matrix ops, geospatial & geometric kernels for 16 numeric types — from 6-bit floats to 64-bit complex — across x86, Arm, RISC-V, and WASM, with bindings fo…
Zig port of dendibakh/perf-ninja - an online course where you can learn and master the skill of low-level performance analysis and tuning.
Novel implementation of a Trie data structure optimized for small, sparse maps
The PULP Ara is a 64-bit Vector Unit, compatible with the RISC-V Vector Extension Version 1.0, working as a coprocessor to CORE-V's CVA6 core
A fast multi-producer, multi-consumer lock-free concurrent queue for C++11
A comparative, extendable benchmarking suite for C and C++ hash-table libraries.
Decompiler Explorer! Compare tools on the forefront of static analysis, now in your web browser!
Continuous profiling for analysis of CPU and memory usage, down to the line number and throughout time. Saving infrastructure cost, improving performance, and increasing reliability.
MTuner is a C/C++ memory profiler and memory leak finder for Windows and other platforms
A tool for running small microbenchmarks on recent Intel and AMD x86 CPUs.
The book "Performance Analysis and Tuning on Modern CPU"
Open-source Linux performance suite for engineers—profiling and tuning workloads and system configurations.
ChampSim is an open-source trace based simulator maintained at Texas A&M University and through the support of the computer architecture community.
This is an online course where you can learn and master the skill of low-level performance analysis and tuning.
A free and strong UCI chess engine
Low-overhead tracing of all Linux kernel-user transitions, for serious performance analysis. Includes kernel patches, loadable module, and post-processing software. Output is HTML/SVG per-CPU-core …