Stars
One header file library that implement missing transcendental math functions (cos, sin, acos, and more....) using 100% AVX/Neon instructions (no branching)
The platform independent header allowing to compile any C/C++ code containing ARM NEON intrinsic functions for x86 target systems using SIMD up to AVX2 intrinsic functions
A tool for running small microbenchmarks on recent Intel and AMD x86 CPUs.
Tutorial: Writing a "bare metal" operating system for Raspberry Pi 4
Agenium Scale vectorization library for CPUs and GPUs
The book "Performance Analysis and Tuning on Modern CPU"
fengwang / float16_t
Forked from acgessler/half_floatCPP20 implementation of a 16-bit floating-point type mimicking most of the IEEE 754 behavior. Single file and header-only.
Optimized implementations of various library functions for ARM architecture processors
TIGR - the TIny GRaphics library for Windows, macOS, Linux, iOS and Android.
X86 CPU topics overview for developers , oriented towards performance
C++14 concurrent lock-free low-latency queue.
f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.
An introduction to ARM64 assembly on Apple Silicon Macs
Library for collision detection between two convex shapes
canonical libwebsockets.org networking library
Conversion to/from half-precision floating point formats
Compile Time Regular Expression in C++