

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 48,966 8,196 Updated Dec 9, 2024

Performance-portable, length-agnostic SIMD with runtime dispatch

C++ 5,113 387 Updated Nov 5, 2025

Public domain cross platform lock free thread caching 16-byte aligned memory allocator implemented in C

Python 2,359 203 Updated Oct 26, 2025

Meta fork of the OG Jemalloc project

C 193 15 Updated Nov 2, 2025

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 8,728 1,514 Updated Nov 5, 2025

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 72,142 8,557 Updated Nov 4, 2025

Implementations of SIMD instruction sets for systems which don't natively support them.

C 2,838 289 Updated Oct 24, 2025

An open-source C++ library developed and used at Facebook.

C++ 30,019 5,797 Updated Nov 5, 2025
C++ 4,971 530 Updated Nov 4, 2025

A fast & densely stored hashmap and hashset based on robin-hood backward shift deletion

C++ 1,188 91 Updated Nov 2, 2025

Main gperftools repository

C++ 8,856 1,535 Updated Oct 10, 2025

mimalloc is a compact general purpose allocator with excellent performance.

C 12,114 1,011 Updated Oct 7, 2025

Example RISC-V Out-of-Order/Superscalar Processor Performance Core and MSS Model

C++ 190 74 Updated Oct 25, 2025

A fast and scalable x86-64 multicore simulator

C++ 378 194 Updated Nov 27, 2023

A tool for running small microbenchmarks on recent Intel and AMD x86 CPUs.

Python 484 62 Updated Jun 8, 2025

A small C compiler

C 10,654 976 Updated Oct 30, 2023
C 183 70 Updated Nov 4, 2025
C 146 13 Updated May 21, 2025

Capstone disassembly/disassembler framework for ARM, ARM64 (ARMv8), Alpha, BPF, Ethereum VM, HPPA, LoongArch, M68K, M680X, Mips, MOS65XX, PPC, RISC-V(rv32G/rv64G), SH, Sparc, SystemZ, TMS320C64X, T…

C 8,349 1,625 Updated Oct 31, 2025

A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation

C++ 1,433 229 Updated Sep 7, 2025

The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.

C++ 3,069 808 Updated Nov 4, 2025

Performance monitoring and benchmarking suite

C 1,843 251 Updated Nov 3, 2025

uops.info Code Analyzer

Python 298 23 Updated Jan 14, 2024

Open Source Architecture Code Analyzer

Jupyter Notebook 336 25 Updated Oct 2, 2025

VVenC, the Fraunhofer Versatile Video Encoder

C++ 1,104 200 Updated Nov 3, 2025

Suite for benchmarking malloc implementations.

C 457 63 Updated Oct 23, 2025

VVdeC, the Fraunhofer Versatile Video Decoder

C++ 522 111 Updated Nov 4, 2025
469 35 Updated Nov 3, 2023

Parsing gigabytes of JSON per second: used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks

C++ 22,671 1,170 Updated Nov 4, 2025

ARMv8 performance monitor from userspace

C 83 28 Updated Jun 8, 2023