jepeake's starred repositories: 29 results for source repositories written in C++
LLM inference in C/C++

C++ 89,569 13,639 Updated Nov 11, 2025

Port of OpenAI's Whisper model in C/C++

C++ 44,418 4,919 Updated Nov 9, 2025

A monitor of resources

C++ 28,229 839 Updated Nov 11, 2025

MLX: An array framework for Apple silicon

C++ 22,778 1,382 Updated Nov 10, 2025

A modern, C++-native, test framework for unit-tests, TDD and BDD - using C++14, C++17 and later (C++11 support is in v2.x branch, and C++03 on the Catch1.x branch)

C++ 19,947 3,165 Updated Nov 9, 2025

Tensor library for machine learning

C++ 13,531 1,386 Updated Nov 9, 2025

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

C++ 12,101 1,856 Updated Nov 11, 2025

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 8,758 1,522 Updated Nov 10, 2025

Lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 6,610 573 Updated Nov 7, 2025

Performance-portable, length-agnostic SIMD with runtime dispatch

C++ 5,129 388 Updated Nov 5, 2025

Yosys Open SYnthesis Suite

C++ 4,119 1,001 Updated Nov 11, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 3,892 306 Updated Nov 11, 2025

The official repository for the gem5 computer-system architecture simulator.

C++ 2,286 1,575 Updated Nov 10, 2025

CUDA Library Samples

C++ 2,176 425 Updated Nov 6, 2025

Widelands is a free, open source real-time strategy game with singleplayer campaigns and a multiplayer mode. The game was inspired by Settlers II™ (© Bluebyte) but has significantly more variety an…

C++ 2,124 169 Updated Nov 10, 2025

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 1,942 149 Updated Nov 11, 2025

Patterns and behaviors for GPU computing

C++ 1,745 283 Updated Jun 26, 2022

nextpnr portable FPGA place and route tool

C++ 1,547 277 Updated Nov 10, 2025

TinyChatEngine: On-Device LLM Inference Library

C++ 922 95 Updated Jul 4, 2024

Deep learning toolkit-enabled VLSI placement

C++ 888 245 Updated Sep 27, 2025

Low-bit LLM inference on CPU/NPU with lookup table

C++ 886 74 Updated Jun 5, 2025

SystemVerilog compiler and language services

C++ 875 182 Updated Nov 11, 2025

[MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

C++ 776 54 Updated Mar 6, 2025

NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs

C++ 610 76 Updated Oct 14, 2025

OpenSTA engine

C++ 521 216 Updated Nov 7, 2025

SystemVerilog 2017 Pre-processor, Parser, Elaborator, UHDM Compiler. Provides IEEE Design/TB C/C++ VPI and Python AST & UHDM APIs. Compiles on Linux gcc, Windows msys2-gcc & msvc, OsX

C++ 425 77 Updated Sep 6, 2025

Universal Hardware Data Model. A complete modeling of the IEEE SystemVerilog Object Model with VPI Interface, Elaborator, Serialization, Visitor and Listener. Used as a compiled interchange format …

C++ 237 43 Updated Sep 6, 2025

Xplace 3.0: An Extremely Fast, Extensible and Deterministic Placement Framework with Detailed-Routability and Timing Optimization

C++ 143 16 Updated Jun 19, 2025

An FPGA accelerator for general-purpose Sparse-Matrix Dense-Matrix Multiplication (SpMM).

C++ 91 17 Updated Jul 26, 2024