Skip to content
View Gyu1291's full-sized avatar
🚀
Let's rocket
🚀
Let's rocket
  • KAIST
  • Seoul, Korea
  • 03:12 (UTC +09:00)

Block or report Gyu1291

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

41 stars written in C++
Clear filter

LLM inference in C/C++

C++ 89,453 13,618 Updated Nov 9, 2025

A library for efficient similarity search and clustering of dense vectors.

C++ 37,848 4,102 Updated Nov 8, 2025

The Serenity Operating System 🐞

C++ 32,570 3,271 Updated Nov 9, 2025

MLX: An array framework for Apple silicon

C++ 22,761 1,383 Updated Nov 8, 2025

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

C++ 12,080 1,850 Updated Nov 9, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,448 958 Updated Oct 24, 2025

Matter (formerly Project CHIP) creates more connections between more objects, simplifying development for manufacturers and increasing compatibility for consumers, guided by the Connectivity Standa…

C++ 8,327 2,279 Updated Nov 9, 2025

The BusTub Relational Database Management System (Educational)

C++ 4,691 1,961 Updated Oct 22, 2025

The official distribution of olcPixelGameEngine, a tool used in javidx9's YouTube videos and projects

C++ 4,036 918 Updated Sep 26, 2025

The official repository for the gem5 computer-system architecture simulator.

C++ 2,281 1,574 Updated Nov 7, 2025

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,844 245 Updated Nov 4, 2025

GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as…

C++ 1,486 583 Updated Feb 15, 2025

3D Procedural Game Engine Using OpenGL

C++ 1,309 104 Updated Nov 9, 2025

Fast Multimodal LLM on Mobile Devices

C++ 1,167 141 Updated Nov 8, 2025

[MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

C++ 775 54 Updated Mar 6, 2025

Custom programming interpreter for ZSharp (Z#), a custom game programming language I made

C++ 713 79 Updated Feb 20, 2024

Portable RISC-V System-on-Chip implementation: RTL, debugger and simulators

C++ 675 110 Updated Jul 16, 2025

A Fast and Extensible DRAM Simulator, with built-in support for modeling many different DRAM technologies including DDRx, LPDDRx, GDDRx, WIOx, HBMx, and various academic proposals. Described in the…

C++ 667 215 Updated Aug 29, 2023

An x86 monolithic kernel and operating system written in modern C++. Comes with in-house graphical applications and command line utilities, plus ports of existing software. And yes, it runs DOOM!

C++ 644 29 Updated Sep 9, 2024

The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resources in your applications.

C++ 467 62 Updated Oct 29, 2025

DRAMsim3: a Cycle-accurate, Thermal-Capable DRAM Simulator

C++ 419 175 Updated Aug 3, 2024

Minimal example of animating the HTML5 canvas from C++ using OpenGL through WebAssembly

C++ 366 53 Updated Jun 15, 2020

ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference

C++ 163 28 Updated Feb 10, 2025

Advanced Matrix Extensions (AMX) Guide

C++ 105 8 Updated Jan 11, 2022
C++ 71 5 Updated Aug 2, 2024

A Reconfigurable Accelerator with Data Reordering Support for Low-Cost On-Chip Dataflow Switching

C++ 70 11 Updated Nov 1, 2025

Repo for SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting (ISCA25)

C++ 67 8 Updated Apr 25, 2025

mNPUsim: A Cycle-accurate Multi-core NPU Simulator (IISWC 2023)

C++ 65 11 Updated Oct 30, 2025

A simulator for SK hynix AiM PIM architecture based on Ramulator 2.0

C++ 42 8 Updated Jul 22, 2025

A Cycle-level simulator for M2NDP

C++ 32 6 Updated Aug 14, 2025
Next