Gyu1291

Follow

🚀

Let's rocket

0xMint Gyu1291

🚀

Let's rocket

Follow

Hobby programmer

17 followers · 40 following

KAIST
Seoul, Korea
20:09 (UTC +09:00)

Achievements

Achievements

Starred repositories

109 stars written in Python

VIA-Research / vTrain

Python 73 15 Updated May 27, 2025

sail-sg / LongSpec

LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification

Python 67 3 Updated Jul 14, 2025

ruikangliu / Quantized-Reasoning-Models

[COLM 2025] Official PyTorch implementation of "Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models"

Python 57 3 Updated Jul 8, 2025

7shoe / AdaParse

Forked from ramanathanlab/pdfwf

Adaptive Parallel PDF Parsing and Resource Scaling Engine

Python 56 14 Updated Oct 23, 2025

abdelfattah-lab / BitMoD-HPCA-25

Python 51 10 Updated Jul 19, 2025

bespoke-silicon-group / bsg_bladerunner

Meta-Repository for Bespoke Silicon Group's Manycore Architecture (A.K.A HammerBlade)

Python 43 20 Updated Jun 16, 2025

LumenPallidium / jepa

Experiments in Joint Embedding Predictive Architectures (JEPAs).

Python 43 10 Updated Jan 5, 2024

Dapid / tmscoring

Python implementation of the TMscore program

Python 43 13 Updated Apr 29, 2024

PSAL-POSTECH / PyTorchSim

PyTorchSim is a Comprehensive, Fast, and Accurate NPU Simulation Framework

Python 42 3 Updated Nov 6, 2025

1202kbs / DMCMC

Official PyTorch implementation of "Denoising MCMC for Accelerating Diffusion-Based Generative Models", ICML 2023 Oral Paper

Python 31 4 Updated Sep 14, 2023

justADeni / intel-npu-llm

A simple Python script for running LLMs on Intel's Neural Processing Units (NPUs)

Python 26 1 Updated Oct 17, 2025

tenstorrent / vllm

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 25 12 Updated Nov 7, 2025

CLab-HKUST-GZ / micro58-axcore

Python 20 4 Updated Oct 21, 2025

GusLovesMath / Local_LLM_Training_Apple_Silicon

Created and enhanced a local LLM training system on Apple Silicon with MLX and Metal API, overcoming the absence of CUDA support. Fine-tuned the Llama3 model on 16 GPUs for streamlined solution of …

Python 20 5 Updated May 29, 2024

casys-kaist / DaCapo

Python 19 Updated Nov 5, 2024

LLMkvsys / rethink-kv-compression

Python 19 1 Updated Mar 7, 2025

krafton-ai / lexico

KV cache compression via sparse coding

Python 14 2 Updated Oct 26, 2025

georgia-tech-synergy-lab / MicroScopiQ-LLM-Quantization

[ISCA 2025] Official Implementation of "MicroScopiQ: Accelerating Foundational Models through Outlier-Aware Microscaling Quantization"

Python 12 1 Updated Oct 30, 2025

yc2367 / BBS-MICRO

Python 12 2 Updated Nov 11, 2024

Starred topics

LaTeX

Google

Go

Docker

Pixel Art

Linux

IPFS

Electron

C#

C++

See all starred topics