Skip to content
View jepeake's full-sized avatar

Highlights

  • Pro

Block or report jepeake

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
83 stars written in Python
Clear filter

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Python 1,080 128 Updated Oct 7, 2024

Dataflow compiler for QNN inference on FPGAs

Python 897 278 Updated Nov 13, 2025

LM Studio Apple MLX engine

Python 817 70 Updated Nov 13, 2025

A Library for Differentiable Logic Gate Networks

Python 743 85 Updated Mar 19, 2024

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Python 708 52 Updated Aug 6, 2025

Model Compression Toolbox for Large Language Models and Diffusion Models

Python 696 67 Updated Aug 14, 2025

KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA (+ more DSLs)

Python 661 86 Updated Nov 11, 2025

✨ Elevate your GitHub Profile ReadMe with Minimalistic Retro Terminal GIFs 🚀

Python 659 24 Updated Nov 8, 2024
Python 548 42 Updated Dec 16, 2024

Train high-quality text-to-image diffusion models in a data & compute efficient manner

Python 509 35 Updated Mar 27, 2025

SymbiYosys (sby) -- Front-end for Yosys-based formal verification flows

Python 479 83 Updated Nov 11, 2025

Kernel Tuner

Python 372 59 Updated Nov 12, 2025

[ACL 2025 Main] EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

Python 310 22 Updated May 22, 2025

A Text-Based Environment for Interactive Debugging

Python 276 37 Updated Nov 11, 2025

1.58 Bit LLM on Apple Silicon using MLX

Python 225 28 Updated May 10, 2024

Machine-Learning Accelerator System Exploration Tools

Python 183 85 Updated Nov 1, 2025
Python 148 11 Updated Feb 15, 2025

[ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.

Python 127 17 Updated May 16, 2024

Async pipelined version of Verl

Python 125 13 Updated Apr 8, 2025

Perun is a Python package that measures the energy consumption of your applications.

Python 88 6 Updated Nov 10, 2025
Python 79 23 Updated Mar 10, 2025

MAGE: A Multi-Agent Engine for Automated RTL Code Generation

Python 72 15 Updated Apr 11, 2025

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 1 Updated Jul 23, 2025