AI/ML Engineer and Independent Researcher from Kalamazoo, Michigan.
- in/zwmaronek
- https://substack.com/@zachmania
- zwmaronek
Pinned
- Beyond-Early-Exit (Public, Python)
  Beyond Early Exit: Solving GPU Warp Divergence in Adaptive LLM Inference with Micro-Batched Routing. Author: Zachary Maronek. Date: January 2026.
- particleblood-rust-gpu (Public, Rust)
  Rust/WebGL2 GPU particle simulation variant of particleblood.
- ple-coded-gguf (Public, Python)
  PLE-Coded GGUF: exploiting Gemma E4B's per-layer embeddings as a compression side-channel.
- hgsel-moe (Public, Python)
  Hash-based Gradient-guided Sparse Expert Layer: a deterministic, production-grade Sparse Mixture of Experts (MoE) architecture for dense Transformers.