Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".
Updated Nov 20, 2025 - Python
Post-mortem debugger for LLM training loss spikes. Records per-layer gradients, activations, and weight distributions — scrub back to the exact step that caused the divergence.
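To make the "scrub back to the exact step" idea concrete, here is a minimal sketch of recording per-layer gradient norms and then searching the recorded history for the earliest anomalous step. It is not this tool's actual API: the `GradNormRecorder` class, its `record` and `first_anomaly` methods, the layer names, and the median-based threshold (`factor`, `warmup`) are all hypothetical choices made for illustration. A real debugger would also capture activations and weight distributions, which this sketch omits.

```python
from collections import deque
from statistics import median


class GradNormRecorder:
    """Hypothetical helper: keeps the last `capacity` steps of per-layer gradient norms."""

    def __init__(self, capacity=1000):
        # Each entry is (step, {layer_name: grad_norm}); old entries fall off the left.
        self.history = deque(maxlen=capacity)

    def record(self, step, layer_norms):
        # Copy the mapping so later mutation by the caller does not alter history.
        self.history.append((step, dict(layer_norms)))

    def first_anomaly(self, factor=10.0, warmup=5):
        """Return (step, layer) for the earliest recorded norm that exceeds
        `factor` times the median of that layer's earlier norms, or None."""
        seen = {}  # layer_name -> list of norms from earlier steps
        for step, norms in self.history:
            for layer, value in norms.items():
                past = seen.setdefault(layer, [])
                if len(past) >= warmup and value > factor * median(past):
                    return step, layer
                past.append(value)
        return None


# Usage with synthetic numbers (assumed data, not real training output):
rec = GradNormRecorder()
for step in range(20):
    norms = {"embed": 1.0, "block0.attn": 0.5}
    if step == 12:
        norms["block0.attn"] = 50.0  # injected spike
    rec.record(step, norms)

print(rec.first_anomaly())  # (12, 'block0.attn')
```

The bounded `deque` keeps memory use fixed over a long run, at the cost of only being able to look back `capacity` steps. Comparing against the median of earlier values, rather than the mean, keeps a single earlier outlier from inflating the baseline.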