HALO: Hardware-aware quantization with low critical-path-delay weights for LLM acceleration Accepted in AAAI 2026 (Oral) Code will be released soon.