Releases: p-e-w/heretic
Releases · p-e-w/heretic
v1.3.0
Changes
- @Vinay-Umrethe (who had previously contributed under the username @Vinayyyy7) implemented reproducible runs in #191. @p-e-w revised and improved that implementation in #303.
- @magiccodingman reduced peak VRAM usage in #239. @olekssy fixed a bug in that implementation in #301.
- @farolone added support for Qwen3.5 models in #187
- @MoonRide303 added support for Gemma 4 models in #287
- @erm14254 made sure all abliterable components across layers are displayed in #215
- @cpagac fixed VRAM usage reporting for multi-GPU setups in #169
- @cpagac fixed a division-by-zero error in the evaluator in #225
- @spikymoth improved automatic response prefix determination with a two-step process in #194
- @spikymoth added model card generation for local models with an existing README in #157
- @Diplo2by improved startup speed when Heretic is run with
-h/--helpin #293 - @AWuhrmann fixed the example value for the
max_memorysetting in #284 - @p-e-w added an integrated benchmarking system, made the response prefix logic configurable, implemented multiple infrastructure improvements, and fixed various minor issues
New Contributors
- @cpagac made their first contribution in #169
- @farolone made their first contribution in #187
- @erm14254 made their first contribution in #215
- @AWuhrmann made their first contribution in #284
- @MoonRide303 made their first contribution in #287
- @Diplo2by made their first contribution in #293
- @magiccodingman made their first contribution in #239
- @olekssy made their first contribution in #301
Full Changelog: v1.2.0...v1.3.0
v1.2.0
Changes
- @noctrex added a
max_memorysetting to limit memory usage in #83 - @spikymoth added a mechanism to avoid excessive low-divergence iteration in #73
- @accemlcc implemented a new LoRA-based abliteration engine with support for 4-bit quantization in #60
- @accemlcc added enumeration of all available GPUs on startup in #86
- @Vinayyyy7 added the ability to run more trials after optimization is complete in #76
- @anrp fixed MXFP4 loading in #107
- @anrp refactored the save machinery in #110
- @anrp added broad support for VL models in #108
- @anrp implemented saving and resuming optimization progress in #106, #119, and #116
- @spikymoth implemented Magnitude-Preserving Orthogonal Ablation in #52
- @salmanmkc upgraded GitHub Actions to the latest versions in #136 and #137
- @p-e-w added full type checking of the codebase, debug output, prompt modification functionality, and an example config file for slop reduction, and fixed various minor issues
New Contributors
- @noctrex made their first contribution in #83
- @accemlcc made their first contribution in #60
- @anrp made their first contribution in #107
- @salmanmkc made their first contribution in #136
Full Changelog: v1.1.0...v1.2.0
v1.1.0
Changes
- @mbarnson added basic MPS (Apple Silicon) support in #5
- @red40maxxer reduced memory usage in #15
- @Ooooze added IBM Granite MoE support in #14
- @kldzj added multi-GPU support in #17 and #32
- @ricyoung fixed an error when Hugging Face user profile fields are missing in #20
- @tymat added support for MXFP4 quantized models with Triton tensors in #28
- @spikymoth improved support for loading local datasets in #33
- @kldzj added support for models that require
trust_remote_codein #31 - @Vinayyyy7 added notebook (Colab/Kaggle) compatibility in #42
- @Vinayyyy7 fixed loading for certain models that default to the float32 dtype in #44
- @spikymoth improved refusal detection in #45
- @red40maxxer added a PR title lint to CI in #66
- @p-e-w added research features, support for stopping the optimization process early, and support for thinking models, and implemented an important padding fix suggested by @accemlcc
New Contributors
- @mbarnson made their first contribution in #5
- @red40maxxer made their first contribution in #15
- @Ooooze made their first contribution in #14
- @kldzj made their first contribution in #17
- @ricyoung made their first contribution in #20
- @tymat made their first contribution in #28
- @spikymoth made their first contribution in #33
- @Vinayyyy7 made their first contribution in #42
Full Changelog: v1.0.1...v1.1.0