Skip to content

Releases: p-e-w/heretic

v1.3.0

05 May 14:36
9b7624d

Choose a tag to compare

Changes

  • @Vinay-Umrethe (who had previously contributed under the username @Vinayyyy7) implemented reproducible runs in #191. @p-e-w revised and improved that implementation in #303.
  • @magiccodingman reduced peak VRAM usage in #239. @olekssy fixed a bug in that implementation in #301.
  • @farolone added support for Qwen3.5 models in #187
  • @MoonRide303 added support for Gemma 4 models in #287
  • @erm14254 made sure all abliterable components across layers are displayed in #215
  • @cpagac fixed VRAM usage reporting for multi-GPU setups in #169
  • @cpagac fixed a division-by-zero error in the evaluator in #225
  • @spikymoth improved automatic response prefix determination with a two-step process in #194
  • @spikymoth added model card generation for local models with an existing README in #157
  • @Diplo2by improved startup speed when Heretic is run with -h/--help in #293
  • @AWuhrmann fixed the example value for the max_memory setting in #284
  • @p-e-w added an integrated benchmarking system, made the response prefix logic configurable, implemented multiple infrastructure improvements, and fixed various minor issues

New Contributors

Full Changelog: v1.2.0...v1.3.0

v1.2.0

14 Feb 13:45
27097bf

Choose a tag to compare

Changes

  • @noctrex added a max_memory setting to limit memory usage in #83
  • @spikymoth added a mechanism to avoid excessive low-divergence iteration in #73
  • @accemlcc implemented a new LoRA-based abliteration engine with support for 4-bit quantization in #60
  • @accemlcc added enumeration of all available GPUs on startup in #86
  • @Vinayyyy7 added the ability to run more trials after optimization is complete in #76
  • @anrp fixed MXFP4 loading in #107
  • @anrp refactored the save machinery in #110
  • @anrp added broad support for VL models in #108
  • @anrp implemented saving and resuming optimization progress in #106, #119, and #116
  • @spikymoth implemented Magnitude-Preserving Orthogonal Ablation in #52
  • @salmanmkc upgraded GitHub Actions to the latest versions in #136 and #137
  • @p-e-w added full type checking of the codebase, debug output, prompt modification functionality, and an example config file for slop reduction, and fixed various minor issues

New Contributors

Full Changelog: v1.1.0...v1.2.0

v1.1.0

10 Dec 12:24
d9f2b04

Choose a tag to compare

Changes

  • @mbarnson added basic MPS (Apple Silicon) support in #5
  • @red40maxxer reduced memory usage in #15
  • @Ooooze added IBM Granite MoE support in #14
  • @kldzj added multi-GPU support in #17 and #32
  • @ricyoung fixed an error when Hugging Face user profile fields are missing in #20
  • @tymat added support for MXFP4 quantized models with Triton tensors in #28
  • @spikymoth improved support for loading local datasets in #33
  • @kldzj added support for models that require trust_remote_code in #31
  • @Vinayyyy7 added notebook (Colab/Kaggle) compatibility in #42
  • @Vinayyyy7 fixed loading for certain models that default to the float32 dtype in #44
  • @spikymoth improved refusal detection in #45
  • @red40maxxer added a PR title lint to CI in #66
  • @p-e-w added research features, support for stopping the optimization process early, and support for thinking models, and implemented an important padding fix suggested by @accemlcc

New Contributors

Full Changelog: v1.0.1...v1.1.0

v1.0.1

16 Nov 13:00
b3545e4

Choose a tag to compare

First public release