Commit Graph

  • ca783db6c9 docs: update README Philipp Emanuel Weidmann 2025-12-10 16:30:35 +05:30
  • 6acccac994 feat: add progress bars for plotting operations Philipp Emanuel Weidmann 2025-12-10 13:07:34 +05:30
  • ac154a55a0 fix: suppress CoT output for thinking models Philipp Emanuel Weidmann 2025-12-09 11:54:08 +05:30
  • 15781a8a0c fix: skip common response prefix for thinking models Philipp Emanuel Weidmann 2025-12-09 08:25:10 +05:30
  • 24c3aeb442 feat: turn boolean settings into CLI flags Philipp Emanuel Weidmann 2025-12-07 11:37:07 +05:30
  • ffbde3ac2a fix: follow up after recent PRs Philipp Emanuel Weidmann 2025-12-07 10:26:16 +05:30
  • 932d737edf feat: add silhouette coefficient to residual geometry output Philipp Emanuel Weidmann 2025-12-07 08:48:38 +05:30
  • 1f5e977f4f Revert "perf: optimize abliteration matrix op (#46)" (#74) Philipp Emanuel Weidmann 2025-12-07 06:30:37 +05:30
  • da27ba8054 fix: always left-pad inputs, and avoid optimizing for empty responses Philipp Emanuel Weidmann 2025-12-06 06:31:09 +05:30
  • baf5b0b0d1 feat: add geometric median to residual geometry output Philipp Emanuel Weidmann 2025-12-05 20:15:50 +05:30
  • eeb28b28c1 feat: add option to plot residual vectors Philipp Emanuel Weidmann 2025-12-04 14:22:29 +05:30
  • d836fb2da9 ci: add PR title lint (#66) red40maxxer 2025-12-02 22:55:48 -05:00
  • 60bd531fde perf: optimize abliteration matrix op (#46) red40maxxer 2025-12-01 21:43:43 -05:00
  • 1f74ac2888 Guard against refusals in broken English (#45) Spiky Moth 2025-11-26 06:59:08 +01:00
  • 63fc0e7d5a feat: Add bfloat16 to default dtypes list (#44) _Vinayyyy_ 2025-11-25 12:22:52 +05:30
  • 1efc4ee9e1 Featuring Notebook (Colab/Kaggle) Compatibility (#42) _Vinayyyy_ 2025-11-24 19:46:39 +05:30
  • 452b35e7b7 Add trust_remote_code configuration option (#31) Nikolai Kolodziej 2025-11-24 01:57:44 +01:00
  • b79b8b1475 Improve support for loading local datasets (#33) Spiky Moth 2025-11-23 06:45:34 +01:00
  • 83cbf0612a Add option to print refusal geometry Philipp Emanuel Weidmann 2025-11-22 13:18:54 +05:30
  • c35f3031f8 Allow stopping the optimization process early with Ctrl+C Philipp Emanuel Weidmann 2025-11-21 10:11:00 +05:30
  • 2e1bb4b655 Use PYTORCH_ALLOC_CONF instead of deprecated PYTORCH_CUDA_ALLOC_CONF (#32) Nikolai Kolodziej 2025-11-21 02:57:28 +01:00
  • af02bc6ece Fix support for MXFP4 quantized models with Triton tensors (#28) Anthony Eufemio 2025-11-19 22:13:06 -10:00
  • 22a4a5b5b5 Add citation information to README Philipp Emanuel Weidmann 2025-11-19 12:14:17 +05:30
  • 694edf18d3 Follow up after recent PRs Philipp Emanuel Weidmann 2025-11-19 11:19:47 +05:30
  • c9c022a143 Fix linting issues Philipp Emanuel Weidmann 2025-11-19 10:16:58 +05:30
  • 9905d9517f Fix formatting issues Philipp Emanuel Weidmann 2025-11-19 10:04:43 +05:30
  • f06e939791 Add Ruff as a dev dependency Philipp Emanuel Weidmann 2025-11-19 09:59:18 +05:30
  • f3b9826ca4 Add CI workflow Philipp Emanuel Weidmann 2025-11-19 09:45:54 +05:30
  • 13bb7b24d6 Fix KeyError when HuggingFace user profile fields are missing (#20) Richard Young, PhD 2025-11-18 16:02:50 -08:00
  • c8b6663b93 Fix multi-GPU support and memory management (#17) Nikolai Kolodziej 2025-11-19 00:39:12 +01:00
  • 61fdf72b42 Add support for Granite MoE Hybrid in model.py by including down projections for shared MLP and MoE experts (#14) Ooze 2025-11-18 06:02:58 +03:00
  • 7bad84b4f1 perf: clear residuals after computing direction (#15) red40maxxer 2025-11-17 11:48:22 -05:00
  • 09730bad70 MPS support (#5) Matt Barnson 2025-11-17 05:12:01 -08:00
  • b3545e4b1e Fix retrieving package version v1.0.1 Philipp Emanuel Weidmann 2025-11-16 17:35:13 +05:30
  • 3f346b6150 Change package name Philipp Emanuel Weidmann 2025-11-16 17:01:50 +05:30
  • 1a59d226c1 Fix spacing after images in README Philipp Emanuel Weidmann 2025-11-16 16:06:08 +05:30
  • 12ecf50033 Add README Philipp Emanuel Weidmann 2025-11-16 15:19:27 +05:30
  • ea699dce46 Improve appearance of selection menus Philipp Emanuel Weidmann 2025-11-16 11:32:58 +05:30
  • 8a1aceff11 Switch to multi-objective optimization Philipp Emanuel Weidmann 2025-11-14 18:04:23 +05:30
  • 0bae27f359 Fix some of the problems with Falcon-E-3B Philipp Emanuel Weidmann 2025-11-13 11:39:08 +05:30
  • e24080db64 Add metadata to pyproject.toml Philipp Emanuel Weidmann 2025-11-02 10:06:15 +05:30
  • fae39ffb89 Move default configuration to Python Philipp Emanuel Weidmann 2025-11-02 09:29:55 +05:30
  • 850c21b534 Make multivariate TPE work properly Philipp Emanuel Weidmann 2025-11-01 16:57:12 +05:30
  • a24e6eba96 Improve optimization Philipp Emanuel Weidmann 2025-10-31 16:04:28 +05:30
  • a9655c8d31 Perform calculations involving residual vectors in float32 Philipp Emanuel Weidmann 2025-10-31 13:47:24 +05:30
  • 1496e0a04c Dynamically choose between global and per-layer refusal directions Philipp Emanuel Weidmann 2025-10-31 13:04:45 +05:30
  • c638d3d012 Adjust score parameters Philipp Emanuel Weidmann 2025-10-25 13:15:31 +05:30
  • 47e855d5d8 Guard against missing model card data Philipp Emanuel Weidmann 2025-10-25 13:12:18 +05:30
  • e2419de016 Add "abliterated" to model tags Philipp Emanuel Weidmann 2025-10-25 09:59:44 +05:30
  • ad8b04d371 Bump version to 1.0.0 Philipp Emanuel Weidmann 2025-10-25 09:52:43 +05:30
  • 37c5ea06d1 Print elapsed and remaining time Philipp Emanuel Weidmann 2025-10-25 09:47:54 +05:30
  • cf57a0cfbe Add functionality to evaluate any model relative to the main model Philipp Emanuel Weidmann 2025-10-24 13:38:03 +05:30
  • e6aba71186 Improve refusal detection Philipp Emanuel Weidmann 2025-10-24 11:27:28 +05:30
  • f8f3f9a012 Fix chat responses being cut off Philipp Emanuel Weidmann 2025-10-22 12:30:28 +05:30
  • 6359aa44bb Separate abliteration parameters for different layer components Philipp Emanuel Weidmann 2025-10-22 12:05:28 +05:30
  • ed65d6902b Support gpt-oss MoE Philipp Emanuel Weidmann 2025-10-15 17:51:39 +05:30
  • 7ed0cb1ffb Support Phi-3.5-MoE Philipp Emanuel Weidmann 2025-10-14 11:23:53 +05:30
  • 8b827ee386 Support multimodal models Philipp Emanuel Weidmann 2025-10-14 10:32:34 +05:30
  • dd7abd3296 Add hf_transfer to dependencies Philipp Emanuel Weidmann 2025-10-14 07:56:43 +05:30
  • 3d5e645c13 Handle Ctrl+C gracefully Philipp Emanuel Weidmann 2025-10-12 12:53:40 +05:30
  • 74b55977f0 Pretty-print configuration errors Philipp Emanuel Weidmann 2025-10-12 10:39:59 +05:30
  • b4a0c0d3f2 Add program version to generated README intro Philipp Emanuel Weidmann 2025-10-11 17:31:11 +05:30
  • 7caf9fcdc5 Separate training and evaluation prompts Philipp Emanuel Weidmann 2025-10-09 12:51:31 +05:30
  • 2ff8dcba6b Add model card when uploading to Hugging Face Philipp Emanuel Weidmann 2025-09-30 08:43:21 +05:30
  • 5b01ad4344 Add save and upload functionality Philipp Emanuel Weidmann 2025-09-27 11:15:41 +05:30
  • 7573a2eebd Support passing model name without "--model" argument prefix Philipp Emanuel Weidmann 2025-09-25 15:02:22 +05:30
  • fd0fa52552 Add chat functionality Philipp Emanuel Weidmann 2025-09-24 18:09:23 +05:30
  • f00d35dc46 Improve early abort score calculation Philipp Emanuel Weidmann 2025-09-23 19:02:00 +05:30
  • 3f242369e0 Add educated guesses for parameter values to get the optimizer started Philipp Emanuel Weidmann 2025-09-23 16:00:20 +05:30
  • c447805fc2 Improve default dtype configuration Philipp Emanuel Weidmann 2025-09-23 13:31:41 +05:30
  • b6c715ab6f Abort trial early if KL divergence is too high Philipp Emanuel Weidmann 2025-09-23 13:20:31 +05:30
  • 9485edc221 Support Qwen3 MoE Philipp Emanuel Weidmann 2025-09-22 15:22:48 +05:30
  • 1b37160490 Fix model loading issues Philipp Emanuel Weidmann 2025-09-21 16:04:41 +05:30
  • af19fbd254 Initial commit Philipp Emanuel Weidmann 2025-09-21 11:10:30 +05:30