heretic

Author	SHA1	Message	Date
Spiky Moth	9d1734855d	feat: avoid excessive low divergence iteration (#73) * feat: adjust scoring to avoid useless iteration Adjusts the scoring function to avoid targeting meaninglessly low KL divergences. Below a threshold value, the KL divergence score switches to the refusal count. Adds config option kl_divergence_target (defaulting to 0.01). * fix: Clean up parameter selection in objective Create variables for num_layers and last_layer_index * Improves readability and makes choices explicit * feat: Print the parameters of the selected model	2025-12-14 14:26:48 +05:30
George	740aab61ba	feat: add max_memory parameter to limit memory usage (#83) * add max_memory parameter to limit memory usage * Added to reload_model also * forgot to add self * Process max_memory once in __init__ and store it as an instance variable, then reuse it in both locations	2025-12-11 20:57:40 +05:30
Philipp Emanuel Weidmann	ffbde3ac2a	fix: follow up after recent PRs	2025-12-07 10:26:16 +05:30
Philipp Emanuel Weidmann	eeb28b28c1	feat: add option to plot residual vectors	2025-12-04 14:22:29 +05:30
Spiky Moth	1f74ac2888	Guard against refusals in broken English (#45) * Guard against refusals in broken English * Normalize whitespace between words	2025-11-26 11:29:08 +05:30
Philipp Emanuel Weidmann	83cbf0612a	Add option to print refusal geometry	2025-11-22 13:18:54 +05:30
Philipp Emanuel Weidmann	8a1aceff11	Switch to multi-objective optimization	2025-11-14 18:04:23 +05:30
Philipp Emanuel Weidmann	fae39ffb89	Move default configuration to Python	2025-11-02 09:29:55 +05:30
Philipp Emanuel Weidmann	a24e6eba96	Improve optimization	2025-10-31 16:04:28 +05:30
Philipp Emanuel Weidmann	c638d3d012	Adjust score parameters	2025-10-25 13:15:31 +05:30
Philipp Emanuel Weidmann	e6aba71186	Improve refusal detection	2025-10-24 11:27:28 +05:30
Philipp Emanuel Weidmann	7caf9fcdc5	Separate training and evaluation prompts	2025-10-09 12:51:31 +05:30
Philipp Emanuel Weidmann	c447805fc2	Improve default dtype configuration	2025-09-23 13:31:41 +05:30
Philipp Emanuel Weidmann	1b37160490	Fix model loading issues	2025-09-21 16:04:41 +05:30
Philipp Emanuel Weidmann	af19fbd254	Initial commit	2025-09-21 11:10:30 +05:30

15 Commits
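
The scoring change described in commit 9d1734855d can be illustrated with a small sketch. This is not the project's code: the function name `kl_objective`, the `base_refusals` argument, and the normalization of the refusal count are assumptions made for illustration. Only the option name `kl_divergence_target` and its default of 0.01 come from the commit message.

```python
def kl_objective(
    kl_divergence: float,
    refusals: int,
    base_refusals: int,
    kl_divergence_target: float = 0.01,  # option name and default from the commit message
) -> float:
    """Hypothetical KL objective that stops rewarding very low divergences.

    At or above the target, the objective is the KL divergence itself.
    Below the target, it switches to a refusal-based value, so pushing the
    divergence even lower no longer improves this objective.
    """
    if base_refusals <= 0:
        raise ValueError("base_refusals must be positive")

    if kl_divergence >= kl_divergence_target:
        return kl_divergence

    # Assumption: the refusal count is normalized by the refusal count of
    # the unmodified model. The commit only says the score "switches to
    # the refusal count".
    return refusals / base_refusals


if __name__ == "__main__":
    # Above the target: the divergence is scored directly.
    print(kl_objective(0.5, refusals=10, base_refusals=100))
    # Below the target: two different divergences get the same score.
    print(kl_objective(0.009, refusals=10, base_refusals=100))
    print(kl_objective(0.001, refusals=10, base_refusals=100))
```

Under these assumptions, two candidates that are both below the target differ only by how many refusals remain, which matches the commit's stated goal of avoiding iteration toward meaninglessly low KL divergences.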