← Back to feed

heretic

GitHub Repo Pretty sure · ethics aren't our call
https://github.com/p-e-w/heretic

Automated jailbreak factory with solid technical chops, but let's not pretend the ethics here are anything but a choice.

15%
25%
60%
Slop 15%Signal 25%Science 60%

Heretic implements actual research (abliteration via directional residual manipulation + Optuna-based hyperparameter search) with reproducible methodology. The code exists, the math is sound, and the ablation approach is non-trivial. BUT: this is fundamentally a tool to strip safety guardrails from LLMs at scale, framed as 'censorship removal.' The framing is doing heavy lifting—'safety alignment' and 'censorship' are not synonyms, though the repo treats them as such. The technical signal is ...

14538 stars Python 2026-03-15 175 days old

Become a MFer to rate — log in