Thursday, January 8, 2026
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal