Friday, October 24, 2025
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal