LLM agent safety, multi-turn red-teaming, jailbreak benchmarks, adversarial robu
- Large language model (LLM) agents are increasingly proposed as supervisory components for safety-critical systems, yet t
- We present NRT-Bench, a benchmark for multi-turn red-teaming of LLM agents acting as op