S-Eval: Automatic and Adaptive Test Generation for Benchmarking Safety Evaluation of Large Language Models Paper β’ 2405.14191 β’ Published May 23, 2024 β’ 1