MoReBench: Evaluating Procedural and Pluralistic Moral Reasoning in Language Models, More than Outcomes Paper • 2510.16380 • Published Oct 18, 2025 • 1 • 2
Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas Paper • 2505.14633 • Published May 20, 2025 • 4 • 2