AIRiskDilemmas kellycyy/AIRiskDilemmas Viewer • Updated May 21, 2025 • 42.6k • 114 • 3 Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas Paper • 2505.14633 • Published May 20, 2025 • 4
Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas Paper • 2505.14633 • Published May 20, 2025 • 4
CulturalBench A Robust, Diverse and Challegning Benchmark for Measuring Cultural Knowledge of LLMs CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs Paper • 2410.02677 • Published Oct 3, 2024 • 1 kellycyy/CulturalBench Viewer • Updated Oct 14, 2024 • 6.14k • 146 • 11
CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs Paper • 2410.02677 • Published Oct 3, 2024 • 1
DailyDilemmas kellycyy/daily_dilemmas Viewer • Updated Oct 15, 2024 • 17.7k • 3.46k • 9 DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life Paper • 2410.02683 • Published Oct 3, 2024
DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life Paper • 2410.02683 • Published Oct 3, 2024
AIRiskDilemmas kellycyy/AIRiskDilemmas Viewer • Updated May 21, 2025 • 42.6k • 114 • 3 Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas Paper • 2505.14633 • Published May 20, 2025 • 4
Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas Paper • 2505.14633 • Published May 20, 2025 • 4
DailyDilemmas kellycyy/daily_dilemmas Viewer • Updated Oct 15, 2024 • 17.7k • 3.46k • 9 DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life Paper • 2410.02683 • Published Oct 3, 2024
DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life Paper • 2410.02683 • Published Oct 3, 2024
CulturalBench A Robust, Diverse and Challegning Benchmark for Measuring Cultural Knowledge of LLMs CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs Paper • 2410.02677 • Published Oct 3, 2024 • 1 kellycyy/CulturalBench Viewer • Updated Oct 14, 2024 • 6.14k • 146 • 11
CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs Paper • 2410.02677 • Published Oct 3, 2024 • 1