Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

Kelly Chiu's picture

Kelly Chiu

kellycyy

21world's profile picture

shuyuej's profile picture

theblackcat102's profile picture

·

https://kellycyy.github.io/

kellychiuyy

AI & ML interests

None yet

Organizations

kellycyy 's collections 3

kellycyy/AIRiskDilemmas

Viewer • Updated May 21, 2025 • 42.6k • 231 • 3
Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas

Paper • 2505.14633 • Published May 20, 2025 • 4

A Robust, Diverse and Challegning Benchmark for Measuring Cultural Knowledge of LLMs

CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs

Paper • 2410.02677 • Published Oct 3, 2024 • 1
kellycyy/CulturalBench

Viewer • Updated Oct 14, 2024 • 6.14k • 1.19k • 15

kellycyy/daily_dilemmas

Viewer • Updated Oct 15, 2024 • 17.7k • 175 • 10
DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life

Paper • 2410.02683 • Published Oct 3, 2024

kellycyy/AIRiskDilemmas

Viewer • Updated May 21, 2025 • 42.6k • 231 • 3
Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas

Paper • 2505.14633 • Published May 20, 2025 • 4

kellycyy/daily_dilemmas

Viewer • Updated Oct 15, 2024 • 17.7k • 175 • 10
DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life

Paper • 2410.02683 • Published Oct 3, 2024

A Robust, Diverse and Challegning Benchmark for Measuring Cultural Knowledge of LLMs

CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs

Paper • 2410.02677 • Published Oct 3, 2024 • 1
kellycyy/CulturalBench

Viewer • Updated Oct 14, 2024 • 6.14k • 1.19k • 15

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs