PingPong: A Natural Benchmark for Multi-Turn Code-Switching Dialogues Paper • 2601.17277 • Published 12 days ago • 5
Language Surgery in Multilingual Large Language Models Paper • 2506.12450 • Published Jun 14, 2025 • 16
Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability Paper • 2506.01789 • Published Jun 2, 2025 • 15
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published Mar 10, 2025 • 101
MetaMetrics: Calibrating Metrics For Generation Tasks Using Human Preferences Paper • 2410.02381 • Published Oct 3, 2024 • 1
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published Mar 10, 2025 • 101
Attention-Based LSTM for Psychological Stress Detection from Spoken Language Using Distant Supervision Paper • 1805.12307 • Published May 31, 2018 • 1