The BERDS Benchmark aims to measure retrieval diversity for questions that are opinionated or invite diverse perspectives.
Hung-Ting Chen
timchen0618
·
AI & ML interests
NLP
Recent Activity
updated a dataset 1 day ago
timchen0618/browsecomp-plus-sel-tools-test300-random-seed7-v1 published a dataset 1 day ago
timchen0618/browsecomp-plus-sel-tools-test300-random-seed7-v1 updated a dataset 1 day ago
timchen0618/browsecomp-plus-sel-tools-test300-random-seed6-v1