mats

non-profit
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

mats-10-sprint-cs-jb 's collections 6

five-CoT-faith
Datasets from the five-CoT-faith benchmark for evaluating chain-of-thought faithfulness monitoring across five diverse tasks.