Benchmarking Cognitive Biases in Large Language Models as Evaluators Paper • 2309.17012 • Published Sep 29, 2023 • 3
Benchmarking Cognitive Biases in Large Language Models as Evaluators Paper • 2309.17012 • Published Sep 29, 2023 • 3
Under the Surface: Tracking the Artifactuality of LLM-Generated Data Paper • 2401.14698 • Published Jan 26, 2024
Read, Revise, Repeat: A System Demonstration for Human-in-the-loop Iterative Text Revision Paper • 2204.03685 • Published Apr 7, 2022