Enhance Basic Agent Evaluation Runner with improved error handling, logging, and user instructions. Refactor question fetching and submission processes for better clarity and robustness. 5ea91d8 Tingusto commited on May 14