feat: add episode trace, refresh training dataset, and update eval metrics a422c8d Mohammed-Altaf committed 27 days ago