Run agent evaluation and submit answers automatically
Generate a superhero party theme and ideas
Generate code and answers using an AI assistant