Run benchmark with a CodeAgent that uses gpt 4 turbo and the following tools: PythonInterpreter, GoogleSearch, VisitWebpage and Final_Answer. Achieve 30% 04889fc
Sandra Sanchez commited on