Tracks perf of LLMs, VLMs and agents on web navigation tasks
Explore agent trajectories and judgments in web benchmarks
Convert Hebrew text to speech with pronunciation control