Agents Autojudge: Evaluate team answers using scoring rules and generate CSV and JSON results
Agents Generated Judge: A platform for the evaluation of LLM-generated answers