language: en license: mit tags: - evaluation - llm - benchmarks
A model focused on evaluating the quality and correctness of LLM outputs.