Evaluate model responses for accuracy and factual errors
my space
Explore and evaluate RAG models with questions and metrics
Gradio MH4