Visualize LLM evaluation results and compare metrics
Generate conversational responses in multiple languages