Sleeping
Mining Change Detection
🚀
Streamlit template space
Streamlit template space
Evaluate AI agent responses with detailed metrics
Evaluate AI agent responses and generate detailed reports
Evaluating AI agents
Evaluate AI agent responses with detailed scoring and visualizations
Agentic Large-scale Evaluation & Analysis Framework (A-LEAF)
Evaluate agent responses with CSV/JSON data
Evaluate datasets with metrics and visualizations
Agent evaluation framework
Agentic evaluation framework