Update app.py
Browse files
app.py
CHANGED
|
@@ -173,14 +173,16 @@ with gr.Blocks() as demo:
|
|
| 173 |
gr.Markdown("# Basic Agent Evaluation Runner")
|
| 174 |
gr.Markdown(
|
| 175 |
"""
|
| 176 |
-
|
| 177 |
-
|
| 178 |
-
|
| 179 |
-
|
|
|
|
|
|
|
| 180 |
---
|
| 181 |
-
|
| 182 |
-
|
| 183 |
-
|
| 184 |
"""
|
| 185 |
)
|
| 186 |
|
|
|
|
| 173 |
gr.Markdown("# Basic Agent Evaluation Runner")
|
| 174 |
gr.Markdown(
|
| 175 |
"""
|
| 176 |
+
**Instructions:**
|
| 177 |
+
|
| 178 |
+
1. Please clone this space, then modify the code to define your agent's logic, the tools, the necessary packages, etc ...
|
| 179 |
+
2. Log in to your Hugging Face account using the button below. This uses your HF username for submission.
|
| 180 |
+
3. Click 'Run Evaluation & Submit All Answers' to fetch questions, run your agent, submit answers, and see the score.
|
| 181 |
+
|
| 182 |
---
|
| 183 |
+
**Disclaimers:**
|
| 184 |
+
Once clicking on the "submit button, it can take quite some time ( this is the time for the agent to go through all the questions).
|
| 185 |
+
This space provides a basic setup and is intentionally sub-optimal to encourage you to develop your own, more robust solution. For instance for the delay process of the submit button, a solution could be to cache the answers and submit in a seperate action or even to answer the questions in async.
|
| 186 |
"""
|
| 187 |
)
|
| 188 |
|