Commit c1032f3 (1 parent: 86c8012), committed by Thanh Vinh Vo
update
NOTES CHANGED
@@ -3,13 +3,13 @@
 - Don't give the master any tool, since it will try to delegate smaller work to the code agent, miss context
 - Temperature to 0
 - BeautifulSoup too bad
+- Stupid scoring function
 
 
 
 TASKS
-- MATH: 6f37996b-2ac7-44b0-8e68-6d28256631b4
 - 305ac316-eef6-4446-960a-92d80d542f82
-- 7bd855d8-463d-4ed5-93ca-5fe35145f733
+- EXCEL: 7bd855d8-463d-4ed5-93ca-5fe35145f733
 - cf106601-ab4f-4af9-b045-5295fe67b37d
 - bda648d7-d618-4883-88f4-3466eabd860e
 - 1f975693-876d-457b-a649-393859e79bf3
app.py CHANGED
@@ -251,7 +251,7 @@ class BasicAgent:
 3. If the question asks for a number, please return a numerical answer without unit (unless unit is specifically asked for). For example: 3 instead of three, 0 instead of None, 3 instead of $3.
 4. `pandas` package is available for reading table data from HTML content or URL. It is useful for extracting tabular data from web pages (including Wikipedia pages).
 5. If the question asks for a list, please make sure that the elements are separated by a comma(`,`) and a space(` `). For example: `1, 2, 3` instead of `1,2,3`.
-6.
+6. If the question asks for a number with specific decimal places, please format the number into string with the same decimal places. For example: 3.00 instead of 3.
 """
 result = self.manager_agent.run(prompt)
 print(f"Agent responded with: {result}")
@@ -427,7 +427,7 @@ with gr.Blocks() as demo:
 label="Question id to solve (empty to solve all)",
 lines=1,
 interactive=True,
-value="
+value="7bd855d8-463d-4ed5-93ca-5fe35145f733",
 )
 run_button = gr.Button("Run Evaluation & Submit All Answers")
 status_output = gr.Textbox(
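Prompt rules 3, 5 and 6 in `app.py` ask the model to format its own answers. As an illustration only, the same rules could also be applied in post-processing before an answer is scored. The sketch below is not part of this commit: `format_number` and `format_list` are hypothetical helper names, and it assumes the raw answer is a string containing at most one numeric value. It does not convert number words such as "three" or map `None` to 0.

```python
import re
from decimal import Decimal
from typing import Iterable, Optional


def format_number(text: str, decimals: Optional[int] = None) -> str:
    """Hypothetical helper: drop units/currency symbols and thousands
    separators, and optionally force a fixed number of decimal places."""
    # Find the first number in the text, e.g. "$1,234.5 USD" -> "1,234.5".
    match = re.search(r"-?\d[\d,]*(?:\.\d+)?", text)
    if match is None:
        # Nothing numeric found; return the text unchanged (trimmed).
        return text.strip()
    number = Decimal(match.group(0).replace(",", ""))
    if decimals is None:
        return str(number)
    # Rule 6: keep the requested number of decimal places, e.g. 3 -> "3.00".
    return f"{number:.{decimals}f}"


def format_list(items: Iterable[object]) -> str:
    """Hypothetical helper: join elements with a comma and a space (rule 5)."""
    return ", ".join(str(item).strip() for item in items)


if __name__ == "__main__":
    print(format_number("$3"))
    print(format_number("3", decimals=2))
    print(format_list("1,2,3".split(",")))
```

Using `Decimal` rather than `float` keeps the digits exactly as the model wrote them, which matters when the scoring function compares answers as strings.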