Thanh Vinh Vo commited on
Commit
c1032f3
·
1 Parent(s): 86c8012
Files changed (2) hide show
  1. NOTES +2 -2
  2. app.py +2 -2
NOTES CHANGED
@@ -3,13 +3,13 @@
3
  - Don't give the master any tool, since it will try to delegate smaller work to the code agent, miss context
4
  - Temperature to 0
5
  - BeautifulSoup too bad
 
6
 
7
 
8
 
9
  TASKS
10
- - MATH: 6f37996b-2ac7-44b0-8e68-6d28256631b4
11
  - 305ac316-eef6-4446-960a-92d80d542f82
12
- - 7bd855d8-463d-4ed5-93ca-5fe35145f733
13
  - cf106601-ab4f-4af9-b045-5295fe67b37d
14
  - bda648d7-d618-4883-88f4-3466eabd860e
15
  - 1f975693-876d-457b-a649-393859e79bf3
 
3
  - Don't give the master any tool, since it will try to delegate smaller work to the code agent, miss context
4
  - Temperature to 0
5
  - BeautifulSoup too bad
6
+ - Stupid scoring function
7
 
8
 
9
 
10
  TASKS
 
11
  - 305ac316-eef6-4446-960a-92d80d542f82
12
+ - EXCEL: 7bd855d8-463d-4ed5-93ca-5fe35145f733
13
  - cf106601-ab4f-4af9-b045-5295fe67b37d
14
  - bda648d7-d618-4883-88f4-3466eabd860e
15
  - 1f975693-876d-457b-a649-393859e79bf3
app.py CHANGED
@@ -251,7 +251,7 @@ class BasicAgent:
251
  3. If the question asks for a number, please return a numerical answer without unit (unless unit is specifically asked for). For example: 3 instead of three, 0 instead of None, 3 instead of $3.
252
  4. `pandas` package is available for reading table data from HTML content or URL. It is useful for extracting tabular data from web pages (including Wikipedia pages).
253
  5. If the question asks for a list, please make sure that the elements are separated by a comma(`,`) and a space(` `). For example: `1, 2, 3` instead of `1,2,3`.
254
- 6. Remember that `Ice Cream` is a food item!
255
  """
256
  result = self.manager_agent.run(prompt)
257
  print(f"Agent responded with: {result}")
@@ -427,7 +427,7 @@ with gr.Blocks() as demo:
427
  label="Question id to solve (empty to solve all)",
428
  lines=1,
429
  interactive=True,
430
- value="6f37996b-2ac7-44b0-8e68-6d28256631b4",
431
  )
432
  run_button = gr.Button("Run Evaluation & Submit All Answers")
433
  status_output = gr.Textbox(
 
251
  3. If the question asks for a number, please return a numerical answer without unit (unless unit is specifically asked for). For example: 3 instead of three, 0 instead of None, 3 instead of $3.
252
  4. `pandas` package is available for reading table data from HTML content or URL. It is useful for extracting tabular data from web pages (including Wikipedia pages).
253
  5. If the question asks for a list, please make sure that the elements are separated by a comma(`,`) and a space(` `). For example: `1, 2, 3` instead of `1,2,3`.
254
+ 6. If the question asks for a number with specific decimal places, please format the number into string with the same decimal places. For example: 3.00 instead of 3.
255
  """
256
  result = self.manager_agent.run(prompt)
257
  print(f"Agent responded with: {result}")
 
427
  label="Question id to solve (empty to solve all)",
428
  lines=1,
429
  interactive=True,
430
+ value="7bd855d8-463d-4ed5-93ca-5fe35145f733",
431
  )
432
  run_button = gr.Button("Run Evaluation & Submit All Answers")
433
  status_output = gr.Textbox(