Final_Assignment_Template

Datawithsarah committed on May 1, 2025

Commit

3707bff

1 Parent(s): b5ed0c0

enhanced prompt

Files changed (1)

app.py +22 -26

@@ -329,35 +329,31 @@ def run_and_submit_all( profile: gr.OAuthProfile | None):
             print(f"Skipping item with missing task_id or question: {item}")
             continue
         try:
-            full_prompt = f"""You are a highly precise answering agent designed to meet the GAIA benchmark's exact-match standards.
-When presented with a question:
-- Use tools appropriately and deliberately. Do not make assumptions or guess answers.
-- Use `web_search` to find external sources only if necessary. If the results include short snippets, you MUST follow the link and read the full content using `read_wikipedia_page`.
-- You have access to `read_wikipedia_page` ONLY — no other external browsing is allowed.
-- When reading long text, ALWAYS use `smart_paginate_around_query` to extract focused context. Use 1-3 general keywords (not full questions) as the query.
-- If the task involves reversing words, letters, or phrases, use the `reverse_sentence` tool. Never reverse text manually.
-- For any file-based task (e.g., .mp3, .csv, .json, .xlsx), use the `file_name` provided in the metadata — not a name mentioned in the question text.
-- Format lists with a single space after each comma.
-- If asked for a number, return digits only — no commas, currency signs, or symbols (e.g., %, $, etc.).
-- If asked for a string, do not include articles (e.g., "the", "a") or abbreviations unless required. Spell out numbers in digit form unless stated otherwise.
-- If asked for a comma-separated list, apply the correct formatting per element type (string or number).
-Once you have the exact answer:
-- Immediately call `final_answer("your_answer")` and stop execution.
-- Never retry, rerun, or generate multiple answers.
-- Do not include reasoning, steps, thoughts, or commentary — just the final value.
+            full_prompt = f"""
+You are a precise answering agent optimized for exact-match benchmarks like GAIA.
+Your job is to:
+- Use tools (e.g., `web_search`, `read_wikipedia_page`, `smart_paginate_around_query`, `reverse_sentence`, `open_file_as_text`, etc.) only when needed.
+- Never make assumptions. Do not guess.
+- Use `read_wikipedia_page` to read full content if snippets from `web_search` are not enough.
+- Use `smart_paginate_around_query` with 1-3 keyword terms — never full questions.
+- Use `reverse_sentence` for any reverse operation, never do it manually.
+- Use the provided `file_name` field for file tasks, not filenames inside the question.
+- Output formats:
+  - Numbers: Digits only, no commas, $, or %.
+  - Strings: No articles, abbreviations, or spelled-out numbers unless required.
+  - Lists: Comma separated, single space after each comma.
+- At the end, print only the final answer. No explanation, no reasoning.
 Example:
-If asked: "What is the capital of France?"
-Your answer logic should follow:
-```py
+If asked, “What is the capital of France?”
+Respond:
 print("Paris")
-```<end_code>
-Based on the above guidelines, answer the following question:
---begin of question--
+Question:
 {question_text}
---end of question--
-If the questions mentions the need to use a file, use the following `file_name` value as the `file_name` parameter in any function calls:
-file_name: {file_name}"""
+File to use (if needed): {file_name}"""
             submitted_answer = agent.run(full_prompt)
             answers_payload.append({"task_id": task_id, "submitted_answer": submitted_answer})
             results_log.append({"Task ID": task_id, "Question": question_text, "Submitted Answer": submitted_answer})
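
Review note: the new prompt states its output-format rules (numbers, strings, lists) only as instructions to the model, so an exact-match score still depends on the agent following them. A minimal sketch of how the same three rules could also be enforced in code on the returned answer is shown below. It is not part of this commit: the helper names `normalize_number`, `normalize_string`, and `normalize_list` are hypothetical, and it assumes the caller already knows which answer type the question expects. The string helper only drops a leading article, which is one possible reading of "no articles".

```python
import re

# Hypothetical helpers, not part of app.py. They mirror the "Output formats"
# rules in the prompt: digits only, no articles, single space after commas.
_ARTICLES = {"a", "an", "the"}


def normalize_number(raw: str) -> str:
    """Strip commas, currency/percent signs, and whitespace from a number."""
    return re.sub(r"[,$%\s]", "", raw)


def normalize_string(raw: str) -> str:
    """Drop a leading article (assumption) and collapse repeated whitespace."""
    words = raw.split()
    if len(words) > 1 and words[0].lower() in _ARTICLES:
        words = words[1:]
    return " ".join(words)


def normalize_list(raw: str) -> str:
    """Re-join comma-separated items with exactly one space after each comma."""
    items = [item.strip() for item in raw.split(",")]
    return ", ".join(item for item in items if item)
```

If something like this were adopted, it would presumably be applied to `submitted_answer` before it is appended to `answers_payload`, so the prompt rules and the submitted value cannot drift apart.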