GaiaAgent_Final_Assignment

Francesco-A committed on Jan 15

Commit

a92388a

1 Parent(s): 720cb5b

Agent and app update

Agent:
1 - switched back to gemini-2.5-flash-lite
2 - FINAL ANSWER format clarification

App:
implemented question filter logic

Files changed (2)

agent.py +7 -5
app.py +10 -1

agent.py CHANGED

@@ -81,18 +81,20 @@ Follow a **PLAN → ACT → OBSERVE** loop:
 ### 4. Additional instructions for the following tasks provided by GAIA team
 - You are a general AI assistant. I will ask you a question. Do not reveal your internal reasoning. Only the content inside FinalAnswerTool will be evaluated.
-- Finish your answer with the following template: FINAL ANSWER: [YOUR FINAL ANSWER].  YOUR FINAL ANSWER should be a number OR as few words as possible OR a comma separated list of numbers and/or strings. If you are asked for a number, don't use comma to write your number neither use units such as $ or percent sign unless specified otherwise. If you are asked for a string, don't use articles, neither abbreviations (e.g. for cities), and write the digits in plain text unless specified otherwise. If you are asked for a comma separated list, apply the above rules depending of whether the element to be put in the list is a number or a string.
+- YOUR FINAL ANSWER should be a number OR as few words as possible OR a comma separated list of numbers and/or strings. If you are asked for a number, don't use comma to write your number neither use units such as $ or percent sign unless specified otherwise. If you are asked for a string, don't use articles, neither abbreviations (e.g. for cities), and write the digits in plain text unless specified otherwise. If you are asked for a comma separated list, apply the above rules depending of whether the element to be put in the list is a number or a string.
+- Do NOT include "FINAL ANSWER:" in your final answer text. For example: if the question is "What is the capital of Spain?", respond with "Madrid". It is exact and expected answer.
 ### 5. To provide the final answer, you MUST call the final_answer tool inside a <code> block.
 - Example of how to end the task:
+Question: "What is the capital of France?"
 Thought: I have found the answer. I will now provide it.
 <code>
-final_answer("FINAL ANSWER: The capital of France is Paris")
+final_answer("Paris")
 </code>
-\n\n
 """
 # Instruction for Tool-Based Agents (BasicAgent and Gemini-Standard)
@@ -146,8 +148,8 @@ class BasicAgent:
         return self.basic_agent.run(prompt)
 class GeminiAgent:
-    # def __init__(self, native_multimodal: bool = True, model_id: str = "gemini/gemini-2.5-flash-lite"):
-    def __init__(self, native_multimodal: bool = True, model_id: str = "gemini/gemini-3-flash-preview"):
+    def __init__(self, native_multimodal: bool = True, model_id: str = "gemini/gemini-2.5-flash-lite"):
+    # def __init__(self, native_multimodal: bool = True, model_id: str = "gemini/gemini-3-flash-preview"):
         self.native_multimodal = native_multimodal
         if self.native_multimodal:
             client = genai.Client(api_key=os.environ.get("GOOGLE_API_KEY"))
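
The prompt change in this file asks the model to leave out the "FINAL ANSWER:" label. A model may still emit it now and then, so a small post-processing step could strip the label before the answer is submitted. The following is only a sketch under that assumption: `normalize_final_answer` is a hypothetical helper that does not appear in this commit, and where it would be called is left open.

```python
import re

# Hypothetical helper, not part of this commit.
# Matches an optional leading "FINAL ANSWER:" label, case-insensitively.
_FINAL_ANSWER_PREFIX = re.compile(r"^\s*final answer\s*:\s*", re.IGNORECASE)


def normalize_final_answer(raw: str) -> str:
    """Return the answer text without a leading 'FINAL ANSWER:' label."""
    return _FINAL_ANSWER_PREFIX.sub("", raw).strip()


if __name__ == "__main__":
    print(normalize_final_answer("FINAL ANSWER: Paris"))
    print(normalize_final_answer("Madrid"))
```

Answers that already follow the new format pass through unchanged apart from whitespace trimming, so applying the helper unconditionally should be harmless.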

app.py CHANGED

@@ -94,12 +94,21 @@ def run_and_submit_all( profile: gr.OAuthProfile | None):
             print("🛑 STOP BUTTON PRESSED: Breaking loop and submitting partial results.")
             results_log.append({"Task ID": "MANUAL_STOP", "Question": "N/A", "Submitted Answer": "USER INTERRUPTED"})
             break
         task_id = item.get("task_id")
         question_text = item.get("question")
         if not task_id or question_text is None:
             print(f"Skipping item with missing task_id or question: {item}")
             continue
+        # CONTENT FILTER SKIP (using .lower() for case-insensitivity)
+        filter_keywords = ["chess"]
+        question_words = set(question_text.lower().split()) # Only matches if the exact word is used
+        if any(word in question_words for word in filter_keywords):
+            print(f"Skipping filtered question: {item}")
+            results_log.append({"Task ID": task_id, "Question": question_text, "Submitted Answer": "SKIPPED: KEYWORD FILTER LOGIC"})
+            continue
         try:
             submitted_answer = agent(question_text)
             answers_payload.append({"task_id": task_id, "submitted_answer": submitted_answer})
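
One thing to be aware of in the filter added here: `str.split()` splits on whitespace only, so a token such as "chess." or "chess?" is not equal to "chess" and the question would not be skipped. If that is not the intended behavior, a regex word-boundary match is one possible alternative. The sketch below is an assumption, not the author's implementation; `should_skip` and `FILTER_KEYWORDS` are hypothetical names.

```python
import re

# Same keyword as in the diff; extend the list as needed.
FILTER_KEYWORDS = ["chess"]

# Hypothetical helper, not part of this commit.
# \b matches at word boundaries, so trailing punctuation does not block a
# match, while longer words such as "chessboard" are still not matched.
_KEYWORD_RE = re.compile(
    r"\b(?:" + "|".join(re.escape(k) for k in FILTER_KEYWORDS) + r")\b",
    re.IGNORECASE,
)


def should_skip(question_text: str) -> bool:
    """Return True if the question contains any filter keyword as a whole word."""
    return _KEYWORD_RE.search(question_text) is not None


if __name__ == "__main__":
    question = "What is the best move in this game of chess?"
    # Whitespace tokenization misses the keyword because of the "?".
    print("chess" in set(question.lower().split()))
    # The word-boundary match finds it.
    print(should_skip(question))
```

In the loop, `if should_skip(question_text):` could stand in for the `any(...)` check while the logging and `continue` stay as they are.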