Spaces:

aail-hf
/

ensemble_machine

Sleeping

App Files Files Community

diana3135 commited on Nov 22, 2024

Commit

947ca60

1 Parent(s): 34fdb2c

adjust evaluation prompt

Browse files

Files changed (1) hide show

utils.py +5 -6

utils.py CHANGED Viewed

@@ -120,10 +120,10 @@ def get_evaluation_with_gpt(task_description, text, api_key=None):
     "You are tasked with evaluating the answer using a scale from 0 to 10 based on specific criteria. "
     "Follow a structured chain-of-thought approach for your evaluation to ensure thoroughness and objectivity:\n\n"
     "1. **Understand the Criteria:** Carefully review each evaluation criterion to ensure you fully grasp what is being assessed:\n"
-    "   - Novelty: The uniqueness and innovation of the ideas.\n"
-    "   - Implementability: The practicality of suggested actions.\n"
-    "   - Inimitability: The difficulty for competitors to replicate the ideas.\n"
-    "   - Alignment: The degree to which the ideas align with Airbnb’s goals and the 17 SDGs.\n\n"
     "2. **Analyze the Answer:** Break down the answer into its key components and assess how well it meets each criterion.\n"
     "   - Identify strengths, weaknesses, and any gaps in the ideas provided.\n"
     "   - Consider the context of the task and whether the ideas are realistic and relevant.\n\n"
@@ -134,10 +134,9 @@ def get_evaluation_with_gpt(task_description, text, api_key=None):
     "   - 9-10: Excellent fit; the idea fully aligns with the criteria, demonstrating exceptional insight.\n"
     "   - Note: Use the entire scoring range (0-10) and avoid defaulting to mid-range scores. If the provided answer is vague or off-topic, assign scores between 0-5.\n\n"
     "4. **Justify Each Score:** Provide a brief explanation for each score, highlighting specific aspects of the answer that influenced your evaluation.\n\n"
-    "5. **Summarize:** Conclude with an overall assessment, summarizing the strengths and weaknesses of the answer.\n\n"
     "Format your output exactly as follows:\n"
     "Novelty: [Score] - [Justification]\n"
-    "Implementability: [Score] - [Justification]\n"
     "Inimitability: [Score] - [Justification]\n"
     "Alignment: [Score] - [Justification]\n\n"
     "Begin your evaluation below:"

     "You are tasked with evaluating the answer using a scale from 0 to 10 based on specific criteria. "
     "Follow a structured chain-of-thought approach for your evaluation to ensure thoroughness and objectivity:\n\n"
     "1. **Understand the Criteria:** Carefully review each evaluation criterion to ensure you fully grasp what is being assessed:\n"
+    "   - Novelty: The uniqueness and originality of the ideas.\n"
+    "   - Feasibility: The practicality and implementability of suggested actions.\n"
+    "   - Inimitability: How difficult for competitors to replicate.\n"
+    "   - Alignment: How aligned the ideas are with Airbnb’s business objectives and 17 SDGs.\n\n"
     "2. **Analyze the Answer:** Break down the answer into its key components and assess how well it meets each criterion.\n"
     "   - Identify strengths, weaknesses, and any gaps in the ideas provided.\n"
     "   - Consider the context of the task and whether the ideas are realistic and relevant.\n\n"
     "   - 9-10: Excellent fit; the idea fully aligns with the criteria, demonstrating exceptional insight.\n"
     "   - Note: Use the entire scoring range (0-10) and avoid defaulting to mid-range scores. If the provided answer is vague or off-topic, assign scores between 0-5.\n\n"
     "4. **Justify Each Score:** Provide a brief explanation for each score, highlighting specific aspects of the answer that influenced your evaluation.\n\n"
     "Format your output exactly as follows:\n"
     "Novelty: [Score] - [Justification]\n"
+    "Feasibility: [Score] - [Justification]\n"
     "Inimitability: [Score] - [Justification]\n"
     "Alignment: [Score] - [Justification]\n\n"
     "Begin your evaluation below:"