diana3135 commited on
Commit
947ca60
·
1 Parent(s): 34fdb2c

adjust evaluation prompt

Browse files
Files changed (1) hide show
  1. utils.py +5 -6
utils.py CHANGED
@@ -120,10 +120,10 @@ def get_evaluation_with_gpt(task_description, text, api_key=None):
120
  "You are tasked with evaluating the answer using a scale from 0 to 10 based on specific criteria. "
121
  "Follow a structured chain-of-thought approach for your evaluation to ensure thoroughness and objectivity:\n\n"
122
  "1. **Understand the Criteria:** Carefully review each evaluation criterion to ensure you fully grasp what is being assessed:\n"
123
- " - Novelty: The uniqueness and innovation of the ideas.\n"
124
- " - Implementability: The practicality of suggested actions.\n"
125
- " - Inimitability: The difficulty for competitors to replicate the ideas.\n"
126
- " - Alignment: The degree to which the ideas align with Airbnb’s goals and the 17 SDGs.\n\n"
127
  "2. **Analyze the Answer:** Break down the answer into its key components and assess how well it meets each criterion.\n"
128
  " - Identify strengths, weaknesses, and any gaps in the ideas provided.\n"
129
  " - Consider the context of the task and whether the ideas are realistic and relevant.\n\n"
@@ -134,10 +134,9 @@ def get_evaluation_with_gpt(task_description, text, api_key=None):
134
  " - 9-10: Excellent fit; the idea fully aligns with the criteria, demonstrating exceptional insight.\n"
135
  " - Note: Use the entire scoring range (0-10) and avoid defaulting to mid-range scores. If the provided answer is vague or off-topic, assign scores between 0-5.\n\n"
136
  "4. **Justify Each Score:** Provide a brief explanation for each score, highlighting specific aspects of the answer that influenced your evaluation.\n\n"
137
- "5. **Summarize:** Conclude with an overall assessment, summarizing the strengths and weaknesses of the answer.\n\n"
138
  "Format your output exactly as follows:\n"
139
  "Novelty: [Score] - [Justification]\n"
140
- "Implementability: [Score] - [Justification]\n"
141
  "Inimitability: [Score] - [Justification]\n"
142
  "Alignment: [Score] - [Justification]\n\n"
143
  "Begin your evaluation below:"
 
120
  "You are tasked with evaluating the answer using a scale from 0 to 10 based on specific criteria. "
121
  "Follow a structured chain-of-thought approach for your evaluation to ensure thoroughness and objectivity:\n\n"
122
  "1. **Understand the Criteria:** Carefully review each evaluation criterion to ensure you fully grasp what is being assessed:\n"
123
+ " - Novelty: The uniqueness and originality of the ideas.\n"
124
+ " - Feasibility: The practicality and implementability of suggested actions.\n"
125
+ " - Inimitability: How difficult for competitors to replicate.\n"
126
+ " - Alignment: How aligned the ideas are with Airbnb’s business objectives and 17 SDGs.\n\n"
127
  "2. **Analyze the Answer:** Break down the answer into its key components and assess how well it meets each criterion.\n"
128
  " - Identify strengths, weaknesses, and any gaps in the ideas provided.\n"
129
  " - Consider the context of the task and whether the ideas are realistic and relevant.\n\n"
 
134
  " - 9-10: Excellent fit; the idea fully aligns with the criteria, demonstrating exceptional insight.\n"
135
  " - Note: Use the entire scoring range (0-10) and avoid defaulting to mid-range scores. If the provided answer is vague or off-topic, assign scores between 0-5.\n\n"
136
  "4. **Justify Each Score:** Provide a brief explanation for each score, highlighting specific aspects of the answer that influenced your evaluation.\n\n"
 
137
  "Format your output exactly as follows:\n"
138
  "Novelty: [Score] - [Justification]\n"
139
+ "Feasibility: [Score] - [Justification]\n"
140
  "Inimitability: [Score] - [Justification]\n"
141
  "Alignment: [Score] - [Justification]\n\n"
142
  "Begin your evaluation below:"