Spaces:
Sleeping
Sleeping
diana3135
commited on
Commit
·
947ca60
1
Parent(s):
34fdb2c
adjust evaluation prompt
Browse files
utils.py
CHANGED
|
@@ -120,10 +120,10 @@ def get_evaluation_with_gpt(task_description, text, api_key=None):
|
|
| 120 |
"You are tasked with evaluating the answer using a scale from 0 to 10 based on specific criteria. "
|
| 121 |
"Follow a structured chain-of-thought approach for your evaluation to ensure thoroughness and objectivity:\n\n"
|
| 122 |
"1. **Understand the Criteria:** Carefully review each evaluation criterion to ensure you fully grasp what is being assessed:\n"
|
| 123 |
-
" - Novelty: The uniqueness and
|
| 124 |
-
" -
|
| 125 |
-
" - Inimitability:
|
| 126 |
-
" - Alignment:
|
| 127 |
"2. **Analyze the Answer:** Break down the answer into its key components and assess how well it meets each criterion.\n"
|
| 128 |
" - Identify strengths, weaknesses, and any gaps in the ideas provided.\n"
|
| 129 |
" - Consider the context of the task and whether the ideas are realistic and relevant.\n\n"
|
|
@@ -134,10 +134,9 @@ def get_evaluation_with_gpt(task_description, text, api_key=None):
|
|
| 134 |
" - 9-10: Excellent fit; the idea fully aligns with the criteria, demonstrating exceptional insight.\n"
|
| 135 |
" - Note: Use the entire scoring range (0-10) and avoid defaulting to mid-range scores. If the provided answer is vague or off-topic, assign scores between 0-5.\n\n"
|
| 136 |
"4. **Justify Each Score:** Provide a brief explanation for each score, highlighting specific aspects of the answer that influenced your evaluation.\n\n"
|
| 137 |
-
"5. **Summarize:** Conclude with an overall assessment, summarizing the strengths and weaknesses of the answer.\n\n"
|
| 138 |
"Format your output exactly as follows:\n"
|
| 139 |
"Novelty: [Score] - [Justification]\n"
|
| 140 |
-
"
|
| 141 |
"Inimitability: [Score] - [Justification]\n"
|
| 142 |
"Alignment: [Score] - [Justification]\n\n"
|
| 143 |
"Begin your evaluation below:"
|
|
|
|
| 120 |
"You are tasked with evaluating the answer using a scale from 0 to 10 based on specific criteria. "
|
| 121 |
"Follow a structured chain-of-thought approach for your evaluation to ensure thoroughness and objectivity:\n\n"
|
| 122 |
"1. **Understand the Criteria:** Carefully review each evaluation criterion to ensure you fully grasp what is being assessed:\n"
|
| 123 |
+
" - Novelty: The uniqueness and originality of the ideas.\n"
|
| 124 |
+
" - Feasibility: The practicality and implementability of suggested actions.\n"
|
| 125 |
+
" - Inimitability: How difficult for competitors to replicate.\n"
|
| 126 |
+
" - Alignment: How aligned the ideas are with Airbnb’s business objectives and 17 SDGs.\n\n"
|
| 127 |
"2. **Analyze the Answer:** Break down the answer into its key components and assess how well it meets each criterion.\n"
|
| 128 |
" - Identify strengths, weaknesses, and any gaps in the ideas provided.\n"
|
| 129 |
" - Consider the context of the task and whether the ideas are realistic and relevant.\n\n"
|
|
|
|
| 134 |
" - 9-10: Excellent fit; the idea fully aligns with the criteria, demonstrating exceptional insight.\n"
|
| 135 |
" - Note: Use the entire scoring range (0-10) and avoid defaulting to mid-range scores. If the provided answer is vague or off-topic, assign scores between 0-5.\n\n"
|
| 136 |
"4. **Justify Each Score:** Provide a brief explanation for each score, highlighting specific aspects of the answer that influenced your evaluation.\n\n"
|
|
|
|
| 137 |
"Format your output exactly as follows:\n"
|
| 138 |
"Novelty: [Score] - [Justification]\n"
|
| 139 |
+
"Feasibility: [Score] - [Justification]\n"
|
| 140 |
"Inimitability: [Score] - [Justification]\n"
|
| 141 |
"Alignment: [Score] - [Justification]\n\n"
|
| 142 |
"Begin your evaluation below:"
|