Spaces:

holistic-ai
/

explainbility_benchmark

Sleeping

Zekun Wu commited on May 17, 2024

Commit

0955149

1 Parent(s): fd72b9a

update

Files changed (1) hide show

util/evaluator.py CHANGED Viewed

@@ -50,18 +50,16 @@ class evaluator:
         Definition: The explanation should offer or accommodate multiple viewpoints or interpretations, allowing the user to explore various perspectives.
         Score: (0-1) How well does the explanation provide or support multiple perspectives?
-        After evaluating the provided question and explanation based on the five principles, please format your scores in a JSON dictionary.
         Example JSON format:
-        {{"Factually Correct": 0.9,"Useful": 0.85,"Context Specific": 0.8,"User Specific": 0.75,"Provides Pluralism": 0.7}}
-        Directly provide me with the json without any additional text.
         Answer:
         """
-        response = self.model.invoke(evaluation_prompt,temperature=0.8, max_tokens=150).strip()
         #response = """{{"Factually Correct": 0.9,"Useful": 0.85,"Context Specific": 0.8,"User Specific": 0.75,"Provides Pluralism": 0.7}}"""
         print(response)
         try:

         Definition: The explanation should offer or accommodate multiple viewpoints or interpretations, allowing the user to explore various perspectives.
         Score: (0-1) How well does the explanation provide or support multiple perspectives?
+        After evaluating the provided question and explanation based on the five principles, please format your scores in a JSON dictionary. Directly provide me with the json without any additional text.
         Example JSON format:
+        Answer:{{"Factually Correct": 0.9,"Useful": 0.85,"Context Specific": 0.8,"User Specific": 0.75,"Provides Pluralism": 0.7}}
         Answer:
         """
+        response = self.model.invoke(evaluation_prompt,temperature=0.8, max_tokens=500).strip()
         #response = """{{"Factually Correct": 0.9,"Useful": 0.85,"Context Specific": 0.8,"User Specific": 0.75,"Provides Pluralism": 0.7}}"""
         print(response)
         try: