foo-barrr committed
Commit 4baa2f8 · verified · 1 parent: f400429

Update app.py

Files changed (1): app.py (+3 −4)
app.py CHANGED
@@ -48,11 +48,11 @@ with gr.Blocks(title="LLM Propensity Evaluation Leaderboard") as demo:
 
     ## Evaluation Details:
     - **Instruction Following Score**: Measures a model's tendency to follow instructions accurately. Measured using the IFEval dataset.
-    - **Hallucination Rate**: Evaluates how often a model hallucinates. Measured using a subset of the SimpleQA dataset. We calculated the rate using this formula : (1 - (correct + not_attempted)), where correct = when the model answered a question correctly and not_attempted = when a model admits to not knowing the answer to a question.*
+    - **Factual Hallucination Rate**: Evaluates how often a model hallucinates when questioned on facts. Measured using a subset of the SimpleQA dataset, which explicitly asks uncommon facts. We calculated the rate using this formula : (1 - (correct + not_attempted)), where correct = when the model answered a question correctly and not_attempted = when a model admits to not knowing the answer to a question.*
 
     ## How to Interpret the Scores:
     * Instruction Following Score: Higher scores indicate better adherence to instructions.
-    * Hallucination Rate: Lower rates indicate fewer hallucinations.
+    * Hallucination Rate: Lower rates indicate fewer hallucinations.
 
     *Note*: The evaluation metrics are designed to provide insights into the models' behavior in specific contexts. They may not capture all aspects of model performance or alignment.
 
@@ -80,8 +80,7 @@ with gr.Blocks(title="LLM Propensity Evaluation Leaderboard") as demo:
     # Add footer information
     gr.Markdown("""
    ---
-    **Last Updated**: Sep 11, 2025
-    **Contact**: <TBD>
+    **Last Updated**: November 1, 2025
    """)
 
     # Launch the app
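The formula in the updated description, (1 - (correct + not_attempted)), can be sketched as a small helper. This is a hypothetical illustration of the metric, not code from app.py; the function name and parameters are assumptions, with counts normalized by the total number of questions:

```python
def hallucination_rate(correct: int, not_attempted: int, total: int) -> float:
    """Factual hallucination rate as described in the leaderboard text:
    1 - (fraction answered correctly + fraction the model declined to answer).
    Everything else is treated as a hallucinated (confident but wrong) answer."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 1.0 - (correct + not_attempted) / total

# Example: of 100 SimpleQA questions, 55 answered correctly, 20 declined.
rate = hallucination_rate(55, 20, 100)
print(rate)  # 0.25 -> 25% of answers were hallucinated
```

Under this reading, lower rates are better, which matches the "How to Interpret the Scores" bullet in the diff.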