Spaces:

anonymousauthorsanonymous
/

uncertainty

Runtime error

App Files Files Community

Anon Anon commited on Dec 7, 2022

Commit

5386bb9

1 Parent(s): 570c959

Update text for improved readability

Browse files

Files changed (1) hide show

app.py +10 -7

app.py CHANGED Viewed

@@ -213,7 +213,7 @@ with demo:
     gr.Markdown(
         "#### LLMs are pretty good at reporting their uncertainty. We just need to ask the right way.")
     gr.Markdown("Using our uncertainty metric informed by applying causal inference techniques in \
-        [Our ICLR paper under review](https://openreview.net/pdf?id=25VgHaPz0l4), \
         we are able to identify likely spurious correlations and exploit them in \
         the scenario of gender underspecified tasks. (Note that introspecting softmax probabilities alone is insufficient, as in the sentences \
         below, LLMs may report a softmax prob of ~0.9 despite the task being underspecified.)")
@@ -221,13 +221,16 @@ with demo:
         eight syntactically similar sentences. However semantically, \
         only two of the sentences are well-specified while the rest remain underspecified.")
     gr.Markdown("If a model can reliably tell us when it is uncertain about its predictions, one can replace only those uncertain predictions with\
-        an appropriate heuristic.")
     with gr.Row():
         model_name = gr.Radio(
             MODEL_NAMES,
             type="value",
-            label="Pick a preloaded BERT-like model for uncertainty evaluation (note: BERT-base performance least consistent)...",
         )
         own_model_name = gr.Textbox(
             label=f"...Or, if you selected an '{OWN_MODEL_NAME}' model, put any Hugging Face pipeline model name \
@@ -236,19 +239,19 @@ with demo:
     with gr.Row():
         occ_box = gr.Radio(
-            occs+[PICK_YOUR_OWN_LABEL], label=f"Pick an Occupation type from the Winogender Schemas evaluation set, or select '{PICK_YOUR_OWN_LABEL}'\
                  (it need not be about an occupation).")
     with gr.Row():
         alt_input_texts = gr.Textbox(
             lines=2,
-            label=f"...If you selected '{PICK_YOUR_OWN_LABEL}' above, add your own texts new-line delimited sentences here. Be sure\
             to include a single MASK-ed out pronoun. \
             If unsure on the required format, click an occupation above instead, to see some example input texts for this round."
         )
     with gr.Row():
-        get_text_btn = gr.Button("1) Load input texts")
     get_text_btn.click(
         fn=display_input_texts,
@@ -259,7 +262,7 @@ with demo:
     )
     with gr.Row():
-        uncertain_btn = gr.Button("2) Get uncertainty results!")
     gr.Markdown(
         "If there is an * by a sentence number, then at least one top prediction for that sentence was non-gendered.")

     gr.Markdown(
         "#### LLMs are pretty good at reporting their uncertainty. We just need to ask the right way.")
     gr.Markdown("Using our uncertainty metric informed by applying causal inference techniques in \
+        [our ICLR paper under review](https://openreview.net/pdf?id=25VgHaPz0l4), \
         we are able to identify likely spurious correlations and exploit them in \
         the scenario of gender underspecified tasks. (Note that introspecting softmax probabilities alone is insufficient, as in the sentences \
         below, LLMs may report a softmax prob of ~0.9 despite the task being underspecified.)")
         eight syntactically similar sentences. However semantically, \
         only two of the sentences are well-specified while the rest remain underspecified.")
     gr.Markdown("If a model can reliably tell us when it is uncertain about its predictions, one can replace only those uncertain predictions with\
+        an appropriate heuristic or information retrieval process.")
+    gr.Markdown("#### TL;DR")
+    gr.Markdown("Follow steps below to test out one of the pre-loaded options. Once you get the hang of it, you can load a new model and/or provide your own input texts.")
     with gr.Row():
         model_name = gr.Radio(
             MODEL_NAMES,
             type="value",
+            label="1) Pick a preloaded BERT-like model for uncertainty evaluation (note: RoBERTa-large performance is best)...",
         )
         own_model_name = gr.Textbox(
             label=f"...Or, if you selected an '{OWN_MODEL_NAME}' model, put any Hugging Face pipeline model name \
     with gr.Row():
         occ_box = gr.Radio(
+            occs+[PICK_YOUR_OWN_LABEL], label=f"2) Pick an Occupation type from the Winogender Schemas evaluation set, or select '{PICK_YOUR_OWN_LABEL}'\
                  (it need not be about an occupation).")
     with gr.Row():
         alt_input_texts = gr.Textbox(
             lines=2,
+            label=f"...Or, if you selected '{PICK_YOUR_OWN_LABEL}' above, add your own texts new-line delimited sentences here. Be sure\
             to include a single MASK-ed out pronoun. \
             If unsure on the required format, click an occupation above instead, to see some example input texts for this round."
         )
     with gr.Row():
+        get_text_btn = gr.Button("3) Load input texts")
     get_text_btn.click(
         fn=display_input_texts,
     )
     with gr.Row():
+        uncertain_btn = gr.Button("4) Get uncertainty results!")
     gr.Markdown(
         "If there is an * by a sentence number, then at least one top prediction for that sentence was non-gendered.")