Spark Chou committed on
Commit 4a831fc · 1 Parent(s): a90eaa6
Files changed (1)
  1. app.py +9 -6
app.py CHANGED
@@ -898,9 +898,11 @@ with gr.Blocks(theme=gr.themes.Soft(), css=".gradio-container {max-width: 960px
  go_to_pretest_btn = gr.Button("Got it, start the test", variant="primary")
 
  with pretest_page:
- gr.Markdown("""## Pre-Test Instructions
-
- - For each question, you'll evaluate the **response** (not the initiator) across **5 dimensions**.
+ gr.Markdown("""## Test Instructions
+ - Every dialogue includes 2 speakers and lasts around 1 minute.
+ - **Initiator:** The one who talks the first in the dialogue.
+ - **Respondent:** The other one.
+ - For each question, you'll evaluate the **respondent** (not the initiator) across **5 dimensions**.
  - Under each dimension, score **every listed feature** from **0 to 5**:
 
  ### 🔢 Scoring Guide:
@@ -911,17 +913,18 @@ with gr.Blocks(theme=gr.themes.Soft(), css=".gradio-container {max-width: 960px
  - **4** – Somewhat human-like
  - **5** – Strongly human-like
 
- - After rating all dimensions, make a final judgment: is the **responder** a human or an AI?
+ - After rating all dimensions, make a final judgment: is the **respondent** a human or an AI?
  - You can freely switch between dimensions using the **Previous** and **Next** buttons.
 
  ---
 
  ### ⚠️ Important Notes:
 
- - Focus on whether the **responder's speech** sounds more **human-like or machine-like** for each feature not just whether the feature is "present".
+ - Once you start the test, try not to refresh the page or quit it. You need to grade 5 recordings every test.
+ - Focus on whether the **respondent's speech** sounds more **human-like or machine-like** for each feature — not just whether the feature is "present".
  > For example: correct pronunciation doesn't always mean "human", and mispronunciation doesn't mean "AI". Think in terms of human-likeness.
 
- - Even if you're confident early on about the responder's identity, still evaluate **each dimension independently**.
+ - Even if you're confident early on about the respondent's identity, still evaluate **each dimension independently**.
  Avoid just labeling all dimensions as "machine-like" or "human-like" without listening carefully.
  """)
  go_to_test_btn = gr.Button("Start the Test", variant="primary")
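
The instructions in this diff ask raters to score every listed feature from 0 to 5 under each dimension before giving a final judgment. The diff does not show how `app.py` enforces that rule, so the following is only a minimal sketch of one way such a check could look. The function name `missing_or_invalid` and the dimension and feature names are hypothetical and are not taken from `app.py`.

```python
from typing import Dict, List

MIN_SCORE, MAX_SCORE = 0, 5  # scoring range stated in the instructions


def missing_or_invalid(
    features_by_dimension: Dict[str, List[str]],
    scores: Dict[str, Dict[str, int]],
) -> List[str]:
    """Return "dimension/feature" keys that are unscored or outside 0..5."""
    problems = []
    for dimension, features in features_by_dimension.items():
        given = scores.get(dimension, {})
        for feature in features:
            value = given.get(feature)
            # bool is a subclass of int, so reject it explicitly.
            is_int = isinstance(value, int) and not isinstance(value, bool)
            if not is_int or not MIN_SCORE <= value <= MAX_SCORE:
                problems.append(f"{dimension}/{feature}")
    return problems


# Hypothetical dimension and feature names, for illustration only.
rubric = {"prosody": ["rhythm", "intonation"], "fluency": ["pauses"]}
ratings = {"prosody": {"rhythm": 4, "intonation": 7}}
print(missing_or_invalid(rubric, ratings))
```

A check like this could run before the final human-or-AI judgment is accepted, so that a rater who skipped a feature is sent back to the relevant dimension instead of submitting a partial rating.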