Spaces:
Sleeping
Sleeping
Spark Chou commited on
Commit ·
4a831fc
1
Parent(s): a90eaa6
new
Browse files
app.py
CHANGED
|
@@ -898,9 +898,11 @@ with gr.Blocks(theme=gr.themes.Soft(), css=".gradio-container {max-width: 960px
|
|
| 898 |
go_to_pretest_btn = gr.Button("Got it, start the test", variant="primary")
|
| 899 |
|
| 900 |
with pretest_page:
|
| 901 |
-
gr.Markdown("""##
|
| 902 |
-
|
| 903 |
-
-
|
|
|
|
|
|
|
| 904 |
- Under each dimension, score **every listed feature** from **0 to 5**:
|
| 905 |
|
| 906 |
### 🔢 Scoring Guide:
|
|
@@ -911,17 +913,18 @@ with gr.Blocks(theme=gr.themes.Soft(), css=".gradio-container {max-width: 960px
|
|
| 911 |
- **4** – Somewhat human-like
|
| 912 |
- **5** – Strongly human-like
|
| 913 |
|
| 914 |
-
- After rating all dimensions, make a final judgment: is the **
|
| 915 |
- You can freely switch between dimensions using the **Previous** and **Next** buttons.
|
| 916 |
|
| 917 |
---
|
| 918 |
|
| 919 |
### ⚠️ Important Notes:
|
| 920 |
|
| 921 |
-
-
|
|
|
|
| 922 |
> For example: correct pronunciation doesn't always mean "human", and mispronunciation doesn't mean "AI". Think in terms of human-likeness.
|
| 923 |
|
| 924 |
-
- Even if you're confident early on about the
|
| 925 |
Avoid just labeling all dimensions as "machine-like" or "human-like" without listening carefully.
|
| 926 |
""")
|
| 927 |
go_to_test_btn = gr.Button("Start the Test", variant="primary")
|
|
|
|
| 898 |
go_to_pretest_btn = gr.Button("Got it, start the test", variant="primary")
|
| 899 |
|
| 900 |
with pretest_page:
|
| 901 |
+
gr.Markdown("""## Test Instructions
|
| 902 |
+
- Every dialogue includes 2 speakers and lasts around 1 minute.
|
| 903 |
+
- **Initiator:** The one who talks the first in the dialogue.
|
| 904 |
+
- **Respondent:** The other one.
|
| 905 |
+
- For each question, you'll evaluate the **respondent** (not the initiator) across **5 dimensions**.
|
| 906 |
- Under each dimension, score **every listed feature** from **0 to 5**:
|
| 907 |
|
| 908 |
### 🔢 Scoring Guide:
|
|
|
|
| 913 |
- **4** – Somewhat human-like
|
| 914 |
- **5** – Strongly human-like
|
| 915 |
|
| 916 |
+
- After rating all dimensions, make a final judgment: is the **respondent** a human or an AI?
|
| 917 |
- You can freely switch between dimensions using the **Previous** and **Next** buttons.
|
| 918 |
|
| 919 |
---
|
| 920 |
|
| 921 |
### ⚠️ Important Notes:
|
| 922 |
|
| 923 |
+
- Once you start the test, try not to refresh the page or quit it. You need to grade 5 recordings every test.
|
| 924 |
+
- Focus on whether the **respondent's speech** sounds more **human-like or machine-like** for each feature — not just whether the feature is "present".
|
| 925 |
> For example: correct pronunciation doesn't always mean "human", and mispronunciation doesn't mean "AI". Think in terms of human-likeness.
|
| 926 |
|
| 927 |
+
- Even if you're confident early on about the respondent's identity, still evaluate **each dimension independently**.
|
| 928 |
Avoid just labeling all dimensions as "machine-like" or "human-like" without listening carefully.
|
| 929 |
""")
|
| 930 |
go_to_test_btn = gr.Button("Start the Test", variant="primary")
|