Spaces: divvun-tts/tts-evaluation


asuni committed on Oct 30, 2025

Commit f2df341 · verified · 1 Parent(s): 462b3f0

Upload 2 files


Files changed (2)

config_mos.yaml +37 -0
config_original.yaml +93 -0

config_mos.yaml ADDED

	@@ -0,0 +1,37 @@

+# Configuration for a standard Mean Opinion Score (MOS) test.
+title: "MOS Test - Audio Quality Evaluation"
+header_markdown: "Listen to the audio sample and rate its overall quality on a scale of 1 to 5."
+instructions_markdown: |
+  **Welcome, Annotator!**
+  These are the instructions for the MOS test.
+  Please follow these steps carefully:
+  1.  Enter your unique **Annotator ID** before you begin.
+  2.  Listen to each audio clip from start to finish.
+  3.  Rate the clip using the slider provided, based on the scoring guide.
+  4.  Provide any extra details in the comments box.
+  5.  Click 'Save & Next' to submit your rating and load the next clip.
+# The directory where your audio files are stored.
+samples_directory: "sample-audios"
+# Set to 'true' to shuffle the audio files, 'false' for alphabetical order.
+randomize_samples: true
+# MOS tests typically use a single criterion for overall quality.
+criteria:
+  - label: "Overall Quality"
+    min: 1
+    max: 5
+    step: 1
+    default: 3
+    # These are standard definitions for the 5-point Absolute Category Rating (ACR) scale.
+    explanations:
+      1: "Bad - The quality is very distracting and unpleasant."
+      2: "Poor - The quality is distracting and annoying."
+      3: "Fair - The quality is slightly distracting, but acceptable."
+      4: "Good - The quality is not distracting; it is fine."
+      5: "Excellent - The quality is flawless and natural."
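Ratings collected with this single-criterion config are usually aggregated into a mean opinion score per clip or per system. The sketch below shows one way to do that; it is not part of this commit. The `mos_summary` helper and the example ratings are hypothetical, and the 95% interval uses a normal approximation, which is an assumption that becomes rough for small listener panels.

```python
from math import sqrt
from statistics import mean, stdev


def mos_summary(ratings, lo=1, hi=5):
    """Return (mean, 95% CI half-width) for ratings on the lo..hi scale.

    Hypothetical helper: the scale bounds default to the min/max
    values used in config_mos.yaml.
    """
    if len(ratings) < 2:
        raise ValueError("need at least two ratings to estimate an interval")
    for r in ratings:
        if not (lo <= r <= hi):
            raise ValueError(f"rating {r} is outside the {lo}-{hi} scale")
    m = mean(ratings)
    # Normal approximation (z = 1.96); a t-quantile suits small panels better.
    half_width = 1.96 * stdev(ratings) / sqrt(len(ratings))
    return m, half_width


# Hypothetical ratings from eight annotators for one clip.
ratings = [4, 5, 3, 4, 4, 5, 3, 4]
score, ci = mos_summary(ratings)
print(f"MOS = {score:.2f} +/- {ci:.2f}")
```

Reporting the interval alongside the mean makes it easier to judge whether a difference between two systems is larger than the listener-to-listener spread.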

config_original.yaml ADDED

	@@ -0,0 +1,93 @@

+# General UI Configuration
+title: "TTS Rubric — Dynamic Evaluation"
+instructions_markdown: |
+  **Welcome, Annotator!**
+  These are the instructions for the multi-aspect test.
+  Please follow these steps carefully:
+  1.  Enter your unique **Annotator ID** before you begin.
+  2.  Listen to each audio clip from start to finish.
+  3.  Rate the clip using the sliders provided, based on the scoring guide.
+  4.  Provide any extra details in the comments box.
+  5.  Click 'Save & Next' to submit your rating and load the next clip.
+# The directory where your audio files are stored.
+samples_directory: "sample-audios"
+# Set to 'true' to shuffle the audio files, 'false' for alphabetical order.
+randomize_samples: true
+# Define the evaluation criteria. The UI will be built from this list.
+criteria:
+  - label: "Clarity & Intelligibility"
+    min: 1
+    max: 5
+    step: 1
+    default: 3
+    explanations:
+      1: "Unacceptable."
+      2: "Often unclear or distorted; difficult to follow."
+      3: "Understandable but requires effort; some words unclear."
+      4: "Mostly clear, minor issues (with fast/slow playback)."
+      5: "Speech is clear, easy to understand (at all speeds)."
+  - label: "Accent & Pronunciation"
+    min: 1
+    max: 5
+    step: 1
+    default: 3
+    explanations:
+      1: "Severe pronunciation problems; largely unintelligible."
+      2: "Frequent pronunciation issues that impede understanding."
+      3: "Some mispronunciations that require effort to interpret."
+      4: "Minor pronunciation quirks but overall fine."
+      5: "Pronunciation is natural and appropriate for the target dialect."
+  - label: "Tone & Suitability"
+    min: 1
+    max: 5
+    step: 1
+    default: 3
+    explanations:
+      1: "Tone is inappropriate or harmful for the content."
+      2: "Tone often feels off or distracting from the content."
+      3: "Tone is acceptable but occasionally inappropriate."
+      4: "Generally appropriate tone with small mismatches."
+      5: "Tone fits the content and use-case perfectly."
+  - label: "Voice Quality"
+    min: 1
+    max: 5
+    step: 1
+    default: 3
+    explanations:
+      1: "Unusable voice quality."
+      2: "Poor quality with frequent artifacts."
+      3: "Noticeable quality issues but still usable."
+      4: "Minor artifacts but overall high quality."
+      5: "Natural, pleasant voice with no artifacts."
+  - label: "Customization & Flexibility"
+    min: 1
+    max: 5
+    step: 1
+    default: 3
+    explanations:
+      1: "No useful customization; inflexible."
+      2: "Very limited or brittle customization options."
+      3: "Limited customization; acceptable for simple use-cases."
+      4: "Some customization available; works well for most cases."
+      5: "Highly flexible and customizable for different styles."
+  - label: "Listening Comfort"
+    min: 1
+    max: 5
+    step: 1
+    default: 3
+    explanations:
+      1: "Uncomfortable or painful to listen to."
+      2: "Often fatiguing or distracting to listen to."
+      3: "Some listening fatigue; tolerable for short durations."
+      4: "Mostly comfortable with occasional sharpness or fatigue."
+      5: "Comfortable to listen to for extended periods."
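Because the UI is built from the `criteria` list, a malformed entry (a default outside the range, or a scale point with no explanation) would only show up at runtime. The sketch below is one possible sanity check and is not code from this Space. `validate_criterion` is a hypothetical helper, and it assumes the YAML has already been parsed into Python dictionaries with integer keys under `explanations`, as a typical YAML loader would produce for this file.

```python
def validate_criterion(criterion):
    """Return a list of problems found in one criterion mapping (empty if none).

    Hypothetical helper; assumes integer min/max/step/default values.
    """
    label = criterion.get("label", "<unlabeled>")
    lo, hi, step = criterion["min"], criterion["max"], criterion["step"]
    default = criterion["default"]
    if step <= 0 or lo >= hi:
        return [f"{label}: need min < max and step > 0"]
    problems = []
    # The default must be a position the slider can actually take.
    if not (lo <= default <= hi) or (default - lo) % step != 0:
        problems.append(f"{label}: default {default} is not a valid slider position")
    # Every reachable scale point should have an explanation string.
    expected = set(range(lo, hi + 1, step))
    missing = expected - set(criterion.get("explanations", {}))
    if missing:
        problems.append(f"{label}: no explanation for {sorted(missing)}")
    return problems


# Mirrors one entry from config_original.yaml.
good = {
    "label": "Clarity & Intelligibility",
    "min": 1, "max": 5, "step": 1, "default": 3,
    "explanations": {1: "a", 2: "b", 3: "c", 4: "d", 5: "e"},
}
# Deliberately broken entry, for illustration only.
broken = {
    "label": "Broken",
    "min": 1, "max": 5, "step": 1, "default": 6,
    "explanations": {1: "a", 2: "b", 3: "c", 4: "d"},
}
print(validate_criterion(good))
print(validate_criterion(broken))
```

Running a check like this over every entry in `criteria` before the app starts would surface config typos early instead of leaving an annotator with an unexplained slider value.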