tinykavi's picture
Add writing_pattern_classifier package for live demo
5548ff6

A newer version of the Gradio SDK is available: 6.9.0

Upgrade

Sinhala Dyslexic Writing Pattern Taxonomy

This document defines the interpretable dyslexic writing-pattern taxonomy used in this project.

The taxonomy is derived from surface-level orthographic and phonetic deviations observed in Sinhala dyslexic writing.


1. Orthographic Instability

Definition:
Inconsistent or incorrect written forms of characters without strong phonetic substitution.

Surface Signals:

  • Character omission
  • Character addition
  • Diacritic loss
  • Inconsistent spelling

Example:

  • Clean: රුපියල් දෙදාහක් තියෙනවා
  • Dyslexic: රුපියල් දෙදාහක් තියනව

2. Phonetic Confusion

Definition:
Errors that reflect confusion between phonologically similar sounds.

Surface Signals:

  • Character substitution
  • Phonetically similar replacements

Example:

  • Clean: ගණිත
  • Dyslexic: ගනිත

3. Word Boundary Confusion

Definition:
Difficulty maintaining correct word segmentation.

Surface Signals:

  • Word merges
  • Extra spaces
  • Missing spaces

4. Mixed Dyslexic Pattern

Definition:
Presence of multiple dyslexic patterns within the same sentence or essay.

Criteria:

  • More than one dominant surface error type

5. No Dominant Pattern

Definition:
No consistent dyslexic pattern detected or very low error density.


Notes

  • Patterns are assigned using rule-based dominance logic.
  • This system prioritizes explainability over raw accuracy.