File size: 984 Bytes
ed6bec6 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 |
# Dataset Generation Guide
This guide explains how to generate new training examples for the Glyphic Language.
---
# 1. Dictionary‑Driven Generation
All examples must be derived from:
- dictionary entries
- syntax rules
- BNF grammar
No freeform glyph usage is allowed.
---
# 2. Example Types
## 2.1 Atomic Examples
Single glyph → meaning
Meaning → single glyph
## 2.2 Scene Examples
Full sequences with:
- actor
- action
- object
- modifiers
- context
## 2.3 Negative Examples
Invalid sequences for syntax training.
## 2.4 Symbolic Examples
Mythic, emotional, sensory, or social scenes.
---
# 3. Generation Process
1. Select glyph(s) from dictionary
2. Build a valid sequence using syntax rules
3. Generate structured meaning
4. Generate natural language description
5. Add to appropriate dataset file
---
# 4. Quality Requirements
- No ambiguity
- No hallucinated glyphs
- No missing roles
- No invalid ordering
- No duplicate examples
|