Spaces:

saumilyajj
/

driftcall

Sleeping

App Files Files Community

driftcall / docs /modules /task_generator.md

saumilyajj

Upload folder using huggingface_hub

f2df60e verified 22 days ago

preview code

raw

history blame contribute delete

29.3 kB

task_generator — Procedural Task-Brief Generator

Module path: driftcall/task_generator.py Owner: Person A (Environment) Implements: DESIGN.md §4.2 (reset() semantics), §8 (Dataset Strategy — §8.2, §8.3, §8.4), §10.3 (curriculum language mix) Consumed by: driftcall/env.py (DriftCallEnv.reset()) Status: Design spec — no code yet.

1. Purpose

task_generator is the deterministic, seeded source of every GoalSpec consumed by DriftCallEnv.reset(). It expands a small hand-authored template library (4 domains × 5 templates × 10 source cities × 10 destinations × 5 languages × 20 drift-compatible slot combinations = 200,000 distinct episode variants, DESIGN.md §8.4) into concrete per-episode briefs.

One call — generate(seed, stage, language_weights) — returns a single fully-populated GoalSpec with:

A domain (airline | cab | restaurant | hotel) chosen deterministically from seed.
A template variant for that domain, filled with sampled slots (cities, dates, budgets, time windows, dietary flags, etc.).
A language picked from the caller-supplied language_weights distribution.
A seed_utterance — the natural-language voice brief in the chosen language, with Unicode-correct Devanagari / Tamil / Kannada script and Hinglish Roman transliteration.
slots and constraints dicts suitable for the reward graders (R1 task completion, R3 constraint adherence — DESIGN.md §7.1).

Determinism is the contract. Identical (seed, stage, language_weights) triples always produce identical GoalSpecs, byte-for-byte after NFC normalization. This enables reproducible training, reproducible evals, and reproducible drift scheduling downstream (DESIGN.md §6.2 — drift schedules are themselves seeded off the same episode ID).

The generator owns no random global state. Every stochastic choice threads through random.Random(seed_for_this_decision) where the sub-seed is derived from (seed, decision_tag) via a stable hash. It does not own drift selection — that belongs to drift_injector (DESIGN.md §6), which receives the same seed and composes its own schedule against the GoalSpec.domain.

2. Interface

All types are imported from driftcall.models (see docs/modules/models.md). All dataclasses are frozen.

2.1 Primary entry point

from __future__ import annotations
from driftcall.models import GoalSpec, LanguageCode

def generate(
    seed: int,
    stage: Literal[1, 2, 3],
    language_weights: dict[LanguageCode, float],
) -> GoalSpec:
    """
    Produce a single fully-populated GoalSpec for episode ``seed`` at curriculum ``stage``.

    Determinism: identical (seed, stage, language_weights) ⇒ identical GoalSpec
    after Unicode NFC normalization of ``seed_utterance``.

    :param seed:              non-negative int, episode identifier; also the root
                              seed for all sub-choices (domain, template, slots,
                              language, utterance variant).
    :param stage:             curriculum stage ∈ {1, 2, 3}; affects allowed
                              template complexity (stage 1 uses simple templates
                              only; stage 3 enables drift-compatible slots).
    :param language_weights:  normalized distribution over LanguageCode keys;
                              values must be non-negative and sum to 1.0 ± 1e-6.

    :returns: GoalSpec whose .seed_utterance is NFC-normalized UTF-8.

    :raises InvalidLanguageWeightError: weights empty, negative, or sum ≠ 1.0.
    :raises InvalidStageError:          stage ∉ {1, 2, 3}.
    :raises InvalidBudgetError:         sampled budget outside template's declared
                                        [low, high] range (indicates corrupt template).
    :raises MissingSlotError:           template variant references a {slot}
                                        placeholder not present in the filled slot dict.
    :raises TemplateFileMissingError:   ``data/task_briefs/templates.yaml`` not found
                                        or malformed.
    :raises UnicodeNormalizationError:  rendered utterance fails NFC round-trip check
                                        (raised defensively — should never fire in practice).
    """

2.2 Helper signatures (all module-private except where noted)

# --- template loader (public for tests + corpus packaging) ---
def load_templates(path: Path | str = "data/task_briefs/templates.yaml") -> TemplateLibrary:
    """
    Parse the YAML template file, validate the schema (§4 below),
    and return an in-memory TemplateLibrary.

    Called once at module import via a lazy singleton; callers should use
    ``_get_library()`` inside the module. Exposed publicly for unit tests
    and the dataset-packaging script that writes ``train/briefs.jsonl``
    (DESIGN.md §8.6).

    :raises TemplateFileMissingError:   path does not exist.
    :raises TemplateSchemaError:        YAML present but fails schema validation
                                        (missing required key, wrong type, etc.).
    """

# --- domain + template picker ---
def _pick_domain(seed: int) -> Literal["airline", "cab", "restaurant", "hotel"]:
    """Uniform over 4 domains, seeded by hash(seed, 'domain')."""

def _pick_template(seed: int, stage: int, domain: str, library: TemplateLibrary) -> Template:
    """
    Uniform over templates for ``domain`` whose ``min_stage`` ≤ ``stage``.
    Seeded by hash(seed, 'template').
    """

# --- slot expander ---
def _expand_slots(seed: int, template: Template) -> SlotGrid:
    """
    For each slot in the template's required_slots + optional_slots + constraints_template,
    sample one concrete value per the slot's declared distribution.

    Returns a SlotGrid: a frozen mapping of slot-name -> concrete value.

    Handles:
      - enum slots (``choices: [...]``)
      - uniform numeric ranges (``distribution: uniform, low, high, step``)
      - city slots (from the 10×10 city/destination grid, domain-filtered)
      - date slots (relative to a fixed reference date, DESIGN.md §11.1 — deterministic)
      - boolean slots (veg_only, etc.)
    """

# --- language picker ---
def _pick_language(seed: int, language_weights: dict[LanguageCode, float]) -> LanguageCode:
    """
    Weighted draw from ``language_weights`` seeded by hash(seed, 'language').
    ``language_weights`` is validated by ``generate()`` before this is called.
    """

# --- utterance formatter ---
def _format_utterance(
    seed: int,
    template: Template,
    slots: SlotGrid,
    language: LanguageCode,
) -> str:
    """
    Pick one of the template.language_variants[language] strings (uniform,
    seeded by hash(seed, 'variant')), substitute every {slot} placeholder
    with the Unicode-correct rendering of slots[slot], and return the
    NFC-normalized result.

    :raises MissingSlotError:          format string references {X} but X not in slots.
    :raises UnicodeNormalizationError: NFC round-trip fails.
    """

# --- public helper: list all (seed, stage, lang_weights) combos for dataset packaging ---
def enumerate_variants(
    limit: int | None = None,
    stage: int = 3,
    language_weights: dict[LanguageCode, float] | None = None,
) -> Iterator[GoalSpec]:
    """
    Deterministic walk over the procedural grid, yielding up to ``limit``
    GoalSpecs. Used by DESIGN.md §8.6 to produce ``train/briefs.jsonl``
    and ``val/briefs.jsonl``. Not called from env.reset().

    Walk order: domain (4) → template (5) → from×to (10×10) → language (5)
    → utterance variant. Stable across runs.
    """

3. Behavior Spec

3.1 Determinism via seed (DESIGN.md §4.2, §8.4)

Every sub-decision uses random.Random(stable_sub_seed(seed, tag)) where stable_sub_seed is int.from_bytes(hashlib.blake2b(f"{seed}:{tag}".encode(), digest_size=8).digest(), "big").
Valid tags: "domain", "template", "slots", "language", "variant", plus per-slot tags f"slot:{slot_name}".
Never call random.random() (global state) or time.time() anywhere in the module.
generate(42, 1, W) on two machines with identical Python versions returns byte-identical GoalSpec.seed_utterance after NFC normalization.

3.2 Language-weight sampling

language_weights is the caller's contract for the curriculum mix (DESIGN.md §10.3 defines Stage-1 50/30/20 and Stage-2/3 30/30/20/10/10 splits).
generate() validates weights before sampling. Each check binds to exactly one exception:
- Unsupported key — any key ∉ {"hi", "ta", "kn", "en", "hinglish"} (LanguageCode) → raises InvalidLanguageError.
- Empty weights dict — len(language_weights) == 0 → raises InvalidLanguageWeightError.
- Negative weight value — any w < 0 → raises InvalidLanguageWeightError.
- Sum outside tolerance — |sum(weights) − 1.0| > 1e-6 → raises InvalidLanguageWeightError.
- All weights zero — defensive assertion; redundant with the sum-check (if sum = 1 ± 1e-6 and all ≥ 0, at least one must be > 0) but kept as an explicit invariant that guards against floating-point edge cases where sum rounds to 1 via noise while every entry is 0. Raises InvalidLanguageWeightError.
_pick_language uses random.Random(sub_seed).choices(population, weights=w, k=1)[0].

3.3 Slot combinatorial grid (DESIGN.md §8.4)

Each template declares required_slots, optional_slots, and constraints_template (§4 below).
The source × destination city grid is domain-scoped: airline + hotel draw from inter-city pairs; cab + restaurant draw from intra-city locations. Both lists are 10 entries each per domain (40 total unique cities across domains, deduped in the YAML).
Optional slots are included with probability 0.5 (seeded).
Date slots are sampled relative to a fixed reference date 2026-04-25 from a 60-day forward window (so train/val sets are temporally stable).
Budget slots sample on the declared step grid: e.g., uniform 3000..15000 step 500 yields one of {3000, 3500, …, 15000}.
Stage 1 uses templates flagged min_stage: 1; stages 2–3 also admit min_stage: 2 and min_stage: 3 (more complex compound-constraint templates with drift-compatible slot layouts).

3.4 Unicode handling for Hindi, Tamil, Kannada

Template YAML is authored in NFC (Unicode Normalization Form C). The loader re-normalizes on read (defensive).
After slot substitution, _format_utterance calls unicodedata.normalize("NFC", s) and asserts unicodedata.is_normalized("NFC", s) — if not, raises UnicodeNormalizationError.
City names, dish names, and day-of-week translations for Hindi / Tamil / Kannada live in a static lookup table (data/task_briefs/i18n.yaml, loaded by load_templates). English + Hinglish share Roman script with Devanagari-free glyphs (ASCII + ₹).
i18n.yaml is NFC-normalized at load time. load_templates applies unicodedata.normalize("NFC", v) to every string value parsed out of data/task_briefs/i18n.yaml (city names, weekday names, dish names, domain-specific nouns — across hi, ta, kn, en, hinglish) before those strings are stored in TemplateLibrary.i18n. The same NFC pass is applied to every string inside templates.yaml (variant strings, choices enums, slot labels). Consequence: every string that _expand_slots pulls into a SlotGrid is already NFC, so downstream consumers — _format_utterance, reward R1 string-equality comparisons (DESIGN.md §7.1), and audit logging — may assume NFC without re-normalizing.
Hinglish is always Roman-script (no mixed scripts); Hindi is always Devanagari-script. A template that tries to mix the two in a single variant is rejected at load time.

3.5 Stage-aware complexity

Stage	Templates allowed	Compound constraints	Drift-compatible slot layout
1	`min_stage: 1` only (simple: domain + 1 required slot + up to 2 constraints)	No	No — slots chosen from v1-schema-compatible fields only
2	`min_stage` ≤ 2	Up to 2 constraints	Slots cover fields likely to be renamed (`price`, `fare_inr`) so drift is observable
3	all templates	Up to 3 constraints	Slots must include ≥ 1 field that a Stage-3 compound drift will touch

"Drift-compatible slot layout" is a static property of the template (declared in YAML via drift_slot_tags: [price, passenger_count, …]) — the generator does not itself pick drifts; it only guarantees the slot surface is rich enough for drift_injector to have something meaningful to mutate.

3.6 Invariants (enforced by tests)

generate(s, k, w) == generate(s, k, w) for any valid (s, k, w).
The returned GoalSpec.language appears in language_weights with weight > 0.
Every {slot} placeholder in seed_utterance is resolved — no literal {…} survives in the output.
GoalSpec.seed_utterance is in NFC.
Stage 1 never yields a template with min_stage > 1.
Numeric constraints (e.g., budget_inr) fall in the template's declared [low, high] range.
seed_utterance length ≤ 280 characters (one SMS; keeps ASR inputs bounded at deploy time — DESIGN.md §9).
Every string value in SlotGrid.values is NFC-normalized before generate() returns (guaranteed by the i18n.yaml + templates.yaml NFC pass in load_templates, §3.4). Reward R1 (string equality) and other downstream consumers may assume NFC on every slot string — they do not need to re-normalize.

4. Data Structures

4.1 Template YAML schema (matches DESIGN.md §8.3 exactly)

# data/task_briefs/templates.yaml
- template_id: airline.book.budget_timewindow
  domain: airline                 # {airline, cab, restaurant, hotel}
  intent: book_flight             # free string; mirrored into GoalSpec.intent
  min_stage: 1                    # 1 | 2 | 3
  required_slots: [from, to, when]
  optional_slots: [seat_pref]
  constraints_template:
    budget_inr:
      distribution: uniform
      low: 3000
      high: 15000
      step: 500
    time_window:
      choices: [morning, afternoon, evening, late_night]
  drift_slot_tags: [price, total_fare_inr]  # used by drift_injector for targeting
  # Language keys are ISO short codes matching LanguageCode = Literal["hi","ta","kn","en","hinglish"].
  # Long names (hindi/tamil/kannada/english) are NOT accepted — loader rejects them via TemplateSchemaError.
  language_variants:
    hinglish:
      - "Bhai {when} ko {to} jaana hai, cheapest flight {time_window} mein, {budget_inr} rupees max"
      - "{when} ko {from} se {to} ka ticket book kar de, under {budget_inr}, {time_window} ke baad"
    hi:
      - "मुझे {when} को {from} से {to} जाना है, {budget_inr} रुपये से कम में"
    ta:
      - "{when} அன்று {from} லிருந்து {to} க்கு டிக்கெட் வேண்டும், {budget_inr} ரூபாய்க்கு கீழ்"
    kn:
      - "{when} ರಂದು {from} ಇಂದ {to} ಗೆ ಅಗ್ಗದ ವಿಮಾನ ಟಿಕೆಟ್ ಬೇಕು, {budget_inr} ರೂಪಾಯಿಗಳ ಒಳಗೆ"
    en:
      - "Book the cheapest flight from {from} to {to} on {when}, budget under ₹{budget_inr}, departing {time_window}"

4.2 In-memory types

from __future__ import annotations
from dataclasses import dataclass
from typing import Literal, Mapping

LanguageCode = Literal["hi", "ta", "kn", "en", "hinglish"]
Domain = Literal["airline", "cab", "restaurant", "hotel"]

@dataclass(frozen=True)
class SlotDistribution:
    """Either an enum (``choices``) or a uniform numeric grid (``low``, ``high``, ``step``)."""
    kind: Literal["choices", "uniform"]
    choices: tuple[str, ...] | None = None
    low: float | None = None
    high: float | None = None
    step: float | None = None

@dataclass(frozen=True)
class Template:
    template_id: str
    domain: Domain
    intent: str
    min_stage: Literal[1, 2, 3]
    required_slots: tuple[str, ...]
    optional_slots: tuple[str, ...]
    constraints_template: Mapping[str, SlotDistribution]
    drift_slot_tags: tuple[str, ...]
    language_variants: Mapping[LanguageCode, tuple[str, ...]]  # ≥ 1 string per language

@dataclass(frozen=True)
class TemplateLibrary:
    templates: tuple[Template, ...]
    cities_by_domain: Mapping[Domain, tuple[str, ...]]
    i18n: Mapping[LanguageCode, Mapping[str, str]]  # e.g., {"hi": {"BLR": "बेंगलुरु", …}}

@dataclass(frozen=True)
class SlotGrid:
    """Concrete slot values after expansion. Keys are slot names; values are already
    localized to the chosen language (e.g., city rendered in Devanagari for 'hi')."""
    values: Mapping[str, object]  # str | int | float | bool

@dataclass(frozen=True)
class RawBrief:
    """Intermediate product: slots filled, language chosen, utterance not yet rendered.
    Used internally for testability — generate() returns a GoalSpec, not a RawBrief."""
    template_id: str
    domain: Domain
    intent: str
    slots: SlotGrid
    constraints: Mapping[str, object]
    language: LanguageCode

GoalSpec itself is defined in driftcall/models.py (DESIGN.md §4.1) and is the final product of generate(). The generator copies RawBrief fields into GoalSpec and adds the rendered seed_utterance.

5. Error Modes

All exceptions subclass TaskGeneratorError(Exception). Each is raised exactly once in the module and has a test asserting it.

Exception	Trigger	Where raised
`MissingSlotError`	template variant references `{X}` but X not in filled `SlotGrid`	`_format_utterance`
`InvalidLanguageError`	`language_weights` contains a key ∉ LanguageCode (e.g., `"hindi"`, `"marathi"`)	`generate` (pre-sample validation)
`InvalidLanguageWeightError`	empty dict, OR any value < 0, OR sum ∉ [1−1e-6, 1+1e-6], OR all weights = 0 (defensive, redundant with sum-check)	`generate`
`InvalidStageError`	`stage ∉ {1, 2, 3}`	`generate`
`InvalidBudgetError`	sampled numeric falls outside declared `[low, high]` (indicates corrupt template or step misalignment)	`_expand_slots`
`TemplateFileMissingError`	`data/task_briefs/templates.yaml` absent or unreadable	`load_templates`
`TemplateSchemaError`	YAML present but fails required-key / type / shape validation	`load_templates`
`UnicodeNormalizationError`	NFC round-trip check fails on rendered utterance (defensive)	`_format_utterance`
`NoVariantForLanguageError`	chosen template has no `language_variants[chosen_language]` entry	`_format_utterance`

No silent fallbacks. The generator never substitutes a default city, a default language, or a default template on failure — it raises. The env's reset() is expected to let these propagate (callers catch and restart with a different seed, never mask).

6. Dependencies

6.1 Reads

data/task_briefs/templates.yaml — the template library (§4.1 schema). Authored by hand in Phase D; never modified at runtime. NFC-normalized at load time (§3.4).
data/task_briefs/i18n.yaml — localized strings for city names, weekdays, domain-specific nouns, in Hindi / Tamil / Kannada. Same load path as templates; separate file for readability. load_templates applies unicodedata.normalize("NFC", v) to every string value (§3.4) so that TemplateLibrary.i18n is NFC-clean before any slot expansion runs.

Both files ship inside the Docker image for the env Space (DESIGN.md §11.1).

6.2 Imports

driftcall.models — GoalSpec, LanguageCode, Domain. The generator does not import from env.py, rewards.py, drift_injector.py, or any vendor module. Strict one-way dependency.
Python stdlib: random, hashlib, unicodedata, dataclasses, pathlib, typing.
Third-party: PyYAML (already in requirements.txt per DESIGN.md §11.1).

6.3 Produces

GoalSpec instance returned to DriftCallEnv.reset() (DESIGN.md §4.2).
Iterator[GoalSpec] via enumerate_variants for the dataset-packaging script that writes train/briefs.jsonl and val/briefs.jsonl (DESIGN.md §8.6).

6.4 Consumers

driftcall/env.py::DriftCallEnv.reset — the single production caller of generate().
training/data_export.py (Phase C4) — batch-calls enumerate_variants() to build the HF Hub dataset artifact.
tests/test_task_generator.py — exercises every branch + every error mode.

6.5 Non-dependencies (explicit)

Does not depend on the drift injector. The generator never picks a drift; it only declares drift_slot_tags on the template so the injector can target slots later.
Does not depend on audio pipeline. All output is text; TTS happens at the env boundary (DESIGN.md §9.4).

7. Edge Cases

Missing slot placeholder in a template variant. YAML author writes "Bhai {when} ko {destination} jaana hai" but declares required_slots: [from, to, when] — {destination} has no fill source. Detected in _format_utterance which iterates string.Formatter().parse() over the variant; raises MissingSlotError naming both the template_id and the missing slot. Also caught earlier if possible — load_templates does a static scan and raises TemplateSchemaError at load time so runtime failures are rare.
Invalid language code in language_weights. Caller passes {"marathi": 1.0}. generate validates keys against the LanguageCode literal before any sampling and raises InvalidLanguageError listing the unsupported keys. No partial GoalSpec is constructed.
Budget out of declared range. Template declares uniform 3000..15000 step 500. An implementation bug rounds to step 1000 and yields 16000. _expand_slots post-condition-checks every numeric against [low, high] and raises InvalidBudgetError. This should never fire with the spec implementation but exists as a defense — catching corrupt templates or future implementation regressions during unit tests.
Unicode NFC / NFD collision in Kannada or Tamil. Author pastes a Kannada string copied from macOS (NFD) into templates.yaml. load_templates re-normalizes to NFC on read; _format_utterance final-normalizes the substituted string. A direct byte comparison against the input YAML may differ, but the rendered seed_utterance is guaranteed NFC. UnicodeNormalizationError only fires if the round-trip assertion itself fails (indicates a Python/ICU bug, not a data bug).
Seed collision across episodes. Training loop calls generate(seed=42, …) twice across two different training epochs. Both calls return identical GoalSpecs — that is the contract. Upstream training code is responsible for using non-colliding seeds (e.g., seed = epoch * 10_000 + step); the generator does not deduplicate. Documented in the training spec (docs/modules/training.md, not here).
Language weights sum ≠ 1.0. Caller passes {"en": 0.5, "hi": 0.3} (sum 0.8). generate raises InvalidLanguageWeightError. Rationale: silent renormalization would mask curriculum-config bugs where a language is silently dropped. Caller must normalize explicitly.
Template with zero variants for requested language. _pick_language picks "ta" but the chosen template has no language_variants["ta"]. The generator does not resample language — that would bias the distribution. Instead it raises NoVariantForLanguageError. The template library invariant (enforced at load_templates) is every template has ≥ 1 variant in every LanguageCode; this exception is defense against YAML authoring regressions and is tested via a malformed fixture.
Step-misaligned uniform range. Template declares low: 3000, high: 15000, step: 700. (15000-3000) % 700 ≠ 0 — the grid doesn't cleanly terminate at high. load_templates detects this at load time and raises TemplateSchemaError, preventing runtime surprise.
Negative seed. generate(seed=-1, …) — stable hash handles negatives fine (blake2b accepts any UTF-8 bytes), but by convention the env passes non-negative episode IDs. The generator does not reject negatives; it just uses them verbatim. Documented in the interface docstring.
Very large seed (> 2^63). Same as #9 — blake2b handles arbitrary strings. No overflow.

8. Examples

8.1 Stage-1 airline, English

>>> W = {"en": 1.0, "hi": 0.0, "ta": 0.0, "kn": 0.0, "hinglish": 0.0}
>>> goal = generate(seed=42, stage=1, language_weights=W)
>>> goal.domain
'airline'
>>> goal.intent
'book_flight'
>>> goal.language
'en'
>>> goal.slots
{'from': 'HYD', 'to': 'BLR', 'when': '2026-05-02'}
>>> goal.constraints
{'budget_inr': 7500, 'time_window': 'evening'}
>>> goal.seed_utterance
'Book the cheapest flight from HYD to BLR on 2026-05-02, budget under ₹7500, departing evening'

Determinism check:

>>> generate(42, 1, W) == generate(42, 1, W)
True
>>> generate(42, 1, W).seed_utterance == generate(42, 1, W).seed_utterance
True

8.2 Stage-3 restaurant, Hinglish, drift-compatible slot layout

>>> W = {"en": 0.3, "hi": 0.2, "ta": 0.1, "kn": 0.1, "hinglish": 0.3}
>>> goal = generate(seed=42, stage=3, language_weights=W)
>>> goal.domain
'restaurant'
>>> goal.language
'hinglish'
>>> goal.slots
{'city': 'Mumbai', 'cuisine': 'Biryani', 'when': '2026-05-10T20:00'}
>>> goal.constraints
{'budget_inr': 400, 'veg_only': True, 'min_order_buffer': 100}
>>> goal.seed_utterance
"Bhai tonight Mumbai mein Biryani order karna hai, 400 rupees se kam, veg option chahiye"

This brief's slot surface (budget_inr + veg_only) overlaps the drift patterns restaurant.min_order_bump and restaurant.veg_filter_semantic (DESIGN.md §5.3) — so when drift_injector selects a Stage-3 compound drift, the agent's goal is genuinely affected. That is what "drift-compatible slot layout" means.

8.3 Kannada utterance (Unicode-correct Kannada script, U+0C80–U+0CFF)

>>> W = {"kn": 1.0, "en": 0.0, "hi": 0.0, "ta": 0.0, "hinglish": 0.0}
>>> goal = generate(seed=7, stage=2, language_weights=W)
>>> goal.domain
'airline'
>>> goal.language
'kn'
>>> goal.slots
{'from': 'BLR', 'to': 'MAA', 'when': '2026-05-08'}
>>> goal.constraints
{'budget_inr': 5500}
>>> goal.seed_utterance
'2026-05-08 ರಂದು BLR ಇಂದ MAA ಗೆ ಅಗ್ಗದ ವಿಮಾನ ಟಿಕೆಟ್ ಬೇಕು, 5500 ರೂಪಾಯಿಗಳ ಒಳಗೆ'
>>> import unicodedata
>>> unicodedata.is_normalized("NFC", goal.seed_utterance)
True
>>> # At least one codepoint in the Kannada block (U+0C80–U+0CFF)
>>> any(0x0C80 <= ord(c) <= 0x0CFF for c in goal.seed_utterance)
True
>>> # No Devanagari codepoints leaked in (U+0900–U+097F)
>>> any(0x0900 <= ord(c) <= 0x097F for c in goal.seed_utterance)
False

This example uses the genuine-Kannada-script variant declared in §4.1. City codes (BLR, MAA) remain in Roman because IATA/AAI airport codes are canonical identifiers in every language; full Kannada place names (ಬೆಂಗಳೂರು, ಚೆನ್ನೈ) are available in i18n.yaml and used by variants that reference {from_city_local} instead of {from}.

8.4 Tamil utterance with Devanagari-free script

>>> W = {"ta": 1.0, "en": 0.0, "hi": 0.0, "kn": 0.0, "hinglish": 0.0}
>>> goal = generate(seed=101, stage=2, language_weights=W)
>>> goal.language
'ta'
>>> goal.seed_utterance
'2026-05-04 அன்று HYD லிருந்து BLR க்கு டிக்கெட் வேண்டும், 6500 ரூபாய்க்கு கீழ்'
>>> unicodedata.is_normalized("NFC", goal.seed_utterance)
True
>>> # No Devanagari codepoints (U+0900–U+097F) present
>>> any(0x0900 <= ord(c) <= 0x097F for c in goal.seed_utterance)
False

8.5 Hindi utterance (Devanagari)

>>> W = {"hi": 1.0, "en": 0.0, "ta": 0.0, "kn": 0.0, "hinglish": 0.0}
>>> goal = generate(seed=5, stage=1, language_weights=W)
>>> goal.language
'hi'
>>> goal.seed_utterance
'मुझे 2026-05-01 को DEL से BOM जाना है, 6000 रुपये से कम में'

9. Open Questions

None — spec is complete.

All decisions referenced in §§1–8 follow DESIGN.md §4.1, §4.2, §8.3, §8.4, §10.3 without extension. The generator is a pure function of its inputs; no side effects, no mutable global state, no dependencies on drift or reward subsystems. Edge cases 1–10 cover the full error surface identified during review.

Cross-doc references established:

docs/modules/models.md — GoalSpec, LanguageCode, Domain definitions
docs/modules/drift_injector.md — consumes GoalSpec.domain and template drift_slot_tags to schedule drifts
docs/modules/env.md — calls generate() from DriftCallEnv.reset()
docs/modules/rewards.md — consumes GoalSpec.slots + GoalSpec.constraints for R1 and R3
docs/modules/datasets.md — calls enumerate_variants() to package HF Hub dataset