"""Layer 1 — RL Prompt Optimizer (GRPO via TRL + Unsloth)."""