Spaces:

EvalOps
/

jarvis

Running

App Files Files Community

jarvis / docs /operations /proactive-preference-loop.md

Jonathan Haas

Wave 38: proactive dedupe hardening and runtime decomposition

2ba439c 4 months ago

preview code

Raw

History Blame Contribute Delete

2.05 kB

	# Proactive Triage And Preference Learning

	## Proactive `nudge_decision`

	`proactive_assistant` supports `action="nudge_decision"` to bucket candidate nudges into:

	- `interrupt`
	- `notify`
	- `defer`

	Routing is deterministic and depends on:

	- `policy`: `interrupt \| defer \| adaptive`
	- `quiet_window_active` (explicit arg, otherwise runtime quiet-window policy)
	- candidate urgency inputs (`severity`, `due_at`, `expires_at`, `interrupt_allowed`)
	- optional `context` inputs (`user_busy`, `conversation_active`, `presence_confidence`)
	- `max_dispatch` capacity limit
	- dedupe suppression window (`dedupe_window_sec`, default 600 seconds)

	The response returns per-bucket rows plus summary counts and cumulative proactive counters.
	When context indicates an active/busy interaction or low presence confidence, non-critical interrupts are downgraded to `notify`/`defer`.
	Candidates recently dispatched within the dedupe window are downgraded to `defer` with reason `duplicate_recent_dispatch` to reduce notification spam.

	## Preference Learning Loop

	Conversation runtime now detects explicit user style directives and updates the active voice profile for:

	- `verbosity` (`brief \| normal \| detailed`)
	- `confirmations` (`minimal \| standard \| strict`)
	- `pace` (`slow \| normal \| fast`)
	- `tone` (`auto \| formal \| witty \| empathetic \| direct`)

	When memory is enabled, learned profile state is mirrored to memory summaries (`voice_profile:<user>`). Learned updates are exposed through runtime voice status (`preference_learning`) and observability intent metrics (`preference_update_turns`, `preference_update_fields`).

	## Safety Boundaries

	- Preference learning only triggers on explicit style-oriented directives (not on arbitrary requests).
	- High-risk action safeguards remain unchanged (preview/approval gates, policy checks).
	- Quiet-hour and policy controls still govern whether proactive actions interrupt or defer.
	- Multimodal grounding is advisory for operator transparency and recommendation quality; it does not bypass policy gates.