Spaces:

aliss77777
/

IFX-trace-implementation

No application file

App Files Files Community

IFX-trace-implementation / api /trace_implementation.md

Liss, Alex (NYC-HUG)

Add trace implementation and documentation; removed old files

0698457 7 months ago

preview code

raw

history blame contribute delete

13.8 kB

	# Freeplay Traces Implementation Plan

	## Overview

	This document outlines the implementation plan for integrating Freeplay Traces into the existing Huge League Soccer application. Traces provide enhanced observability by grouping LLM interactions within a session, allowing us to track input questions, LLM responses, and associated metadata.

	## Current Architecture

	### Existing Freeplay Integration Points

	1. Session Management (`api/server_gradio.py`)
	- `AppState.ensure_sessions()` - Creates Freeplay session on first user interaction
	- Session ID preserved across state changes

	2. Prompt Management (`api/workflows/base.py`)
	- `call_model()` - Retrieves persona-specific prompts via `get_prompt_by_persona()`
	- Uses Freeplay prompt templates (casual_fan_prompt, super_fan_prompt)

	3. Session Recording (`api/workflows/base.py`)
	- `should_continue()` - Records final LLM responses to Freeplay session
	- Uses `record_session()` method

	## Proposed Traces Implementation

	### AppState Enhancement

	The `AppState` class will be enhanced to store the actual Freeplay session object, allowing direct access to `session.create_trace()` as shown in the original documentation.

	### Modified Methods

	```python
	# api/utils/freeplay_helpers.py - Modified method
	def record_session(self, state, end: Optional[float] = time.time(),
	formatted_prompt: Optional[FormattedPrompt] = None,
	prompt_vars: Optional[dict] = None,
	trace_info: Optional[TraceInfo] = None): # NEW PARAMETER
	# ... existing code ...
	payload = RecordPayload(
	# ... existing fields ...
	trace_info=trace_info # NEW FIELD
	)
	```

	## Implementation Flow

	### 1. AppState Enhancement
	Location: `api/server_gradio.py` - `AppState` class

	```python
	class AppState(BaseModel):
	# ... existing fields ...
	freeplay_session_id: str = ""
	freeplay_session: Optional[Any] = None # NEW: Store the actual session object

	def ensure_sessions(self):
	if not self.zep_session_id:
	self.zep_session_id = ZepClient() \
	.get_or_create_user(self.email, self.first_name, self.last_name) \
	.create_session() \
	.session_id
	if not self.freeplay_session_id:
	freeplay_client = FreeplayClient()
	self.freeplay_session = freeplay_client.create_session() # Store session object
	self.freeplay_session_id = self.freeplay_session.session_id
	```

	### 2. User Input & Trace Creation
	Location: `api/server_gradio.py` - `submit_helper()`

	```python
	def submit_helper(state, handler, user_query):
	state.ensure_sessions() # Freeplay session already exists

	# Create trace directly on the session object
	trace_info = state.freeplay_session.create_trace(
	input=user_query, # optional metadata not included
	)

	# Pass trace_info to workflow
	workflow_bundle, workflow_state = build_workflow_with_state(
	handler=handler,
	zep_session_id=state.zep_session_id,
	freeplay_session_id=state.freeplay_session_id,
	email=state.email,
	first_name=state.first_name,
	last_name=state.last_name,
	persona=state.persona,
	messages=state.history,
	trace_info=trace_info, # NEW: Pass trace_info to workflow
	)
	```

	### 3. Workflow State Enhancement
	Location: `api/workflows/base.py` - `AgentState`

	```python
	class AgentState(TypedDict):
	# ... existing fields ...
	trace_info: Optional[TraceInfo] = None # NEW
	```

	### 4. Workflow Builder Enhancement
	Location: `api/workflows/base.py` - `build_workflow_with_state()`

	```python
	def build_workflow_with_state(handler: AsyncCallbackHandler,
	zep_session_id: Optional[str] = None,
	freeplay_session_id: Optional[str] = None,
	email: Optional[str] = None,
	first_name: Optional[str] = None,
	last_name: Optional[str] = None,
	persona: Optional[str] = None,
	messages: Optional[List[BaseMessage]] = None,
	trace_info: Optional[TraceInfo] = None) -> Tuple[WorkflowBundle, AgentState]: # NEW PARAMETER
	"""
	Utility to build workflow and initial state in one step.
	"""
	bundle = build_workflow(handler)
	state = {
	"zep_session_id": zep_session_id,
	"freeplay_session_id": freeplay_session_id,
	"email": email,
	"first_name": first_name,
	"last_name": last_name,
	"persona": persona,
	"messages": messages or [],
	"start_time": time.time(),
	"zep_memory": None,
	"trace_info": trace_info, # NEW: Populate trace_info in AgentState
	}
	return bundle, state
	```

	### 5. LLM Processing with Trace Association
	Location: `api/workflows/base.py` - `call_model()`

	```python
	async def call_model(state: AgentState,
	handler: AsyncCallbackHandler,
	zep_client: ZepClient,
	freeplay_client: FreeplayClient) -> dict:

	# ... existing prompt and LLM logic ...

	# Record trace output with LLM response
	if state.get("trace_info"):
	final_response = response.content if hasattr(response, 'content') else str(response)
	state["trace_info"].record_output(
	project_id=FREEPLAY_PROJECT_ID,
	output=final_response
	)

	return {'messages': [response], 'zep_memory': memory}
	```

	### 6. Session Recording with Trace
	Location: `api/workflows/base.py` - `should_continue()`

	```python
	async def should_continue(state: AgentState,
	handler: AsyncCallbackHandler,
	zep_client: ZepClient,
	freeplay_client: FreeplayClient) -> str:
	# ... existing logic ...

	if 'tool_calls' not in last_message.additional_kwargs:
	# Record session with trace association
	freeplay_client.record_session(state, trace_info=state.get("trace_info"))
	return 'end'
	return 'continue'
	```

	## Data Flow Diagram

	```
	┌─────────────────┐ ┌──────────────────┐ ┌─────────────────┐
	│ User Input │ │ submit_helper │ │ AppState │
	│ │ │ │ │ │
	│ "Tell me about │───▶│ 1. ensure_sessions│───▶│ freeplay_session│
	│ Ryan Martinez" │ │ 2. create_trace() │ │ .create_trace() │
	└─────────────────┘ │ 3. pass trace_info│ │ │
	└──────────────────┘ └─────────────────┘
	│ │
	▼ ▼
	┌──────────────────┐ ┌─────────────────┐
	│ build_workflow_ │ │ Trace Info │
	│ with_state() │ │ │
	│ │ │ - trace_id │
	│ - trace_info │ │ - input │
	│ - user_query │ │ - metadata │
	└──────────────────┘ └─────────────────┘
	│
	▼
	┌──────────────────┐
	│ AgentState │
	│ │
	│ - trace_info │
	│ - messages │
	│ - persona │
	└──────────────────┘
	│
	▼
	┌──────────────────┐
	│ Workflow │
	│ Execution │
	│ │
	│ call_model() │
	│ - get_prompt() │
	│ - LLM call │
	│ - record_output()│
	└──────────────────┘
	│
	▼
	┌──────────────────┐
	│ should_continue()│
	│ │
	│ record_session() │
	│ with trace_info │
	└──────────────────┘
	│
	▼
	┌──────────────────┐
	│ Freeplay Session │
	│ Recording │
	│ │
	│ - Session ID │
	│ - Trace Info │
	│ - Input/Output │
	└──────────────────┘
	```

	## Trace Lifecycle

	### 1. Creation Phase
	- Trigger: User submits question
	- Location: `submit_helper()`
	- Action: Create trace with user input and metadata
	- Output: `TraceInfo` object with `trace_id`

	### 2. Association Phase
	- Trigger: Workflow initialization
	- Location: `build_workflow_with_state()`
	- Action: Pass `trace_info` to workflow state
	- Output: `AgentState` with trace context

	### 3. Processing Phase
	- Trigger: LLM interaction
	- Location: `call_model()`
	- Action: Record LLM output to trace
	- Output: Trace with input/output pair

	### 4. Recording Phase
	- Trigger: Workflow completion
	- Location: `should_continue()`
	- Action: Associate trace with session recording
	- Output: Complete session with trace linkage

	## Benefits

	### Enhanced Observability
	- Input/Output Tracking: Complete visibility of user questions and LLM responses
	- Metadata Association: Persona, user info, and interaction context
	- Performance Metrics: Response times, tool usage, and evaluation results

	### Debugging & Analysis
	- Trace Isolation: Individual interactions can be analyzed separately
	- Session Context: Traces maintain session-level context
	- Evaluation Support: Built-in support for feedback and evaluation metrics

	### Backward Compatibility
	- Optional Implementation: Traces are opt-in, existing functionality unchanged
	- Graceful Degradation: System works without traces if disabled
	- Incremental Rollout: Can be enabled per user or session

	## Implementation Phases

	### Phase 1: AppState Enhancement
	1. Add `freeplay_session` property to `AppState` class
	2. Modify `ensure_sessions()` to store session object
	3. Test session object storage and retrieval

	### Phase 2: Workflow Integration
	1. Update `AgentState` to include `trace_info` field
	2. Modify `build_workflow_with_state()` to accept `trace_info` parameter
	3. Update `call_model()` to record trace outputs
	4. Update `should_continue()` to associate traces with sessions

	### Phase 3: Server Integration
	1. Update `submit_helper()` to create traces using session object
	2. Pass `trace_info` through workflow initialization
	3. Test complete trace lifecycle

	### Phase 4: Session Recording Enhancement
	1. Modify `record_session()` to accept `trace_info` parameter
	2. Update `RecordPayload` to include trace information
	3. Test session recording with trace association

	### Phase 5: Testing & Validation
	1. Unit tests for trace creation and recording
	2. Integration tests for workflow trace integration
	3. End-to-end tests for complete trace lifecycle
	4. Performance impact assessment

	## Risk Mitigation

	### High Risk Areas
	1. Session State Management
	- Risk: Breaking existing session flow
	- Mitigation: Optional trace parameters, backward compatibility

	2. Workflow State Changes
	- Risk: Breaking existing state structure
	- Mitigation: Optional fields with defaults, thorough testing

	### Medium Risk Areas
	1. Performance Impact
	- Risk: Slowing down workflow execution
	- Mitigation: Lazy trace creation, optional features

	2. Data Consistency
	- Risk: Trace/recording mismatches
	- Mitigation: Validation and error handling

	## Success Criteria

	1. Functionality: Traces created and recorded for all user interactions
	2. Performance: No measurable impact on workflow execution time
	3. Compatibility: All existing functionality continues to work
	4. Observability: Complete trace data available in Freeplay dashboard
	5. Reliability: Graceful handling of trace creation/recording failures

	## Next Steps

	1. User Approval: Review and approve implementation plan
	2. Phase 1 Implementation: Begin with core infrastructure
	3. Testing Strategy: Validate each phase before proceeding
	4. Documentation: Update API documentation with trace features
	5. Monitoring: Implement trace-specific logging and metrics