Intermediate Responses Logging

📋 Overview

The system now saves every intermediate LLM response, including the raw content returned when the LLM requests tools instead of providing a final answer.

🗂️ Directory Structure

session_YYYYMMDD_HHMMSS/
├── Alice/
│   ├── prompts/
│   │   ├── prompt_1.json           # Initial prompt
│   │   └── iterations/
│   │       └── prompt_1_iter2.json # Follow-up with tool results
│   └── responses/
│       ├── response_1.json         # Final response (type: "final")
│       └── intermediate/
│           └── response_1_iter1.json  # NEW! Intermediate response with tool_calls
├── tool_executions.json
└── llm_communication.log
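
As a rough illustration of the layout above, the sketch below builds the paths that one request would be expected to touch. The helper name `expected_paths` is hypothetical and not part of the project. It also assumes that the follow-up prompt is numbered one higher than the intermediate response that triggered it, which is inferred only from the single example in the tree.

```python
from pathlib import Path


def expected_paths(session_dir: Path, player: str, request: int, iteration: int) -> dict[str, Path]:
    """Build the file locations implied by the directory layout (hypothetical helper)."""
    player_dir = session_dir / player
    return {
        "prompt": player_dir / "prompts" / f"prompt_{request}.json",
        # Assumption: the follow-up prompt uses the next iteration number.
        "followup_prompt": player_dir / "prompts" / "iterations"
        / f"prompt_{request}_iter{iteration + 1}.json",
        "intermediate": player_dir / "responses" / "intermediate"
        / f"response_{request}_iter{iteration}.json",
        "final": player_dir / "responses" / f"response_{request}.json",
    }


if __name__ == "__main__":
    for label, path in expected_paths(Path("session_20260109_160732"), "Alice", 1, 1).items():
        print(f"{label}: {path}")
```

The function only computes paths; it does not check whether the files exist.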

📝 What Gets Saved

Intermediate Response Format

Location: responses/intermediate/response_X_iterY.json

{
  "request_number": 1,
  "iteration": 1,
  "timestamp": "2026-01-09T16:07:34.123456",
  "player_name": "Alice",
  "type": "intermediate",
  "success": true,
  "raw_content": "...",  // Raw LLM response content
  "has_tool_calls": true,
  "tool_calls": [        // Full tool_calls array from LLM
    {
      "name": "find_best_nodes",
      "parameters": {
        "reasoning": "Looking for high-yield nodes...",
        "min_pips": 10
      }
    }
  ],
  "model": "gemini-2.0-flash-exp",
  "tokens": {
    "prompt": 2172,
    "completion": 79,
    "thinking": 0,
    "total": 2251
  },
  "latency_seconds": 16.234,
  "error": null
}
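
To see what the LLM asked for in a given iteration, a file in this format can be loaded and reduced to one line per tool call. This is a minimal sketch: `summarize_tool_calls` is a hypothetical helper, and it assumes the files on disk are plain JSON (the `//` annotations above would be explanatory only) and use the field names shown in the sample.

```python
import json
from pathlib import Path


def summarize_tool_calls(path: Path) -> list[str]:
    """Return one 'name(param=value, ...)' string per tool call in the file."""
    record = json.loads(path.read_text(encoding="utf-8"))
    if not record.get("has_tool_calls"):
        return []
    summaries = []
    for call in record.get("tool_calls", []):
        params = call.get("parameters", {})
        # "reasoning" is free text, so it is left out to keep the summary short.
        args = ", ".join(f"{key}={value!r}" for key, value in params.items() if key != "reasoning")
        summaries.append(f"{call['name']}({args})")
    return summaries
```

For the sample record above, this would yield a single entry describing the `find_best_nodes` call with `min_pips=10`.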

Final Response Format

Location: responses/response_X.json

{
  "request_number": 1,
  "timestamp": "2026-01-09T16:09:24.617751",
  "player_name": "Alice",
  "type": "final",       // Marked as final
  "success": true,
  "raw_content": "...",  // Final structured response
  "parsed": {            // Parsed action
    "action_type": "place_starting_settlement",
    "parameters": {"node": 43}
  },
  "model": "gemini-2.0-flash-exp",
  "tokens": {
    "prompt": 3538,      // Accumulated tokens
    "completion": 355,
    "thinking": 5366,
    "total": 13070       // Total including all iterations + tools
  },
  "latency_seconds": 26.136,
  "error": null
}
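
A final response can be read back in much the same way to recover the action the agent chose. The sketch below is illustrative only: `load_final_action` is a hypothetical name, and it assumes the on-disk file is plain JSON with the `type`, `success`, and `parsed` fields shown above.

```python
import json
from pathlib import Path


def load_final_action(path: Path):
    """Return (action_type, parameters) from a final response, or None if unavailable."""
    record = json.loads(path.read_text(encoding="utf-8"))
    # Only successful responses marked as final carry a usable action.
    if record.get("type") != "final" or not record.get("success"):
        return None
    parsed = record.get("parsed") or {}
    action_type = parsed.get("action_type")
    if action_type is None:
        return None
    return action_type, parsed.get("parameters", {})
```

Returning `None` for failed or non-final records keeps callers from mistaking an intermediate response for a decision.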

🔄 Complete Flow Example

1. Initial Prompt → prompts/prompt_1.json
2. LLM Response (requests tools) → responses/intermediate/response_1_iter1.json ✨ NEW!
3. Tool Execution → tool_executions.json
4. Follow-up Prompt → prompts/iterations/prompt_1_iter2.json
5. Final Response → responses/response_1.json
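
The flow above can be reconstructed from file names alone. The following sketch orders one request's per-player files the way the flow describes them. `request_timeline` is a hypothetical helper; it assumes the `prompt_X_iterY.json` and `response_X_iterY.json` naming shown in this document, and it leaves out `tool_executions.json` because that file is shared across the session.

```python
import re
from pathlib import Path

# Captures the request number and iteration from names like response_1_iter1.json.
ITER_PATTERN = re.compile(r"_(\d+)_iter(\d+)\.json$")


def request_timeline(player_dir: Path, request: int) -> list[Path]:
    """List one request's files in conversation order (existence is not checked)."""
    keyed = []
    # Rank 0 = prompt, rank 1 = response: within an iteration the prompt comes first.
    sources = (
        (player_dir / "prompts" / "iterations", 0),
        (player_dir / "responses" / "intermediate", 1),
    )
    for folder, rank in sources:
        for path in folder.glob("*_iter*.json"):
            match = ITER_PATTERN.search(path.name)
            if match and int(match.group(1)) == request:
                keyed.append((int(match.group(2)), rank, path))
    ordered = [path for _, _, path in sorted(keyed, key=lambda item: item[:2])]
    first = player_dir / "prompts" / f"prompt_{request}.json"
    last = player_dir / "responses" / f"response_{request}.json"
    return [first, *ordered, last]
```

Sorting on the pair (iteration, rank) places `response_1_iter1.json` before `prompt_1_iter2.json`, matching steps 2 and 4 of the flow.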

🎯 Benefits

Complete Audit Trail - Every LLM interaction is saved
Debug Tool Requests - See exactly what the LLM asked for
Analyze Reasoning - Understand why tools were requested
Replay Capability - Can reconstruct entire conversation
Cost Tracking - Token counts for each iteration
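
As an example of the cost-tracking benefit, the per-iteration token counts can be collected straight from the intermediate files. This is a sketch under stated assumptions: `iteration_token_totals` is a hypothetical helper, the files are assumed to be plain JSON, and each is assumed to carry a `tokens.total` field as in the format shown earlier.

```python
import json
from pathlib import Path


def iteration_token_totals(intermediate_dir: Path) -> dict[str, int]:
    """Map each intermediate response file name to its tokens.total value."""
    totals = {}
    for path in sorted(intermediate_dir.glob("response_*_iter*.json")):
        record = json.loads(path.read_text(encoding="utf-8"))
        # Missing token data is counted as zero rather than raising.
        totals[path.name] = int(record.get("tokens", {}).get("total", 0))
    return totals
```

Calling `sum(iteration_token_totals(intermediate_dir).values())` gives the tokens spent on tool-requesting iterations. The final response's counts are described as accumulated, so adding them on top of these numbers would likely double count.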

📊 Usage

The intermediate responses are automatically saved by AILogger.log_intermediate_response() whenever the LLM returns tool_calls instead of a final answer.

No changes to your code are needed; the logging happens automatically.

🔍 Finding Intermediate Responses

from pathlib import Path

session_dir = Path("examples/ai_testing/my_games/session_20260109_160732")

# List every intermediate response saved for Alice, in file-name order
intermediate_dir = session_dir / "Alice" / "responses" / "intermediate"
for response_file in sorted(intermediate_dir.glob("*.json")):
    print(f"Found: {response_file.name}")

💡 Why This Matters

Previously, when the LLM requested tools, we only saved:

That tools were requested (in logs)
Which tools (in tool_executions.json)
The follow-up prompt (in iterations/)

Now we also save:

✅ The raw LLM response content
✅ Full tool_calls structure
✅ Token counts for this specific iteration
✅ Timing information
✅ Any error messages

This gives complete visibility into the AI agent's decision-making process!