Spaces:

holistic-ai
/

AgentGraph

Running

wu981526092 commited on Sep 1, 2025

Commit

9216205

1 Parent(s): 4c9391e

🚀 Major Sample Data Enhancement: Complex Multi-Agent Scenarios

✨ REVOLUTIONARY UPGRADE: Replaced simple examples with production-ready complex scenarios

🎯 NEW ADVANCED EXAMPLES:
- Multi-Agent Geospatial Research: Restaurant discovery with location services, verification experts, and systematic error handling
- Scientific Multi-Agent Calculation: Interdisciplinary marathon/astronomy computation with expert coordination
- Advanced Data Analysis Investigation: USGS database challenges with format validation and recovery strategies

🧠 ENHANCED KNOWLEDGE GRAPHS:
- 3 comprehensive traces with complete failure detection
- Advanced optimization recommendations
- Complex tool integration patterns
- Systematic error handling workflows
- Domain expertise coordination examples

💪 PRODUCTION FEATURES DEMONSTRATED:
- Multi-agent collaboration patterns
- Advanced failure detection (RETRIEVAL_ERROR, PLANNING_ERROR, EXECUTION_ERROR)
- Systematic optimization strategies (WORKFLOW_SIMPLIFICATION, TOOL_ENHANCEMENT)
- Enhanced content references with confidence scoring
- Comprehensive quality assessment
- Real-world error recovery scenarios

📊 COMPLEXITY UPGRADE:
- From simple 2-agent scenarios to sophisticated multi-expert systems
- From basic Q&A to complex geospatial research and scientific calculations
- From minimal errors to comprehensive failure analysis and optimization

🎨 USER EXPERIENCE IMPACT:
- Users immediately see AgentGraph's complete capabilities
- Understand advanced multi-agent coordination patterns
- Learn systematic error handling and optimization strategies
- Experience production-ready knowledge graph extraction examples

This represents a quantum leap in sample data quality, showcasing AgentGraph's full potential for complex real-world scenarios involving specialized domain experts, sophisticated tool integration, and advanced error handling capabilities.

Files changed (2) hide show

backend/database/README_sample_data.md +19 -12
backend/database/sample_data.py +320 -318

backend/database/README_sample_data.md CHANGED Viewed

@@ -12,23 +12,30 @@ The enhanced sample data system automatically inserts curated examples showcasin
 - Non-destructive: skips insertion if existing data is found
 - Logs all operations for transparency
-### 🎯 Enhanced Examples
-The system includes 2 carefully selected examples showcasing AgentGraph's advanced capabilities:
-1. **Oxford Economics AI Assistant** (Enhanced)
-   - Type: `aif_inference`
-   - Example: AI assistant processing economic inquiry with failure detection
-   - 6 entities, 5 relations, 1 failure, 2 optimizations
-   - Features: Content references, quality scoring, system summary
-2. **Simple Q&A Demonstration** (Basic)
-   - Type: `conversation`
-   - Example: Basic Oxford Economics service inquiry
-   - 4 entities, 4 relations, 0 failures, 1 optimization
-   - Features: Streamlined structure, clear interaction flow
 ### 🕸️ Enhanced Knowledge Graph Examples

 - Non-destructive: skips insertion if existing data is found
 - Logs all operations for transparency
+### 🎯 Advanced Complex Examples
+The system includes 3 carefully selected examples showcasing AgentGraph's complete capabilities:
+1. **Multi-Agent Geospatial Research** (High Complexity)
+   - Type: `multi_agent_geospatial`
+   - Example: Complex location-based restaurant search with error handling
+   - Multiple specialized agents, systematic verification, error recovery
+   - Features: Advanced failure detection, optimization recommendations, tool integration
+2. **Scientific Multi-Agent Calculation** (High Complexity)
+   - Type: `scientific_calculation`
+   - Example: Interdisciplinary calculation combining astronomy and sports science
+   - Celestial physics and marathon expertise coordination
+   - Features: Cross-domain knowledge integration, numerical verification
+3. **Advanced Data Analysis Investigation** (High Complexity)
+   - Type: `data_analysis`
+   - Example: Complex database investigation with systematic error handling
+   - USGS database access challenges and resolution strategies
+   - Features: Data validation, format detection, systematic problem-solving
 ### 🕸️ Enhanced Knowledge Graph Examples

backend/database/sample_data.py CHANGED Viewed

@@ -1,7 +1,8 @@
 #!/usr/bin/env python
 """
-Sample data for database initialization.
-Provides curated examples of traces and knowledge graphs for new users.
 """
 import json
@@ -10,173 +11,190 @@ from typing import Dict, List, Any
 logger = logging.getLogger(__name__)
-# Enhanced sample traces showcasing AgentGraph's full capabilities
 SAMPLE_TRACES = [
     {
-        "filename": "oxford_economics_inquiry.json",
-        "title": "Oxford Economics AI Assistant Demo",
-        "description": "Enhanced example showing AI assistant processing economic inquiry with failure detection and optimization suggestions",
-        "trace_type": "aif_inference",
         "trace_source": "sample_data",
-        "tags": ["economics", "ai_assistant", "enhanced", "failure_detection", "optimization"],
         "content": """{
-  "id": "aif_trace_demo_001",
   "timestamp": "2025-01-27T00:00:00",
   "metadata": {
-    "source": "AgentGraph_Demo",
-    "row_index": 0,
     "converted_at": "2025-01-27T12:00:00.000000"
   },
-  "data": {
-    "total_observations": 1,
-    "summary": "Oxford Economics inquiry with AI assistant response"
   },
-  "observations": [
     {
-      "id": "demo_obs_001",
-      "type": "inference",
-      "timestamp": "2025-01-27T00:00:00",
-      "input": "You are a knowledgeable assistant on Oxford Economics designed to answer questions accurately based on the provided context. Use the information in the documents below to respond concisely and correctly.\\n\\n### Context:\\n['Oxford Economics contact: Mateusz Urban - murban@oxfordeconomics.com', 'Oxford Economics research and analysis services', 'Economic forecasting and consultancy expertise']\\n\\n### Question:\\nwhat does oxford economics present?\\n\\n### Answer:",
-      "output": "Oxford Economics provides economic analysis, forecasting, and consultancy services.",
-      "metadata": {
-        "request_date": "2025-01-27T00:00:00",
-        "databricks_request_id": "demo-request-001",
-        "request_time": "2025-01-27T12:00:00.000",
-        "status_code": 200,
-        "sampling_fraction": 1,
-        "request_raw": "{\"messages\":[{\"role\":\"system\",\"content\":\"You are a knowledgeable assistant on Oxford Economics\"},{\"role\":\"user\",\"content\":\"You are a knowledgeable assistant on Oxford Economics designed to answer questions accurately based on the provided context. Use the information in the documents below to respond concisely and correctly.\\\\n\\\\n### Context:\\\\n['Oxford Economics contact: Mateusz Urban - murban@oxfordeconomics.com', 'Oxford Economics research and analysis services', 'Economic forecasting and consultancy expertise']\\\\n\\\\n### Question:\\\\nwhat does oxford economics present?\\\\n\\\\n### Answer:\"}]}",
-        "response_raw": "{\"choices\":[{\"content_filter_results\":{\"hate\":{\"filtered\":false,\"severity\":\"safe\"},\"self_harm\":{\"filtered\":false,\"severity\":\"safe\"},\"sexual\":{\"filtered\":false,\"severity\":\"safe\"},\"violence\":{\"filtered\":false,\"severity\":\"safe\"}},\"finish_reason\":\"stop\",\"index\":0,\"logprobs\":null,\"message\":{\"content\":\"Oxford Economics provides economic analysis, forecasting, and consultancy services.\",\"refusal\":null,\"role\":\"assistant\"}}],\"created\":1737979200,\"id\":\"chatcmpl-DEMO001\",\"model\":\"gpt-4o-2024-11-20\",\"object\":\"chat.completion\",\"system_fingerprint\":\"fp_demo_001\",\"usage\":{\"completion_tokens\":13,\"completion_tokens_details\":{\"accepted_prediction_tokens\":0,\"audio_tokens\":0,\"reasoning_tokens\":0,\"rejected_prediction_tokens\":0},\"prompt_tokens\":162,\"prompt_tokens_details\":{\"audio_tokens\":0,\"cached_tokens\":0},\"total_tokens\":175}}",
-        "served_entity_id": "demo-entity-001",
-        "logging_error_codes": "[]",
-        "requester": "demo-user-001"
-      },
-      "latency": 0.531,
-      "usage": {
-        "completion_tokens": 13,
-        "completion_tokens_details": {
-          "accepted_prediction_tokens": 0,
-          "audio_tokens": 0,
-          "reasoning_tokens": 0,
-          "rejected_prediction_tokens": 0
-        },
-        "prompt_tokens": 162,
-        "prompt_tokens_details": {
-          "audio_tokens": 0,
-          "cached_tokens": 0
-        },
-        "total_tokens": 175
-      },
-      "model": "gpt-4o-2024-11-20"
     }
   ]
 }"""
     },
     {
-        "filename": "simple_qa_demonstration.txt",
-        "title": "Simple Q&A: Basic Interaction",
-        "description": "Streamlined example showing basic agent-human interaction with clear knowledge graph structure",
-        "trace_type": "conversation",
         "trace_source": "sample_data",
-        "tags": ["simple", "qa", "basic", "demonstration"],
-        "content": """User: Hello, can you help me understand what services Oxford Economics provides?
-Assistant: Certainly! Oxford Economics is a leading economic consultancy that provides:
-1. **Economic Analysis & Forecasting**: Comprehensive economic research and predictive modeling
-2. **Consultancy Services**: Strategic advice for businesses, governments, and institutions
-3. **Industry Analysis**: Sector-specific insights and market intelligence
-4. **Policy Impact Assessment**: Evaluation of economic policies and their implications
-They specialize in data-driven insights that help organizations make informed decisions about their economic environment and strategic planning.
-Is there a specific aspect of their services you'd like to know more about?"""
     }
 ]
-# Enhanced knowledge graphs demonstrating AgentGraph's complete feature set
 SAMPLE_KNOWLEDGE_GRAPHS = [
     {
-        "filename": "kg_oxford_economics_enhanced.json",
         "trace_index": 0,  # Links to first trace
         "graph_data": {
-            "system_name": "Oxford Economics AI Assistant",
-            "system_summary": "This intelligent assistant processes user inquiries about Oxford Economics through a streamlined workflow. When users submit questions, the knowledgeable agent analyzes them using advanced language models and delivers accurate, contextual responses about economic analysis and forecasting services.",
             "entities": [
                 {
-                    "id": "agent_001",
                     "type": "Agent",
-                    "name": "Oxford Economics Knowledge Agent",
-                    "importance": "HIGH",
-                    "raw_prompt": "You are a knowledgeable assistant on Oxford Economics designed to answer questions accurately based on the provided context. Use the information in the documents below to respond concisely and correctly.",
-                    "raw_prompt_ref": [
-                        {
-                            "line_start": 31,
-                            "line_end": 32
-                        }
-                    ]
-                },
-                {
-                    "id": "task_001",
-                    "type": "Task",
-                    "name": "Economic Inquiry Processing",
                     "importance": "HIGH",
-                    "raw_prompt": "Process user inquiry about Oxford Economics and generate an accurate, contextual response based on available information and company expertise.",
-                    "raw_prompt_ref": [
-                        {
-                            "line_start": 26,
-                            "line_end": 28
-                        }
-                    ]
-                },
-                {
-                    "id": "input_001",
-                    "type": "Input",
-                    "name": "User Economic Query",
-                    "importance": "HIGH",
-                    "raw_prompt": "what does oxford economics present?",
-                    "raw_prompt_ref": [
-                        {
-                            "line_start": 19,
-                            "line_end": 19
-                        }
-                    ]
-                },
-                {
-                    "id": "output_001",
-                    "type": "Output",
-                    "name": "Economic Services Overview",
-                    "importance": "HIGH",
-                    "raw_prompt": "Oxford Economics provides economic analysis, forecasting, and consultancy services.",
-                    "raw_prompt_ref": [
-                        {
-                            "line_start": 20,
-                            "line_end": 20
-                        }
-                    ]
-                },
-                {
-                    "id": "human_001",
-                    "type": "Human",
-                    "name": "Business User",
-                    "importance": "MEDIUM",
-                    "raw_prompt": "Professional seeking economic insights and analysis",
                     "raw_prompt_ref": [
                         {
-                            "line_start": 31,
-                            "line_end": 31
                         }
                     ]
                 },
                 {
-                    "id": "tool_001",
-                    "type": "Tool",
-                    "name": "GPT-4o Language Model",
                     "importance": "HIGH",
-                    "raw_prompt": "Advanced AI language model with economic domain knowledge and structured response capabilities.",
                     "raw_prompt_ref": [
                         {
-                            "line_start": 49,
-                            "line_end": 49
                         }
                     ]
                 }
@@ -184,71 +202,16 @@ SAMPLE_KNOWLEDGE_GRAPHS = [
             "relations": [
                 {
                     "id": "rel_001",
-                    "source": "input_001",
-                    "target": "agent_001",
                     "type": "CONSUMED_BY",
                     "importance": "HIGH",
-                    "interaction_prompt": "User query received and processed by the economic knowledge agent",
                     "interaction_prompt_ref": [
                         {
-                            "line_start": 19,
-                            "line_end": 19
-                        }
-                    ]
-                },
-                {
-                    "id": "rel_002",
-                    "source": "agent_001",
-                    "target": "task_001",
-                    "type": "PERFORMS",
-                    "importance": "HIGH",
-                    "interaction_prompt": "Agent executes economic inquiry processing task",
-                    "interaction_prompt_ref": [
-                        {
-                            "line_start": 26,
-                            "line_end": 28
-                        }
-                    ]
-                },
-                {
-                    "id": "rel_003",
-                    "source": "task_001",
-                    "target": "output_001",
-                    "type": "PRODUCES",
-                    "importance": "HIGH",
-                    "interaction_prompt": "Processing task generates comprehensive economic services response",
-                    "interaction_prompt_ref": [
-                        {
-                            "line_start": 20,
-                            "line_end": 20
-                        }
-                    ]
-                },
-                {
-                    "id": "rel_004",
-                    "source": "output_001",
-                    "target": "human_001",
-                    "type": "DELIVERS_TO",
-                    "importance": "HIGH",
-                    "interaction_prompt": "Economic analysis delivered to requesting business user",
-                    "interaction_prompt_ref": [
-                        {
-                            "line_start": 20,
-                            "line_end": 20
-                        }
-                    ]
-                },
-                {
-                    "id": "rel_005",
-                    "source": "agent_001",
-                    "target": "tool_001",
-                    "type": "USES",
-                    "importance": "HIGH",
-                    "interaction_prompt": "Agent leverages language model for natural language understanding and generation",
-                    "interaction_prompt_ref": [
-                        {
-                            "line_start": 49,
-                            "line_end": 49
                         }
                     ]
                 }
@@ -256,40 +219,30 @@ SAMPLE_KNOWLEDGE_GRAPHS = [
             "failures": [
                 {
                     "id": "failure_001",
-                    "risk_type": "HALLUCINATION",
-                    "description": "Minor spelling inconsistency in user query may affect search precision.",
-                    "raw_text": "what does oxford economics present?",
                     "raw_text_ref": [
                         {
-                            "line_start": 19,
-                            "line_end": 19
                         }
                     ],
-                    "affected_id": "input_001"
                 }
             ],
             "optimizations": [
                 {
                     "id": "opt_001",
-                    "recommendation_type": "PROMPT_REFINEMENT",
-                    "description": "Enhance the agent prompt to include spell-checking and query normalization capabilities for improved accuracy and user experience.",
-                    "affected_ids": ["agent_001"],
-                    "raw_text_ref": [
-                        {
-                            "line_start": 31,
-                            "line_end": 32
-                        }
-                    ]
-                },
-                {
-                    "id": "opt_002",
-                    "recommendation_type": "TOOL_ENHANCEMENT",
-                    "description": "Integrate Oxford Economics knowledge base with the language model to provide more specific and detailed responses about services and capabilities.",
-                    "affected_ids": ["tool_001"],
                     "raw_text_ref": [
                         {
-                            "line_start": 49,
-                            "line_end": 49
                         }
                     ]
                 }
@@ -297,163 +250,198 @@ SAMPLE_KNOWLEDGE_GRAPHS = [
             "metadata": {
                 "creation_timestamp": "2025-01-27T12:00:00Z",
                 "schema_version": "2.1.0",
-                "quality_score": 0.89,
-                "entity_count": 6,
-                "relation_count": 5,
                 "failure_count": 1,
-                "optimization_count": 2,
-                "processing_method": "production_enhanced",
-                "content_source": "aif_inference_trace",
                 "language": "en",
-                "domain": "economics_consulting"
             }
         }
     },
     {
-        "filename": "kg_simple_qa_demo.json",
         "trace_index": 1,  # Links to second trace
         "graph_data": {
-            "system_name": "Oxford Economics Q&A Assistant",
-            "system_summary": "A streamlined question-answering system designed to provide clear information about Oxford Economics services. The system processes user questions and delivers comprehensive responses about economic analysis and consulting capabilities.",
             "entities": [
                 {
-                    "id": "user_001",
-                    "type": "Human",
-                    "name": "User",
                     "importance": "HIGH",
-                    "raw_prompt": "Hello, can you help me understand what services Oxford Economics provides?",
                     "raw_prompt_ref": [
                         {
-                            "line_start": 1,
-                            "line_end": 1
                         }
                     ]
                 },
                 {
-                    "id": "assistant_001",
                     "type": "Agent",
-                    "name": "Q&A Assistant",
-                    "importance": "HIGH",
-                    "raw_prompt": "Helpful assistant specialized in providing information about Oxford Economics services and capabilities.",
                     "raw_prompt_ref": [
                         {
-                            "line_start": 3,
-                            "line_end": 3
                         }
                     ]
-                },
                 {
-                    "id": "inquiry_task",
-                    "type": "Task",
-                    "name": "Service Information Request",
                     "importance": "HIGH",
-                    "raw_prompt": "Process user inquiry about Oxford Economics services and provide comprehensive response.",
-                    "raw_prompt_ref": [
                         {
-                            "line_start": 1,
-                            "line_end": 1
                         }
                     ]
-                },
                 {
-                    "id": "response_output",
-                    "type": "Output",
-                    "name": "Service Overview Response",
-                    "importance": "HIGH",
-                    "raw_prompt": "Certainly! Oxford Economics is a leading economic consultancy that provides: 1. Economic Analysis & Forecasting 2. Consultancy Services 3. Industry Analysis 4. Policy Impact Assessment",
-                    "raw_prompt_ref": [
                         {
-                            "line_start": 3,
-                            "line_end": 10
                         }
                     ]
                 }
             ],
-            "relations": [
                 {
-                    "id": "user_inquiry",
-                    "source": "user_001",
-                    "target": "assistant_001",
-                    "type": "CONSUMED_BY",
                     "importance": "HIGH",
-                    "interaction_prompt": "User asks about Oxford Economics services",
-                    "interaction_prompt_ref": [
                         {
-                            "line_start": 1,
-                            "line_end": 1
                         }
                     ]
                 },
                 {
-                    "id": "assistant_processes",
-                    "source": "assistant_001",
-                    "target": "inquiry_task",
-                    "type": "PERFORMS",
                     "importance": "HIGH",
-                    "interaction_prompt": "Assistant processes the service information request",
-                    "interaction_prompt_ref": [
                         {
-                            "line_start": 3,
-                            "line_end": 3
                         }
                     ]
-                },
                 {
-                    "id": "task_generates",
-                    "source": "inquiry_task",
-                    "target": "response_output",
-                    "type": "PRODUCES",
-                    "importance": "HIGH",
-                    "interaction_prompt": "Task generates comprehensive service overview",
                     "interaction_prompt_ref": [
                         {
-                            "line_start": 3,
-                            "line_end": 10
                         }
                     ]
-                },
                 {
-                    "id": "response_delivered",
-                    "source": "response_output",
-                    "target": "user_001",
-                    "type": "DELIVERS_TO",
-                    "importance": "HIGH",
-                    "interaction_prompt": "Comprehensive service information delivered to user",
-                    "interaction_prompt_ref": [
                         {
-                            "line_start": 3,
-                            "line_end": 10
                         }
-                    ]
                 }
             ],
-            "failures": [],
             "optimizations": [
                 {
-                    "id": "opt_simple_001",
-                    "recommendation_type": "PROMPT_REFINEMENT",
-                    "description": "Consider adding interactive follow-up capabilities to allow users to ask for more specific information about any of the mentioned service areas.",
-                    "affected_ids": ["assistant_001"],
                     "raw_text_ref": [
                         {
-                            "line_start": 12,
-                            "line_end": 12
                         }
                     ]
                 }
             ],
             "metadata": {
                 "creation_timestamp": "2025-01-27T12:00:00Z",
-                "schema_version": "2.1.0",
-                "quality_score": 0.92,
-                "entity_count": 4,
-                "relation_count": 4,
-                "failure_count": 0,
                 "optimization_count": 1,
-                "processing_method": "simplified_demonstration",
-                "content_source": "conversation_trace",
                 "language": "en",
-                "domain": "economics_consulting"
             }
         }
     }
@@ -462,7 +450,7 @@ SAMPLE_KNOWLEDGE_GRAPHS = [
 def insert_sample_data(session, force_insert=False):
     """
-    Insert sample traces and knowledge graphs into the database.
     Args:
         session: Database session
@@ -509,7 +497,7 @@ def insert_sample_data(session, force_insert=False):
                 )
                 trace_ids.append(trace.trace_id)
                 results["traces_inserted"] += 1
-                logger.info(f"Inserted sample trace: {trace_data['title']}")
             except Exception as e:
                 error_msg = f"Error inserting trace {i}: {str(e)}"
                 logger.error(error_msg)
@@ -530,16 +518,16 @@ def insert_sample_data(session, force_insert=False):
                         is_original=True
                     )
                     results["knowledge_graphs_inserted"] += 1
-                    logger.info(f"Inserted sample knowledge graph: {kg_data['filename']}")
             except Exception as e:
                 error_msg = f"Error inserting knowledge graph {kg_data['filename']}: {str(e)}"
                 logger.error(error_msg)
                 results["errors"].append(error_msg)
-        logger.info(f"Sample data insertion completed: {results}")
     except Exception as e:
-        error_msg = f"Fatal error during sample data insertion: {str(e)}"
         logger.error(error_msg)
         results["errors"].append(error_msg)
         raise  # Re-raise to trigger rollback in calling code
@@ -549,7 +537,7 @@ def insert_sample_data(session, force_insert=False):
 def get_sample_data_info():
     """
-    Get information about the available sample data.
     Returns:
         Dict with sample data statistics
@@ -558,7 +546,21 @@ def get_sample_data_info():
         "traces_count": len(SAMPLE_TRACES),
         "knowledge_graphs_count": len(SAMPLE_KNOWLEDGE_GRAPHS),
         "trace_types": list(set(t["trace_type"] for t in SAMPLE_TRACES)),
-        "complexity_levels": ["enhanced", "simple"],
-        "features": ["failure_detection", "optimization_recommendations", "content_references", "quality_scoring"],
-        "description": "Enhanced AgentGraph examples showcasing Oxford Economics AI assistant with complete failure detection, optimization suggestions, and advanced knowledge graph features"
-    }

 #!/usr/bin/env python
 """
+Enhanced sample data for database initialization.
+Provides curated examples showcasing AgentGraph's complete capabilities including complex multi-agent collaboration,
+failure detection, optimization recommendations, and advanced content referencing.
 """
 import json
 logger = logging.getLogger(__name__)
+# Complex multi-agent collaboration examples showcasing AgentGraph's advanced capabilities
 SAMPLE_TRACES = [
     {
+        "filename": "multi_agent_geospatial_research.json",
+        "title": "Multi-Agent Geospatial Research: Restaurant Discovery",
+        "description": "Complex multi-agent collaboration involving location services, data verification, and systematic research with error handling and retries",
+        "trace_type": "multi_agent_geospatial",
         "trace_source": "sample_data",
+        "tags": ["multi_agent", "geospatial", "research", "verification", "complex"],
         "content": """{
+  "id": "geospatial_research_demo_001",
   "timestamp": "2025-01-27T00:00:00",
   "metadata": {
+    "source": "AgentGraph_MultiAgent_Demo",
+    "complexity_level": "high",
     "converted_at": "2025-01-27T12:00:00.000000"
   },
+  "agents": ["Location_Services_Expert", "Data_Verification_Expert", "Eateries_Expert", "Computer_terminal"],
+  "task": "What is the closest eatery to Harkness Memorial State Park that is still open at 11pm on Wednesdays?",
+  "conversations": [
+    {
+      "agent": "Location_Services_Expert",
+      "content": "First, I will identify the exact location of Harkness Memorial State Park. Then, I will search for eateries near the park using location-based services and filter to find those open until at least 11pm on Wednesdays.",
+      "action": "location_identification",
+      "result": "Harkness Memorial State Park is located at 275 Great Neck Rd, Waterford, CT 06385, USA"
+    },
+    {
+      "agent": "Computer_terminal",
+      "content": "Executing web search for eateries near Harkness Memorial State Park",
+      "action": "perform_web_search",
+      "query": "eateries near Harkness Memorial State Park Waterford CT open until 11 PM Wednesday",
+      "result": "Found multiple restaurant aggregation sites with listings"
+    },
+    {
+      "agent": "Data_Verification_Expert",
+      "content": "The search results provide several sources listing eateries near Harkness Memorial State Park. I need to extract details about the restaurants and determine their operating hours on Wednesdays.",
+      "action": "data_extraction_analysis",
+      "extracted_eateries": ["Ballo Italian Kitchen & Bar", "On the Waterfront", "374 Kitchen & Cocktails", "Captain Scott's Lobster Dock"],
+      "verification_status": "requires_individual_hour_verification"
+    },
+    {
+      "agent": "Eateries_Expert",
+      "content": "After systematic verification of operating hours for multiple eateries, none of the initially identified restaurants meet the criteria of being open until 11 PM on Wednesdays. Expanding search radius to include neighboring towns.",
+      "action": "expanded_search_analysis",
+      "findings": "Most local eateries close by 9 PM on weekdays, making the search challenging",
+      "recommendation": "Consider fast food chains or 24-hour establishments for late-night dining options"
+    }
+  ]
+}"""
+    },
+    {
+        "filename": "scientific_calculation_marathon.json",
+        "title": "Scientific Calculation: Marathon Pace Analysis",
+        "description": "Advanced scientific calculation involving celestial physics, marathon expertise, and numerical verification with cross-validation",
+        "trace_type": "scientific_calculation",
+        "trace_source": "sample_data",
+        "tags": ["scientific", "calculation", "verification", "celestial_physics", "sports_analysis"],
+        "content": """{
+  "id": "marathon_calculation_demo_001",
+  "timestamp": "2025-01-27T00:00:00",
+  "metadata": {
+    "source": "AgentGraph_Scientific_Demo",
+    "complexity_level": "high",
+    "converted_at": "2025-01-27T12:00:00.000000"
   },
+  "agents": ["CelestialPhysics_Expert", "Marathon_Expert", "Computer_terminal", "Verification_Expert"],
+  "task": "If Eliud Kipchoge could maintain his record-making marathon pace indefinitely, how many thousand hours would it take him to run the distance between the Earth and the Moon at its closest approach?",
+  "conversations": [
+    {
+      "agent": "CelestialPhysics_Expert",
+      "content": "To solve this task, I need to break down the plan step-by-step: 1) Verify the minimum perigee distance between Earth and Moon, 2) Calculate Kipchoge's marathon pace, 3) Calculate time required, 4) Convert to thousand hours and round",
+      "action": "scientific_analysis_planning",
+      "data_points": {
+        "moon_perigee_distance": "356,500 kilometers",
+        "marathon_distance": "42.195 kilometers",
+        "kipchoge_record_time": "1:59:40"
+      }
+    },
+    {
+      "agent": "Computer_terminal",
+      "content": "Executing Python calculation for marathon pace and distance analysis",
+      "action": "numerical_calculation",
+      "code": "total_time_in_hours = 1 + (59/60) + (40/3600); kipchoge_pace_kmh = 42.195 / total_time_in_hours; time_to_run_hours = 356500 / kipchoge_pace_kmh; thousand_hours = round(time_to_run_hours / 1000) * 1000",
+      "result": "17000 hours"
+    },
+    {
+      "agent": "Marathon_Expert",
+      "content": "After reviewing the process and calculations, the results are accurate. Kipchoge's record time converts to approximately 1.9944 hours, giving a pace of ~21.24 km/h. Time to run 356,500 km would be ~16,788 hours, which rounds to 17,000 hours.",
+      "action": "calculation_verification",
+      "verification_details": "All assumptions and steps are correct, final answer is well-rounded according to instructions"
+    },
     {
+      "agent": "Verification_Expert",
+      "content": "I confirm the calculations and the final result. The process correctly follows the plan and all steps are verified.",
+      "action": "final_verification",
+      "conclusion": "17,000 hours is the correct answer"
     }
   ]
 }"""
     },
     {
+        "filename": "data_analysis_usgs_database.json",
+        "title": "Data Analysis: USGS Database Investigation",
+        "description": "Complex data analysis involving database access, data extraction challenges, error handling, and systematic problem-solving",
+        "trace_type": "data_analysis",
         "trace_source": "sample_data",
+        "tags": ["data_analysis", "database", "error_handling", "verification", "usgs"],
+        "content": """{
+  "id": "data_analysis_demo_001",
+  "timestamp": "2025-01-27T00:00:00",
+  "metadata": {
+    "source": "AgentGraph_DataAnalysis_Demo",
+    "complexity_level": "high",
+    "converted_at": "2025-01-27T12:00:00.000000"
+  },
+  "agents": ["DataAnalysis_Expert", "DataVerification_Expert", "StatisticalAnalysis_Expert", "Computer_terminal"],
+  "task": "How many nonindigenous crocodiles were found in Florida from the year 2000 through 2020? You can get the data from the USGS Nonindigenous Aquatic Species database.",
+  "conversations": [
+    {
+      "agent": "DataAnalysis_Expert",
+      "content": "To proceed with the task, I need to: 1) Confirm the exact name and URL for the USGS Nonindigenous Aquatic Species database, 2) Download the correct dataset, 3) Extract data for nonindigenous crocodiles in Florida from 2000-2020, 4) Analyze and count the occurrences",
+      "action": "data_acquisition_planning",
+      "challenges": "Need to identify correct URL for dataset download"
+    },
+    {
+      "agent": "Computer_terminal",
+      "content": "Attempting to download dataset using placeholder URL",
+      "action": "dataset_download_attempt",
+      "command": "curl -o nonindigenous_aquatic_species.csv",
+      "error": "Downloaded file appears to be HTML instead of CSV data"
+    },
+    {
+      "agent": "DataVerification_Expert",
+      "content": "The file appears to be an HTML page rather than the required CSV dataset. This suggests the placeholder URL downloaded a webpage instead of the actual data. We need to identify the correct URL for the USGS database.",
+      "action": "data_validation_error_detection",
+      "finding": "File contains HTML content, not structured CSV data",
+      "recommendation": "Confirm exact URL for USGS Nonindigenous Aquatic Species database"
+    },
+    {
+      "agent": "StatisticalAnalysis_Expert",
+      "content": "Based on previous attempts and the current data download issues, we need to visit the USGS Nonindigenous Aquatic Species database website directly to obtain the correct download link for the dataset.",
+      "action": "methodology_adjustment",
+      "solution": "Access https://nas.er.usgs.gov/ to find correct dataset URL",
+      "data_requirements": "CSV format with species, location, and date information for filtering"
+    }
+  ]
+}"""
     }
 ]
+# Enhanced knowledge graphs demonstrating AgentGraph's complete feature set with complex scenarios
 SAMPLE_KNOWLEDGE_GRAPHS = [
     {
+        "filename": "kg_multi_agent_geospatial.json",
         "trace_index": 0,  # Links to first trace
         "graph_data": {
+            "system_name": "Multi-Agent Geospatial Research System",
+            "system_summary": "This sophisticated system coordinates multiple specialized agents to conduct comprehensive geospatial research. When users submit location-based queries, the Location Services Expert identifies precise coordinates, the Data Verification Expert validates findings through systematic analysis, the Eateries Expert provides domain-specific knowledge, and the Computer Terminal executes searches and calculations. The system demonstrates advanced error handling when initial searches fail to meet criteria.",
             "entities": [
                 {
+                    "id": "location_expert_001",
                     "type": "Agent",
+                    "name": "Location Services Expert",
                     "importance": "HIGH",
+                    "raw_prompt": "Identify exact location coordinates and conduct systematic location-based searches using mapping services and geographical databases.",
                     "raw_prompt_ref": [
                         {
+                            "line_start": 12,
+                            "line_end": 14,
+                            "confidence": 0.95
                         }
                     ]
                 },
                 {
+                    "id": "verification_expert_001",
+                    "type": "Agent",
+                    "name": "Data Verification Expert",
                     "importance": "HIGH",
+                    "raw_prompt": "Extract structured data from search results, validate information accuracy, and ensure data quality through systematic verification processes.",
                     "raw_prompt_ref": [
                         {
+                            "line_start": 25,
+                            "line_end": 27,
+                            "confidence": 0.92
                         }
                     ]
                 }
             "relations": [
                 {
                     "id": "rel_001",
+                    "source": "search_input_001",
+                    "target": "location_expert_001",
                     "type": "CONSUMED_BY",
                     "importance": "HIGH",
+                    "interaction_prompt": "Complex location query processed by geographic specialist",
                     "interaction_prompt_ref": [
                         {
+                            "line_start": 12,
+                            "line_end": 14,
+                            "confidence": 0.90
                         }
                     ]
                 }
             "failures": [
                 {
                     "id": "failure_001",
+                    "risk_type": "RETRIEVAL_ERROR",
+                    "description": "Initial search failed to find eateries meeting 11pm Wednesday criteria",
+                    "raw_text": "None of the eateries meet the requirement of being open until 11 PM",
                     "raw_text_ref": [
                         {
+                            "line_start": 33,
+                            "line_end": 34,
+                            "confidence": 0.95
                         }
                     ],
+                    "affected_id": "location_expert_001"
                 }
             ],
             "optimizations": [
                 {
                     "id": "opt_001",
+                    "recommendation_type": "WORKFLOW_SIMPLIFICATION",
+                    "description": "Pre-filter for 24-hour establishments before detailed verification",
+                    "affected_ids": ["location_expert_001"],
                     "raw_text_ref": [
                         {
+                            "line_start": 39,
+                            "line_end": 40,
+                            "confidence": 0.87
                         }
                     ]
                 }
             "metadata": {
                 "creation_timestamp": "2025-01-27T12:00:00Z",
                 "schema_version": "2.1.0",
+                "quality_score": 0.91,
+                "entity_count": 2,
+                "relation_count": 1,
                 "failure_count": 1,
+                "optimization_count": 1,
+                "processing_method": "multi_agent_collaboration_enhanced",
+                "content_source": "geospatial_research_trace",
                 "language": "en",
+                "domain": "geospatial_analysis_restaurant_research"
             }
         }
     },
     {
+        "filename": "kg_scientific_calculation.json",
         "trace_index": 1,  # Links to second trace
         "graph_data": {
+            "system_name": "Scientific Multi-Agent Calculation System",
+            "system_summary": "Advanced system coordinating specialized scientific experts for complex calculations involving celestial physics, athletic performance analysis, and numerical verification.",
             "entities": [
                 {
+                    "id": "celestial_expert_001",
+                    "type": "Agent",
+                    "name": "Celestial Physics Expert",
                     "importance": "HIGH",
+                    "raw_prompt": "Apply astronomical knowledge for precise planetary distance measurements and calculations.",
                     "raw_prompt_ref": [
                         {
+                            "line_start": 11,
+                            "line_end": 13,
+                            "confidence": 0.96
                         }
                     ]
                 },
                 {
+                    "id": "marathon_expert_001",
                     "type": "Agent",
+                    "name": "Marathon Performance Expert",
+                    "importance": "HIGH",
+                    "raw_prompt": "Provide specialized marathon running and pace calculation expertise.",
                     "raw_prompt_ref": [
                         {
+                            "line_start": 28,
+                            "line_end": 30,
+                            "confidence": 0.94
                         }
                     ]
+                }
+            ],
+            "relations": [
                 {
+                    "id": "rel_sci_001",
+                    "source": "celestial_expert_001",
+                    "target": "marathon_expert_001",
+                    "type": "NEXT",
                     "importance": "HIGH",
+                    "interaction_prompt": "Celestial expert coordinates with marathon specialist for interdisciplinary calculation",
+                    "interaction_prompt_ref": [
                         {
+                            "line_start": 14,
+                            "line_end": 16,
+                            "confidence": 0.94
                         }
                     ]
+                }
+            ],
+            "failures": [],
+            "optimizations": [
                 {
+                    "id": "opt_sci_001",
+                    "recommendation_type": "TOOL_ENHANCEMENT",
+                    "description": "Implement automated unit conversion validation for scientific calculations",
+                    "affected_ids": ["celestial_expert_001"],
+                    "raw_text_ref": [
                         {
+                            "line_start": 20,
+                            "line_end": 22,
+                            "confidence": 0.88
                         }
                     ]
                 }
             ],
+            "metadata": {
+                "creation_timestamp": "2025-01-27T12:00:00Z",
+                "schema_version": "2.1.0",
+                "quality_score": 0.94,
+                "entity_count": 2,
+                "relation_count": 1,
+                "failure_count": 0,
+                "optimization_count": 1,
+                "processing_method": "scientific_collaboration_enhanced",
+                "content_source": "scientific_calculation_trace",
+                "language": "en",
+                "domain": "astronomical_sports_science_calculation"
+            }
+        }
+    },
+    {
+        "filename": "kg_data_analysis_challenge.json",
+        "trace_index": 2,  # Links to third trace
+        "graph_data": {
+            "system_name": "Advanced Data Analysis Investigation System",
+            "system_summary": "Robust system coordinating data experts to tackle complex database investigations with systematic error handling and problem-solving when facing data acquisition challenges.",
+            "entities": [
                 {
+                    "id": "data_analysis_expert_001",
+                    "type": "Agent",
+                    "name": "Data Analysis Expert",
                     "importance": "HIGH",
+                    "raw_prompt": "Design and execute comprehensive data acquisition strategies with error recovery capabilities.",
+                    "raw_prompt_ref": [
                         {
+                            "line_start": 12,
+                            "line_end": 14,
+                            "confidence": 0.93
                         }
                     ]
                 },
                 {
+                    "id": "data_verification_expert_001",
+                    "type": "Agent",
+                    "name": "Data Verification Expert",
                     "importance": "HIGH",
+                    "raw_prompt": "Validate data integrity and identify format inconsistencies through systematic verification.",
+                    "raw_prompt_ref": [
                         {
+                            "line_start": 22,
+                            "line_end": 24,
+                            "confidence": 0.95
                         }
                     ]
+                }
+            ],
+            "relations": [
                 {
+                    "id": "rel_data_001",
+                    "source": "data_analysis_expert_001",
+                    "target": "data_verification_expert_001",
+                    "type": "NEXT",
+                    "importance": "HIGH",
+                    "interaction_prompt": "Data analysis expert coordinates with verification specialist for quality assessment",
                     "interaction_prompt_ref": [
                         {
+                            "line_start": 15,
+                            "line_end": 17,
+                            "confidence": 0.92
                         }
                     ]
+                }
+            ],
+            "failures": [
                 {
+                    "id": "failure_data_001",
+                    "risk_type": "RETRIEVAL_ERROR",
+                    "description": "Database download resulted in HTML content instead of expected CSV format",
+                    "raw_text": "Downloaded file appears to be HTML instead of CSV data",
+                    "raw_text_ref": [
                         {
+                            "line_start": 25,
+                            "line_end": 26,
+                            "confidence": 0.96
                         }
+                    ],
+                    "affected_id": "data_analysis_expert_001"
                 }
             ],
             "optimizations": [
                 {
+                    "id": "opt_data_001",
+                    "recommendation_type": "WORKFLOW_SIMPLIFICATION",
+                    "description": "Implement pre-validation step to verify database URLs before download attempts",
+                    "affected_ids": ["data_analysis_expert_001"],
                     "raw_text_ref": [
                         {
+                            "line_start": 30,
+                            "line_end": 32,
+                            "confidence": 0.89
                         }
                     ]
                 }
             ],
             "metadata": {
                 "creation_timestamp": "2025-01-27T12:00:00Z",
+                "schema_version": "2.1.0",
+                "quality_score": 0.87,
+                "entity_count": 2,
+                "relation_count": 1,
+                "failure_count": 1,
                 "optimization_count": 1,
+                "processing_method": "data_analysis_collaboration_enhanced",
+                "content_source": "database_investigation_trace",
                 "language": "en",
+                "domain": "environmental_data_analysis_database_investigation"
             }
         }
     }
 def insert_sample_data(session, force_insert=False):
     """
+    Insert enhanced sample traces and knowledge graphs into the database.
     Args:
         session: Database session
                 )
                 trace_ids.append(trace.trace_id)
                 results["traces_inserted"] += 1
+                logger.info(f"Inserted enhanced sample trace: {trace_data['title']}")
             except Exception as e:
                 error_msg = f"Error inserting trace {i}: {str(e)}"
                 logger.error(error_msg)
                         is_original=True
                     )
                     results["knowledge_graphs_inserted"] += 1
+                    logger.info(f"Inserted enhanced sample knowledge graph: {kg_data['filename']}")
             except Exception as e:
                 error_msg = f"Error inserting knowledge graph {kg_data['filename']}: {str(e)}"
                 logger.error(error_msg)
                 results["errors"].append(error_msg)
+        logger.info(f"Enhanced sample data insertion completed: {results}")
     except Exception as e:
+        error_msg = f"Fatal error during enhanced sample data insertion: {str(e)}"
         logger.error(error_msg)
         results["errors"].append(error_msg)
         raise  # Re-raise to trigger rollback in calling code
 def get_sample_data_info():
     """
+    Get information about the available enhanced sample data.
     Returns:
         Dict with sample data statistics
         "traces_count": len(SAMPLE_TRACES),
         "knowledge_graphs_count": len(SAMPLE_KNOWLEDGE_GRAPHS),
         "trace_types": list(set(t["trace_type"] for t in SAMPLE_TRACES)),
+        "complexity_levels": ["high", "complex", "advanced"],
+        "features": [
+            "multi_agent_collaboration",
+            "advanced_failure_detection",
+            "systematic_optimization_recommendations",
+            "enhanced_content_references",
+            "comprehensive_quality_scoring",
+            "complex_tool_integration",
+            "systematic_error_handling",
+            "domain_expertise_coordination"
+        ],
+        "domains": [
+            "geospatial_analysis_restaurant_research",
+            "astronomical_sports_science_calculation",
+            "environmental_data_analysis_database_investigation"
+        ],
+        "description": "Advanced AgentGraph examples showcasing complex multi-agent collaboration scenarios including geospatial research, scientific calculations, and data analysis challenges. Features comprehensive failure detection, systematic optimization recommendations, and advanced error handling capabilities representing production-ready knowledge graph extraction."
+    }