SDR-Arena / data /test_submission.json
behavior-in-the-wild's picture
Deploy SDR-Arena leaderboard
f9e2361 verified
{
"agent_name": "test-dummy-agent",
"agent_author": "Test User",
"agent_description": "Dummy agent for testing the upload workflow",
"results": {
"0": {
"prompt_id": 0,
"status": "completed",
"output": "1. Acme Corp's cloud-native platform directly addresses RetailMax's challenge of managing 2,000+ store locations with fragmented legacy systems, reducing infrastructure overhead by consolidating to a single pane of glass.\n2. RetailMax's 35% year-over-year e-commerce growth strains their existing data pipeline; Acme's real-time analytics engine can unify online and in-store signals to prevent stockouts that currently cost them an estimated $12M annually.\n3. With RetailMax expanding into three new APAC markets in Q3, Acme's multi-region deployment model with built-in compliance templates for local data-residency laws eliminates a major barrier their CTO flagged in the last earnings call.",
"duration_seconds": 30.5,
"tokens": {
"prompt_tokens": 8000,
"completion_tokens": 2000,
"total_tokens": 10000
},
"searches": [
{
"queries": ["RetailMax company overview", "RetailMax technology challenges retail"],
"num_results": 10
}
]
},
"1": {
"prompt_id": 1,
"status": "completed",
"output": "1. FinServ Global's regulatory reporting currently takes 14 days per quarter; DataBridge's automated compliance engine reduced similar timelines by 80% for three top-20 banks, directly addressing the CFO's stated priority of 'operational efficiency in compliance.'\n2. DataBridge's API-first architecture integrates with FinServ's existing SAP and Salesforce stack without a rip-and-replace migration, which aligns with FinServ's publicly stated 'incremental modernization' IT strategy.\n3. FinServ's recent $200M acquisition of a wealth-management platform creates urgent data-integration needs; DataBridge's M&A data-unification playbook has been deployed in 15+ post-merger integrations with an average time-to-value of 6 weeks.",
"duration_seconds": 42.1,
"tokens": {
"prompt_tokens": 9500,
"completion_tokens": 2200,
"total_tokens": 11700
},
"searches": [
{
"queries": ["FinServ Global quarterly report", "FinServ Global IT modernization strategy"],
"num_results": 10
},
{
"queries": ["FinServ Global acquisition wealth management"],
"num_results": 5
}
]
},
"2": {
"prompt_id": 2,
"status": "completed",
"output": "1. MediCare Health's 120-hospital network generates over 5PB of imaging data annually; CloudVault's healthcare-optimized object storage with HIPAA-native encryption eliminates the patchwork of regional NAS silos their VP of Engineering called 'our biggest operational risk' at HIMSS 2024.\n2. MediCare's push toward AI-assisted diagnostics requires low-latency access to historical imaging datasets; CloudVault's tiered-storage architecture with sub-100ms retrieval for warm data cuts the data-preparation phase from weeks to hours, directly enabling their radiology AI roadmap.\n3. With MediCare facing a 15% annual increase in cyber-insurance premiums due to ransomware exposure, CloudVault's immutable backup and air-gapped recovery vault addresses the exact attack vector that caused their peer HealthFirst's 72-hour outage last quarter.",
"duration_seconds": 38.7,
"tokens": {
"prompt_tokens": 8800,
"completion_tokens": 2100,
"total_tokens": 10900
},
"searches": [
{
"queries": ["MediCare Health hospital network infrastructure", "MediCare Health data strategy"],
"num_results": 10
},
{
"queries": ["MediCare Health cybersecurity challenges healthcare"],
"num_results": 8
}
]
}
}
}