File size: 5,102 Bytes
8b21729
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
90b0a17
 
8b21729
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
REVENUE-FOCUSED FINANCIAL DATA EXTRACTION

=== DOCUMENT TO ANALYZE ===
File: {file_path}
(Document will be provided directly to you for analysis)

=== YOUR MISSION ===
Extract ONLY revenue-related financial data from the provided document with 100% accuracy.
Focus exclusively on company name and revenue data - ignore all other financial metrics.
You are using gemini-2.5-pro with thinking budget optimization.

=== WHAT TO EXTRACT (REVENUE ONLY) ===

**MANDATORY (Must Extract):**
1. **Company Name** - Official legal company name
2. **Total Revenue** - Consolidated revenue/net sales for most recent period
3. **Document Type** - 10-K, 10-Q, Annual Report, Quarterly Report, etc.
4. **Reporting Period** - FY 2023, Q1 2024, etc.
5. **Currency** - USD, EUR, etc.

**HIGH VALUE (Extract if clearly present):**
6. **Segment Revenue** - Revenue by business segment/division/product line
7. **Regional Revenue** - Revenue by geographic region/country

**IGNORE COMPLETELY:**
- Net income, profit, losses
- Assets, liabilities, equity
- Cash flow data
- Expenses, costs, operating income
- Balance sheet items
- Ratios, per-share metrics
- Non-financial data

=== SYSTEMATIC EXTRACTION PROCESS ===

**Step 1: Document Structure Analysis**
- Scan document to understand structure and layout
- Identify document type and reporting period
- Locate revenue-related sections (Income Statement, Segment Reporting, Geographic Data)

**Step 2: Company Identification** 
- Extract official company name from document header/title
- Verify name consistency throughout document
- Use parent company name if multiple entities present

**Step 3: Total Revenue Extraction (CRITICAL)**
- Find consolidated revenue figure for most recent period
- Look in: Consolidated Statements of Operations, Income Statement
- Search terms: "Revenue", "Net Sales", "Total Revenue", "Net Revenue"
- Record exact value with currency and time period

**Step 4: Segment Revenue Analysis**
- Locate segment reporting section (usually separate section after financial statements)  
- Extract revenue by business segment, division, or product line
- Common segments: Products, Services, Geographic regions, Business units
- Ensure segment revenues sum to total revenue for validation

**Step 5: Regional Revenue Analysis** 
- Find geographic revenue breakdown section
- Look for revenue by: Americas, EMEA, APAC, US vs International, specific countries
- Extract revenue figures for major geographic regions
- Validate regional totals match consolidated revenue

**Step 6: Data Validation**
- Verify company name is not empty
- Confirm total revenue has high confidence score (>0.7)
- Check that segment/regional breakdowns sum to total
- Ensure all mandatory fields are extracted

=== CONFIDENCE SCORING (REVENUE DATA ONLY) ===
- **1.0**: Revenue clearly stated in financial table with proper labels
- **0.8**: Revenue stated in structured text with clear context
- **0.6**: Revenue derived from segment/regional totals
- **0.4**: Revenue estimated or context somewhat unclear
- **0.2**: Revenue barely visible or questionable source
- **0.0**: Revenue not found or completely unclear

=== OUTPUT REQUIREMENTS ===
Return ExtractedFinancialData with ONLY revenue-related data:

```json
{
  "company_name": "[Official Company Name]",
  "document_type": "[10-K|10-Q|Annual Report|etc.]",
  "reporting_period": "[FY 2023|Q1 2024|etc.]", 
  "currency": "[USD|EUR|etc.]",
  "data_points": [
    {
      "field_name": "Total Revenue",
      "value": "$50.3 billion", 
      "category": "Revenue",
      "period": "FY 2023",
      "unit": "USD billions",
      "confidence": 1.0
    },
    {
      "field_name": "Product Revenue", 
      "value": "$30.2 billion",
      "category": "Segment Revenue", 
      "period": "FY 2023",
      "unit": "USD billions",
      "confidence": 0.9
    },
    {
      "field_name": "Americas Revenue",
      "value": "$25.1 billion",
      "category": "Regional Revenue",
      "period": "FY 2023", 
      "unit": "USD billions",
      "confidence": 0.8
    }
  ],
  "summary": "[2-3 sentences describing key revenue findings and trends]"
}
```

=== SUCCESS CRITERIA ===
Extraction is successful ONLY if:
βœ“ Company name extracted (never empty)
βœ“ Total revenue extracted with confidence > 0.5
βœ“ Document type and period identified
βœ“ Currency specified
βœ“ All data points are revenue-related only
βœ“ Summary focuses on revenue insights (2-3 sentences)
βœ“ Segment/regional data sums to total (if present)

=== REVENUE EXTRACTION STRATEGY ===
1. **Income Statement First** - Look for consolidated revenue in primary financial statements
2. **Segment Section Second** - Find detailed segment revenue breakdowns  
3. **Geographic Section Third** - Locate regional revenue data
4. **Management Discussion** - Check for revenue highlights and explanations
5. **Tables Over Text** - Prioritize tabular data over narrative mentions

**Remember**: Focus EXCLUSIVELY on revenue data. Ignore all other financial metrics. Your goal is 100% accuracy on revenue extraction with proper segment and regional breakdowns.