maurocarlu commited on
Commit
70cbf15
·
1 Parent(s): 4634e2c

nginx endpoints addition - grafana documentation update

Browse files
docker/nginx.conf CHANGED
@@ -81,6 +81,15 @@ http {
81
  proxy_set_header Host $host;
82
  }
83
 
 
 
 
 
 
 
 
 
 
84
  # Grafana
85
  location = /grafana {
86
  return 301 /grafana/;
 
81
  proxy_set_header Host $host;
82
  }
83
 
84
+ # FastAPI Metrics endpoint for Prometheus
85
+ location /metrics {
86
+ proxy_pass http://fastapi/metrics;
87
+ proxy_set_header Host $host;
88
+ proxy_set_header X-Real-IP $remote_addr;
89
+ proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
90
+ proxy_set_header X-Forwarded-Proto $scheme;
91
+ }
92
+
93
  # Grafana
94
  location = /grafana {
95
  return 301 /grafana/;
docs/design_choices.md CHANGED
@@ -465,11 +465,12 @@ async def monitor_requests(request, call_next):
465
  ### Grafana Visualization
466
 
467
  **Dashboard Panels:**
468
- 1. API Request Rate (time series)
469
- 2. API Latency Percentiles (heatmap)
470
- 3. Drift Detection Status (stat panel)
471
- 4. Drift P-Value Trend (time series)
472
- 5. Error Rate (gauge)
 
473
 
474
  **Data Sources:**
475
  - Prometheus: Real-time metrics
 
465
  ### Grafana Visualization
466
 
467
  **Dashboard Panels:**
468
+ 1. Request Rate (gauge)
469
+ 2. Request Latency p50/p95 (time series)
470
+ 3. In-Progress Requests (stat panel)
471
+ 4. Error Rate 5xx (stat panel)
472
+ 5. Model Prediction Time (time series)
473
+ 6. Requests by Endpoint (bar chart)
474
 
475
  **Data Sources:**
476
  - Prometheus: Real-time metrics
docs/milestone_summaries.md CHANGED
@@ -242,10 +242,12 @@ Jobs:
242
 
243
  ### Grafana Dashboards
244
 
245
- - **API Request Rate**: Real-time requests per second
246
- - **API Latency**: P50, P90, P99 percentiles
247
- - **Drift Detection Status**: Binary indicator (0/1)
248
- - **Drift P-Value**: Statistical significance metric
 
 
249
 
250
  ### Data Drift Detection
251
 
 
242
 
243
  ### Grafana Dashboards
244
 
245
+ - **Request Rate**: Real-time requests per second
246
+ - **Request Latency (p50, p95)**: Response time percentiles
247
+ - **In-Progress Requests**: Currently processing requests
248
+ - **Error Rate (5xx)**: Failed request percentage
249
+ - **Model Prediction Time**: Inference latency
250
+ - **Requests by Endpoint**: Traffic distribution
251
 
252
  ### Data Drift Detection
253
 
docs/user_guide.md CHANGED
@@ -406,11 +406,12 @@ The pre-configured dashboard includes:
406
 
407
  | Panel | Description |
408
  |-------|-------------|
409
- | API Request Rate | Real-time requests per endpoint |
410
- | API Latency | Response time distribution |
411
- | Drift Detection Status | Binary indicator (0=No Drift, 1=Drift) |
412
- | Drift P-Value | Statistical significance |
413
- | Drift Distance | KS test distance metric |
 
414
 
415
  ### Data Drift Detection
416
 
 
406
 
407
  | Panel | Description |
408
  |-------|-------------|
409
+ | Request Rate | Real-time requests per second |
410
+ | Request Latency (p50, p95) | Response time percentiles |
411
+ | In-Progress Requests | Currently processing requests |
412
+ | Error Rate (5xx) | Percentage of failed requests |
413
+ | Model Prediction Time | Average model inference latency |
414
+ | Requests by Endpoint | Traffic distribution per endpoint |
415
 
416
  ### Data Drift Detection
417