Commit 70cbf15
1 Parent(s): 4634e2c
nginx endpoints addition - grafana documentation update
- docker/nginx.conf +9 -0
- docs/design_choices.md +6 -5
- docs/milestone_summaries.md +6 -4
- docs/user_guide.md +6 -5
docker/nginx.conf
CHANGED
@@ -81,6 +81,15 @@ http {
             proxy_set_header Host $host;
         }
 
+        # FastAPI Metrics endpoint for Prometheus
+        location /metrics {
+            proxy_pass http://fastapi/metrics;
+            proxy_set_header Host $host;
+            proxy_set_header X-Real-IP $remote_addr;
+            proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
+            proxy_set_header X-Forwarded-Proto $scheme;
+        }
+
         # Grafana
         location = /grafana {
             return 301 /grafana/;
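As a rough illustration of what a scraper receives through the new `/metrics` location, the sketch below parses a few lines of Prometheus text exposition format using only the standard library. The metric names (`http_requests_total`, `http_requests_inprogress`) and their labels are hypothetical; the commit does not show which metrics the FastAPI app actually exports, and this simplified parser ignores timestamps.

```python
import re

# One sample line: metric name, optional {labels}, then the value.
SAMPLE_RE = re.compile(r'^([a-zA-Z_:][a-zA-Z0-9_:]*)(?:\{(.*)\})?\s+(\S+)$')
LABEL_RE = re.compile(r'([a-zA-Z_][a-zA-Z0-9_]*)="((?:[^"\\]|\\.)*)"')


def parse_metrics(text):
    """Return a list of (name, labels_dict, value) tuples from exposition text."""
    samples = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):  # skip blank lines and HELP/TYPE comments
            continue
        match = SAMPLE_RE.match(line)
        if match is None:
            continue  # ignore lines this simplified parser cannot read
        name, raw_labels, value = match.groups()
        labels = dict(LABEL_RE.findall(raw_labels or ""))
        samples.append((name, labels, float(value)))
    return samples


# Hypothetical payload, shaped like a typical /metrics response body.
EXAMPLE = """\
# HELP http_requests_total Total HTTP requests
# TYPE http_requests_total counter
http_requests_total{handler="/predict",status="2xx"} 120
http_requests_total{handler="/predict",status="5xx"} 3
http_requests_inprogress 2
"""

samples = parse_metrics(EXAMPLE)
```

In a running deployment the text would come from an HTTP GET against the proxied `/metrics` path rather than from an inline string.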
docs/design_choices.md
CHANGED
@@ -465,11 +465,12 @@ async def monitor_requests(request, call_next):
 ### Grafana Visualization
 
 **Dashboard Panels:**
-1.
-2.
-3.
-4.
-5.
+1. Request Rate (gauge)
+2. Request Latency p50/p95 (time series)
+3. In-Progress Requests (stat panel)
+4. Error Rate 5xx (stat panel)
+5. Model Prediction Time (time series)
+6. Requests by Endpoint (bar chart)
 
 **Data Sources:**
 - Prometheus: Real-time metrics
docs/milestone_summaries.md
CHANGED
@@ -242,10 +242,12 @@ Jobs:
 
 ### Grafana Dashboards
 
-- **
-- **
-- **
-- **
+- **Request Rate**: Real-time requests per second
+- **Request Latency (p50, p95)**: Response time percentiles
+- **In-Progress Requests**: Currently processing requests
+- **Error Rate (5xx)**: Failed request percentage
+- **Model Prediction Time**: Inference latency
+- **Requests by Endpoint**: Traffic distribution
 
 ### Data Drift Detection
 
docs/user_guide.md
CHANGED
@@ -406,11 +406,12 @@ The pre-configured dashboard includes:
 
 | Panel | Description |
 |-------|-------------|
-|
-|
-|
-|
-|
+| Request Rate | Real-time requests per second |
+| Request Latency (p50, p95) | Response time percentiles |
+| In-Progress Requests | Currently processing requests |
+| Error Rate (5xx) | Percentage of failed requests |
+| Model Prediction Time | Average model inference latency |
+| Requests by Endpoint | Traffic distribution per endpoint |
 
 ### Data Drift Detection
 
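The panel descriptions above refer to percentiles and a failure percentage. The following sketch only illustrates that arithmetic on hypothetical numbers; it is not how the dashboard computes its values, which would normally come from Prometheus queries that this commit does not show. The function names and sample data are assumptions made for illustration.

```python
import math


def error_rate_percent(total_requests, failed_requests):
    """Percentage of requests that failed; 0.0 when there was no traffic."""
    if total_requests == 0:
        return 0.0
    return 100.0 * failed_requests / total_requests


def percentile(values, p):
    """Nearest-rank percentile: smallest value with at least p% of samples at or below it."""
    if not values:
        raise ValueError("values must not be empty")
    ordered = sorted(values)
    rank = max(1, math.ceil(p / 100.0 * len(ordered)))
    return ordered[rank - 1]


# Hypothetical latency samples in milliseconds, including one slow outlier.
latencies_ms = [12, 15, 11, 240, 14, 13, 16, 18, 17, 19]

p50 = percentile(latencies_ms, 50)
p95 = percentile(latencies_ms, 95)
rate = error_rate_percent(total_requests=200, failed_requests=3)
```

With these samples the median stays near the typical latency while the p95 is dominated by the single outlier, which is why the dashboard tracks both.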