Adds Better Stack uptime monitoring details
Browse filesDocuments the use of Better Stack for production uptime monitoring,
including monitored endpoints, alerting setup, and evidence location.
- monitoring/README.md +22 -0
monitoring/README.md
CHANGED
|
@@ -34,3 +34,25 @@ Shows the 90th percentile of model prediction time.
|
|
| 34 |
```promql
|
| 35 |
hopcroft_prediction_processing_seconds{quantile="0.9"}
|
| 36 |
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 34 |
```promql
|
| 35 |
hopcroft_prediction_processing_seconds{quantile="0.9"}
|
| 36 |
```
|
| 37 |
+
|
| 38 |
+
---
|
| 39 |
+
|
| 40 |
+
## Uptime Monitoring (Better Stack)
|
| 41 |
+
|
| 42 |
+
We used Better Stack Uptime to monitor the availability of the production deployment hosted on Hugging Face Spaces.
|
| 43 |
+
|
| 44 |
+
**Base URL**
|
| 45 |
+
- https://dacrow13-hopcroft-skill-classification.hf.space
|
| 46 |
+
|
| 47 |
+
**Monitored endpoints**
|
| 48 |
+
- https://dacrow13-hopcroft-skill-classification.hf.space/health
|
| 49 |
+
- https://dacrow13-hopcroft-skill-classification.hf.space/openapi.json
|
| 50 |
+
- https://dacrow13-hopcroft-skill-classification.hf.space/docs
|
| 51 |
+
|
| 52 |
+
**Checks and alerts**
|
| 53 |
+
- Monitors are configured to run from multiple locations.
|
| 54 |
+
- Email notifications are enabled for failures.
|
| 55 |
+
- A failure scenario was tested to confirm Better Stack reports the server error details.
|
| 56 |
+
|
| 57 |
+
**Evidence**
|
| 58 |
+
- Screenshots are available in `monitoring/screenshots/`.
|