Spaces:
Sleeping
Sleeping
Initial SurvivalAI Pro deploy
Browse files
README.md
CHANGED
|
@@ -1,16 +1,41 @@
|
|
| 1 |
---
|
| 2 |
-
title:
|
| 3 |
-
emoji:
|
| 4 |
-
colorFrom:
|
| 5 |
-
colorTo:
|
| 6 |
sdk: gradio
|
| 7 |
-
sdk_version:
|
| 8 |
app_file: app.py
|
| 9 |
pinned: false
|
| 10 |
-
hf_oauth: true
|
| 11 |
-
hf_oauth_scopes:
|
| 12 |
-
- inference-api
|
| 13 |
license: mit
|
|
|
|
| 14 |
---
|
| 15 |
|
| 16 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
+
title: SurvivalAI Pro
|
| 3 |
+
emoji: ποΈ
|
| 4 |
+
colorFrom: green
|
| 5 |
+
colorTo: gray
|
| 6 |
sdk: gradio
|
| 7 |
+
sdk_version: 5.0.0
|
| 8 |
app_file: app.py
|
| 9 |
pinned: false
|
|
|
|
|
|
|
|
|
|
| 10 |
license: mit
|
| 11 |
+
short_description: Off-grid survival AI β runs fully on CPU, no internet needed
|
| 12 |
---
|
| 13 |
|
| 14 |
+
# SurvivalAI Pro
|
| 15 |
+
|
| 16 |
+
A fine-tuned Phi-3-mini survival assistant running entirely on CPU.
|
| 17 |
+
Trained on ~150,000 survival Q/A pairs covering medical, water, food,
|
| 18 |
+
shelter, fire, navigation, signaling, foraging, hunting, and tools.
|
| 19 |
+
|
| 20 |
+
## Setup secrets
|
| 21 |
+
|
| 22 |
+
This Space loads its model from a private HF model repo. Set the following
|
| 23 |
+
secret in **Settings β Variables and secrets**:
|
| 24 |
+
|
| 25 |
+
- `HF_TOKEN` β a read-scope HF token with access to `Znilsson/survivalai-phi3-gguf`
|
| 26 |
+
|
| 27 |
+
## Hardware
|
| 28 |
+
|
| 29 |
+
Designed for **CPU Upgrade** tier (16 GB RAM, 8 vCPU). Performance:
|
| 30 |
+
|
| 31 |
+
- Cold start: ~30β60 s (model download + load)
|
| 32 |
+
- Inference: ~6β10 tokens/sec
|
| 33 |
+
- Typical response: 30β60 sec for 200β400 tokens
|
| 34 |
+
|
| 35 |
+
## Architecture
|
| 36 |
+
|
| 37 |
+
- Base model: `microsoft/phi-3-mini-4k-instruct`
|
| 38 |
+
- Fine-tune: QLoRA r=16, Ξ±=32, 3 epochs on ~150 k pairs
|
| 39 |
+
- Quantization: Q4_K_M (2.4 GB)
|
| 40 |
+
- Runtime: `llama-cpp-python`
|
| 41 |
+
- UI: Gradio `ChatInterface` with streaming
|