Znilsson commited on
Commit
bf6bc86
Β·
verified Β·
1 Parent(s): d253e3b

Initial SurvivalAI Pro deploy

Browse files
Files changed (1) hide show
  1. README.md +34 -9
README.md CHANGED
@@ -1,16 +1,41 @@
1
  ---
2
- title: Survival Model II
3
- emoji: πŸ’¬
4
- colorFrom: yellow
5
- colorTo: purple
6
  sdk: gradio
7
- sdk_version: 6.5.1
8
  app_file: app.py
9
  pinned: false
10
- hf_oauth: true
11
- hf_oauth_scopes:
12
- - inference-api
13
  license: mit
 
14
  ---
15
 
16
- An example chatbot using [Gradio](https://gradio.app), [`huggingface_hub`](https://huggingface.co/docs/huggingface_hub/v0.22.2/en/index), and the [Hugging Face Inference API](https://huggingface.co/docs/api-inference/index).
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ title: SurvivalAI Pro
3
+ emoji: πŸ•οΈ
4
+ colorFrom: green
5
+ colorTo: gray
6
  sdk: gradio
7
+ sdk_version: 5.0.0
8
  app_file: app.py
9
  pinned: false
 
 
 
10
  license: mit
11
+ short_description: Off-grid survival AI β€” runs fully on CPU, no internet needed
12
  ---
13
 
14
+ # SurvivalAI Pro
15
+
16
+ A fine-tuned Phi-3-mini survival assistant running entirely on CPU.
17
+ Trained on ~150,000 survival Q/A pairs covering medical, water, food,
18
+ shelter, fire, navigation, signaling, foraging, hunting, and tools.
19
+
20
+ ## Setup secrets
21
+
22
+ This Space loads its model from a private HF model repo. Set the following
23
+ secret in **Settings β†’ Variables and secrets**:
24
+
25
+ - `HF_TOKEN` β€” a read-scope HF token with access to `Znilsson/survivalai-phi3-gguf`
26
+
27
+ ## Hardware
28
+
29
+ Designed for **CPU Upgrade** tier (16 GB RAM, 8 vCPU). Performance:
30
+
31
+ - Cold start: ~30–60 s (model download + load)
32
+ - Inference: ~6–10 tokens/sec
33
+ - Typical response: 30–60 sec for 200–400 tokens
34
+
35
+ ## Architecture
36
+
37
+ - Base model: `microsoft/phi-3-mini-4k-instruct`
38
+ - Fine-tune: QLoRA r=16, Ξ±=32, 3 epochs on ~150 k pairs
39
+ - Quantization: Q4_K_M (2.4 GB)
40
+ - Runtime: `llama-cpp-python`
41
+ - UI: Gradio `ChatInterface` with streaming