Spaces:

build-small-hackathon
/

HearthNet

Running on Zero

App Files Files Community

HearthNet / docs /ENV.md

GitHub Actions

feat: UI theme + voice fix + ENV.md + improvements roadmap

84e964e 19 days ago

preview code

Raw

History Blame Contribute Delete

8.54 kB

A newer version of the Gradio SDK is available: 6.19.0

Upgrade

HearthNet — Environment Variables, Secrets & Models Reference

Last updated: June 15, 2026

Quick Start: Minimum required for HF Space

NVIDIA_API_KEY = nvapi-...          # enables Nemotron + NVIDIA prize track
HEARTHNET_DATA_DIR = /data/hearthnet  # persistent storage (needs Persistent Storage enabled)

Everything else has sensible defaults.

All Environment Variables

🔴 Secrets (never commit, use HF Space secrets)

Variable	Purpose	Example
`NVIDIA_API_KEY`	NVIDIA NIM API key — activates NemotronBackend, enables all `nvidia/*` models. Get free at build.nvidia.com	`nvapi-abc123...`
`MODAL_TOKEN`	Modal API token — needed only if `modal deploy` is used. Set automatically by Modal CLI.	`ak-...`
`HEARTHNET_HF_TOKEN`	HuggingFace Inference API token — activates `HfApiBackend` for cloud inference. Get at hf.co/settings/tokens	`hf_abc...`
`ANTHROPIC_API_KEY`	Anthropic Claude API — activates `AnthropicBackend`. Not used by default.	`sk-ant-...`

🟡 Configuration (safe to set in Space settings, not truly secret)

Variable	Default	Purpose	Where read
`MODEL_ID`	`openbmb/MiniCPM3-4B`	HF Transformers model to load locally	`app.py:49`
`MODEL_REVISION`	`None`	Model git revision/hash to pin	`app.py:50`
`HEARTHNET_DATA_DIR`	`tempfile.gettempdir()`	Base directory for RAG corpora, BLAKE3 blobs, relay DB. Set to `/data/hearthnet` when Persistent Storage is enabled on the Space.	`app.py:352`
`SPACE_TITLE`	`HearthNet Space (xxxx)`	Display name shown in the UI header and relay roster	`app.py:186`
`MODAL_ENDPOINT`	`""`	URL of deployed Modal LLM endpoint — activates `ModalBackend`	`app.py:273`
`MODAL_MODEL`	`HuggingFaceTB/SmolLM2-1.7B-Instruct`	Model served by the Modal endpoint	`modal_backend.py:65`
`MINICPM_URL`	`""`	OpenAI-compatible endpoint for a local/remote MiniCPM vLLM server	`app.py:287`
`MINICPM_MODELS`	`""`	Comma-separated model names exposed by `MINICPM_URL`	`app.py:292`
`MINICPM_LIGHTWEIGHT`	`""`	Set to `1` or `true` to force SmolLM2-135M instead of MiniCPM3-4B	`app.py:294`
`NEMOTRON_URL`	`""`	Local NVIDIA NIM endpoint (e.g. `http://localhost:8000`) — used by both `app.py` and `app_nemotron.py`	`app_nemotron.py:31`
`HEARTHNET_NODE`	`""`	URL of a HearthNet mesh node to connect to (used by `app_nemotron.py` Push-to-Mesh tab)	`app_nemotron.py:29`
`PORT`	`7869`	Port for `app_nemotron.py` standalone server	`app_nemotron.py:506`
`SPACE_HOST`	`""`	Auto-set by HF — used internally to detect ZeroGPU context	`app.py:181`
`GRADIO_SSR_MODE`	`false`	Forced to `false` to prevent Node.js intercepting custom FastAPI routes	`app.py:565`

All Models

Prize Track Mapping

Prize	Requirement	Models to Use
🐜 Tiny Titan	≤ 32B parameters	MiniCPM3-4B (4B) ✅, SmolLM2-135M (135M) ✅, Nemotron-nano-8B (8B) ✅
🔬 NVIDIA Nemotron	Use Nemotron models	`nvidia/llama-3.1-nemotron-nano-8b-instruct` ✅
🏭 OpenBMB	Use MiniCPM models	`openbmb/MiniCPM3-4B` ✅, `openbmb/MiniCPM4-8B` ✅
⚡ Modal	Deploy on Modal	Any model via `scripts/modal_deploy.py` ✅
🎨 Off Brand	Custom UI/theme	Custom CSS + purple gradient ✅

Models by Backend

Local (runs on HF Space / Pi / laptop)

Model	Size	Backend	Set via	Notes
`openbmb/MiniCPM3-4B`	4B	`HfLocalBackend`	`MODEL_ID` env	Default. OpenBMB + Tiny Titan eligible
`HuggingFaceTB/SmolLM2-135M-Instruct`	135M	`HfLocalBackend`	`MODEL_ID=HuggingFaceTB/SmolLM2-135M-Instruct`	Pi Zero / ultra-light mode
`HuggingFaceTB/SmolLM2-1.7B-Instruct`	1.7B	`HfLocalBackend`	`MODEL_ID=...`	Good balance on CPU
`openbmb/MiniCPM4-8B`	8B	`HfLocalBackend`	`MODEL_ID=openbmb/MiniCPM4-8B`	Faster than MiniCPM3, better quality
`openbmb/MiniCPM-V-2_6`	8B	`HfLocalBackend`	`MODEL_ID=openbmb/MiniCPM-V-2_6`	Multimodal — vision + text

NVIDIA NIM (cloud — needs `NVIDIA_API_KEY`)

Model	Size	Prize eligible	Best for
`nvidia/llama-3.1-nemotron-nano-8b-instruct`	8B	Tiny Titan ✅	Fast, edge reasoning
`nvidia/nemotron-mini-4b-instruct`	4B	Tiny Titan ✅	Smallest Nemotron
`nvidia/nemotron-3-nano-30b-a3b`	30B MoE (3B active)	Tiny Titan ✅	MoE routing brain
`nvidia/llama-3.3-nemotron-super-49b-v1`	49B	NVIDIA track	Best reasoning
`nvidia/llama-3.1-nemotron-70b-instruct`	70B	NVIDIA track	Highest quality
`nvidia/nemotron-nano-12b-v2-vl`	12B	Tiny Titan ✅	Vision-language

Contest note: For Tiny Titan prize (≤32B), use: nano-8B, nano-4B, nano-30B-a3b, or nano-12b-v2-vl. The 49B and 70B models exceed the 32B limit and only qualify for the main Nemotron hardware prize.

OpenBMB (cloud — needs `MINICPM_URL` pointing to a vLLM server)

Model	Size	Notes
`openbmb/MiniCPM4-8B`	8B	Best OpenBMB model for 2026
`openbmb/MiniCPM3-4B`	4B	Default, runs locally

Modal (serverless GPU — needs `MODAL_ENDPOINT`)

Model	Size	Set via
`HuggingFaceTB/SmolLM2-1.7B-Instruct`	1.7B	Hardcoded in `scripts/modal_deploy.py:23`
Any HF model	—	Change `MODEL_ID` in `scripts/modal_deploy.py` and redeploy

Other (cloud)

Backend	Model	Needs
`HfApiBackend`	`HuggingFaceH4/zephyr-7b-beta`	`HEARTHNET_HF_TOKEN`
`AnthropicBackend`	`claude-3-haiku-20240307`	`ANTHROPIC_API_KEY`

Recommended Configurations

HF Space (current, contest submission)

MODEL_ID = openbmb/MiniCPM3-4B          # OpenBMB + Tiny Titan prizes
NVIDIA_API_KEY = nvapi-...               # Nemotron Hardware Prize
HEARTHNET_DATA_DIR = /data/hearthnet     # Persistent Storage (if enabled)
SPACE_TITLE = HearthNet                  # Display name

Pi Zero / Ultra-Light Mode

MODEL_ID = HuggingFaceTB/SmolLM2-135M-Instruct   # 135M, fits in 512MB RAM
MINICPM_LIGHTWEIGHT = 1

Full Local Stack (laptop/desktop)

MODEL_ID = openbmb/MiniCPM4-8B          # Best local model
NVIDIA_API_KEY = nvapi-...               # For Nemotron fallback
NEMOTRON_URL = http://localhost:8000     # Local NIM server (optional)
HEARTHNET_DATA_DIR = ~/.hearthnet/data   # Persistent local data

With Modal GPU Backend

MODAL_ENDPOINT = https://your-org--hearthnet-llm-chat.modal.run
# Deploy first: modal deploy scripts/modal_deploy.py

HF Space Secrets Checklist

[x] NVIDIA_API_KEY        — free at build.nvidia.com, no credit card
[ ] HEARTHNET_DATA_DIR    — set to /data/hearthnet after enabling Persistent Storage
[ ] SPACE_TITLE           — optional display name override
[ ] MODAL_ENDPOINT        — after running: modal deploy scripts/modal_deploy.py
[ ] MINICPM_URL           — if running a separate vLLM server with MiniCPM4-8B

Model Size Quick Reference

SmolLM2-135M      ████░░░░░░░░░░░░░░░░  135M    — Pi Zero, embedded
MiniCPM3-4B       ████████░░░░░░░░░░░░  4B      — Default ★
SmolLM2-1.7B      █████████░░░░░░░░░░░  1.7B    — CPU laptop
Nemotron-nano-4B  ████████░░░░░░░░░░░░  4B      — Tiny Titan ★
MiniCPM4-8B       ████████████░░░░░░░░  8B      — Best quality local
Nemotron-nano-8B  ████████████░░░░░░░░  8B      — Tiny Titan ★
Nemotron-12B-VL   █████████████████░░░  12B     — Vision+text
Nemotron-30B-a3b  ██████████████░░░░░░  30B MoE (3B active) — Tiny Titan ★
                                ↑ 32B Tiny Titan limit ↑
Nemotron-Super-49B███████████████████░  49B     — NVIDIA prize only
Nemotron-70B      ████████████████████  70B     — NVIDIA prize only