Spaces:

AshwinP
/

compounding-test

Running on Zero

App Files Files Community

compounding-test / README.md

apingali

fix(hf-space): bump Gradio to 5.x to fix HfFolder ImportError

69d1664 4 days ago

preview code

raw

history blame contribute delete

2.94 kB

	---
	title: The Compounding Test
	emoji: 📈
	colorFrom: indigo
	colorTo: purple
	sdk: gradio
	sdk_version: 5.9.1
	app_file: app.py
	pinned: false
	license: mit
	short_description: A diagnostic for AI investments at non-tech companies.
	hardware: zero-a10g
	---

	# The Compounding Test

	A diagnostic for AI investments at non-technology companies. Paste a
	description of your AI initiative (200–5000 words); receive a scored
	writeup in one of four quadrants — compounder, one-shot win,
	compounding the wrong thing, or Roman Candle.

	Framework essay: <https://www.mile-hi.ai/journal/the-berkshire-test>

	## Backends

	The Space supports three interchangeable model backends. The dropdown
	in the UI lets you switch per-submission to compare writeup quality.

	\| Backend \| Model (default) \| Credentials \| Where it runs \|
	\|---\|---\|---\|---\|
	\| `anthropic` \| `claude-opus-4-7` \| `ANTHROPIC_API_KEY` (Space secret) \| Anthropic API \|
	\| `huggingface` \| `google/gemma-2-9b-it` \| none on a Space; `HF_TOKEN` locally \| HF Inference Providers \|
	\| `zerogpu` \| `microsoft/Phi-4-mini-instruct` \| none — Pro plan handles it \| On-Space ZeroGPU \|

	Auto-detect precedence:

	1. Explicit `MODEL_PROVIDER` env var wins.
	2. On a Pro Space (zerogpu deps installed) → `zerogpu`.
	3. Else if `ANTHROPIC_API_KEY` is set → `anthropic`.
	4. Else if `HF_TOKEN` is set, or running on any Space → `huggingface`.
	5. Else fall through to `anthropic` (call-time error guides the user).

	## Configuration

	See `.env.example` for the full list of env vars. Common overrides:

	```
	MODEL_PROVIDER=zerogpu
	ZEROGPU_MODEL_ID=google/gemma-2-9b-it # bigger; ~30s cold start on A10
	ZEROGPU_DURATION_SECONDS=120 # max GPU allocation per request
	HF_MODEL_ID=meta-llama/Llama-3.3-70B-Instruct
	MODEL_ID=claude-sonnet-4-6 # cheaper Anthropic fallback
	```

	## Local development

	```bash
	python3 -m venv .venv && source .venv/bin/activate
	pip install -r requirements.txt # ~2GB with torch/transformers
	cp .env.example .env # fill in whatever you have
	python app.py # http://127.0.0.1:7860
	```

	If you only need to test the `anthropic` backend locally, you can skip
	the heavy `spaces` / `torch` / `transformers` / `accelerate` lines in
	`requirements.txt` — the app degrades gracefully (the zerogpu dropdown
	option won't appear).

	## Tests

	```bash
	pytest test_diagnose.py -v
	```

	31 tests covering the parser contract (15 — what JSON shapes the parser
	accepts and rejects) and the provider routing (16 — auto-detection
	precedence, dispatcher routing, env-driven overrides).

	## Repository

	Source lives in [apingali/effectiveness][repo] under
	`gradio-apps/compounding-test/`. The Space is deployed from that path.
	The published framework essay and four portrait articles live at
	<https://www.mile-hi.ai/journal/the-berkshire-test>.

	[repo]: https://github.com/apingali/effectiveness