Robuchan HF Space Deployment Runbook

Deploy the Gradio demo to sumitdotml/robuchan-demo on Hugging Face Spaces.

Install the Hugging Face CLI:

curl -LsSf https://hf.co/cli/install.sh | bash

Or via uvx (no install needed):

uvx hf --help

Log in with a token that has write access:

hf auth login
# Paste your token when prompted (needs write access)
# Say yes to saving it as a git credential

Verify:

hf auth whoami
# Should print: sumitdotml

Create the Space:

hf repos create robuchan-demo --repo-type space --space-sdk gradio

Expected output:

Successfully created sumitdotml/robuchan-demo on the Hub.
Your repo is now available at https://huggingface.co/spaces/sumitdotml/robuchan-demo

If the Space already exists, add --exist-ok:

hf repos create robuchan-demo --repo-type space --space-sdk gradio --exist-ok

From the repo root:

hf upload sumitdotml/robuchan-demo demo/space . --repo-type space \
  --commit-message "Initial Gradio demo for robuchan recipe adapter"

This uploads the contents of demo/space/ (app.py, requirements.txt, README.md) to the root of the Space repo.
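Before uploading, it can help to confirm that demo/space/ actually holds the three files listed above. The following is a minimal sketch, not part of the repo: `missing_space_files` and `upload_space` are hypothetical helper names, and the upload step assumes the `huggingface_hub` package is installed and uses `HfApi.upload_folder` as a stand-in for the `hf upload` command.

```python
from pathlib import Path

# Files the runbook says demo/space/ should contain.
EXPECTED_FILES = ("app.py", "requirements.txt", "README.md")


def missing_space_files(folder, expected=EXPECTED_FILES):
    """Return the expected file names that are absent from `folder`."""
    root = Path(folder)
    return [name for name in expected if not (root / name).is_file()]


def upload_space(folder="demo/space", repo_id="sumitdotml/robuchan-demo",
                 message="description of changes"):
    """Upload `folder` to the root of the Space, refusing if files are missing."""
    missing = missing_space_files(folder)
    if missing:
        raise FileNotFoundError(f"{folder} is missing: {', '.join(missing)}")
    # Imported lazily so the file check works without huggingface_hub installed.
    from huggingface_hub import HfApi
    return HfApi().upload_folder(
        folder_path=folder,
        repo_id=repo_id,
        repo_type="space",
        commit_message=message,
    )
```

Calling `missing_space_files("demo/space")` from the repo root should return an empty list when the folder is complete.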

Expected output:

https://huggingface.co/spaces/sumitdotml/robuchan-demo/tree/main/

The README.md frontmatter includes suggested_hardware: t4-small, but you may need to set the hardware manually in the Space settings. This requires HF Pro or a hardware grant.
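To confirm that the frontmatter really declares the hardware hint before uploading, a small check like the one below could be used. It is a sketch under assumptions: `frontmatter_value` is a hypothetical helper, it only handles flat `key: value` lines rather than full YAML, and the sample frontmatter is made up for illustration (only `suggested_hardware: t4-small` comes from this runbook).

```python
def frontmatter_value(readme_text, key):
    """Return the value of a top-level `key: value` line in the frontmatter, or None."""
    lines = readme_text.splitlines()
    # Frontmatter must open with a '---' line at the very top of the file.
    if not lines or lines[0].strip() != "---":
        return None
    for line in lines[1:]:
        if line.strip() == "---":  # closing delimiter, stop scanning
            break
        name, sep, value = line.partition(":")
        if sep and name.strip() == key:
            return value.strip()
    return None


# Illustrative frontmatter; the title and sdk lines are assumptions.
SAMPLE = """---
title: Robuchan Demo
sdk: gradio
suggested_hardware: t4-small
---
# Demo
"""

print(frontmatter_value(SAMPLE, "suggested_hardware"))
```

In practice the text would come from `demo/space/README.md`; a `None` result means the hint is missing and the hardware has to be chosen by hand.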

The Space builds automatically on push. Watch the build log while it runs; a build typically takes 3-5 minutes (dependency install plus model download on first boot).
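If you would rather wait on the build from a script than watch the log, polling the Space stage is one option. The sketch below is an assumption-laden example, not the project's tooling: `wait_for_space` and `hub_stage` are hypothetical names, the 10-minute timeout and 15-second interval are arbitrary choices, the stage strings are assumed to match what the Hub reports, and `hub_stage` assumes `huggingface_hub` is installed.

```python
import time

# Stage names assumed to match those reported by the Hub for a Space.
DONE_STAGES = {"RUNNING"}
FAILED_STAGES = {"BUILD_ERROR", "CONFIG_ERROR", "RUNTIME_ERROR", "NO_APP_FILE"}


def wait_for_space(get_stage, timeout_s=600, poll_s=15, sleep=time.sleep):
    """Poll `get_stage()` until the Space runs, fails, or the timeout passes.

    Returns the terminal stage seen, or "TIMEOUT".
    """
    waited = 0
    while waited <= timeout_s:
        stage = get_stage()
        if stage in DONE_STAGES or stage in FAILED_STAGES:
            return stage
        sleep(poll_s)
        waited += poll_s
    return "TIMEOUT"


def hub_stage(repo_id="sumitdotml/robuchan-demo"):
    """Fetch the current stage from the Hub (needs huggingface_hub and network)."""
    from huggingface_hub import HfApi
    stage = HfApi().get_space_runtime(repo_id).stage
    # The stage may be an enum member or a plain string; normalise to a string.
    return getattr(stage, "value", stage)
```

A typical call would be `wait_for_space(hub_stage)`. Passing the stage getter as an argument keeps the polling loop testable without network access.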

Run both pre-loaded examples:

| Example | Constraint | Check |
| --- | --- | --- |
| Tonkotsu ramen | vegan | No pork/eggs/animal products in adapted recipe |
| Japanese curry | gluten_free | No wheat flour/soy sauce in adapted recipe |

Also test a custom input: paste any recipe, select a constraint, verify structured output.
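The checks above can be partly automated with a keyword scan over the adapted recipe text. This is only a rough sketch: `violations` and the `FORBIDDEN` lists are hypothetical, the terms are taken from the "Check" column and are far from exhaustive, and a keyword match cannot tell "pork" from "replaces the pork" or ordinary soy sauce from a gluten-free one, so a manual read is still needed.

```python
import re

# Terms taken from the "Check" column; a real check would need longer lists.
FORBIDDEN = {
    "vegan": ["pork", "egg", "eggs"],
    "gluten_free": ["wheat flour", "soy sauce"],
}


def violations(adapted_recipe, constraint):
    """Return forbidden terms that appear as whole words in the adapted recipe."""
    text = adapted_recipe.lower()
    found = []
    for term in FORBIDDEN[constraint]:
        # \b keeps "egg" from matching inside words such as "eggplant".
        if re.search(rf"\b{re.escape(term)}\b", text):
            found.append(term)
    return found


print(violations("Simmer pork bones, then add 2 eggs.", "vegan"))
print(violations("Grilled eggplant with mushroom broth.", "vegan"))
```

An empty list means no listed term was found, which is a necessary but not sufficient sign that the adaptation respects the constraint.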

After editing files in demo/space/, re-upload:

hf upload sumitdotml/robuchan-demo demo/space . --repo-type space \
  --commit-message "description of changes"

| Symptom | Fix |
| --- | --- |
| Build fails on `bitsandbytes` | It needs the CUDA runtime. Verify that the hardware is set to T4, not CPU. |
| OOM during model load | The T4 has 16 GB of VRAM, and an 8B model in 4-bit quantization should fit. If it still runs out of memory, check that no other process is using the GPU, then restart the Space. |
| "Model not found" error | Verify that the `sumitdotml/robuchan` adapter is public (or set `HF_TOKEN` as a Space secret). |
| Space stuck on "Building" | Check the build logs for pip install errors. You may need to pin a specific torch version in requirements.txt. |
| Slow first inference | Expected. The first request triggers CUDA kernel compilation; subsequent requests are faster. |
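The OOM row rests on simple arithmetic, sketched below. It counts model weights only, which is an assumption that understates real usage: activations, the KV cache, quantization metadata, and the CUDA context all add to it. `weight_gib` is a hypothetical helper, and the 16 GB figure is treated as GiB for simplicity.

```python
def weight_gib(n_params, bits_per_param):
    """Approximate size of the model weights alone, in GiB."""
    return n_params * bits_per_param / 8 / 2**30


T4_VRAM_GIB = 16  # from the table above; treated as GiB here for simplicity

four_bit = weight_gib(8e9, 4)    # 8B parameters at 4 bits each
half_prec = weight_gib(8e9, 16)  # same model at 16 bits, for comparison

print(f"4-bit: {four_bit:.1f} GiB, 16-bit: {half_prec:.1f} GiB, budget: {T4_VRAM_GIB} GiB")
# prints "4-bit: 3.7 GiB, 16-bit: 14.9 GiB, budget: 16 GiB"
```

At 4 bits the weights leave most of the card free, whereas at 16 bits they would nearly fill it. That is why an OOM here points to something else holding GPU memory, or to quantization not being applied.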