How To Extend The Workbench

Add A New Model

Open config/models.yaml.
Add a new model entry.
Set type to text, vision, or omnimodal.
Keep parameters_b at or below 32 for hackathon eligibility.
Keep backend: placeholder until a real service supports it.

Example:

models:
  my_model:
    hf_id: org/model-name
    display_name: My Model
    type: text
    parameters_b: 7
    backend: placeholder
    context_length: 32768
    local_first: true
    notes: Why this model is useful.
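A quick sanity check can catch an ineligible entry before the app loads it. The sketch below is only illustrative: `validate_model_entry` and the constants are hypothetical names, not part of the workbench, and it takes a plain dict (as a YAML loader would produce) rather than reading config/models.yaml itself.

```python
ALLOWED_TYPES = {"text", "vision", "omnimodal"}
MAX_PARAMETERS_B = 32  # hackathon eligibility limit from the steps above


def validate_model_entry(name: str, entry: dict) -> list[str]:
    """Return a list of problems; an empty list means the entry looks fine."""
    problems = []
    if not entry.get("hf_id"):
        problems.append(f"{name}: hf_id is missing")
    if entry.get("type") not in ALLOWED_TYPES:
        problems.append(f"{name}: type must be one of {sorted(ALLOWED_TYPES)}")
    params = entry.get("parameters_b")
    if not isinstance(params, (int, float)) or params > MAX_PARAMETERS_B:
        problems.append(f"{name}: parameters_b must be a number <= {MAX_PARAMETERS_B}")
    return problems


# The example entry from above, as a parsed dict.
example = {
    "hf_id": "org/model-name",
    "display_name": "My Model",
    "type": "text",
    "parameters_b": 7,
    "backend": "placeholder",
}
print(validate_model_entry("my_model", example))  # prints []
```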

Add A Real Backend

Create a service in models/, for example:

models/ollama_service.py

The service should expose a small interface:

class OllamaService:
    def chat(self, system_prompt: str, user_prompt: str) -> str:
        ...

Then update the service factory or relevant UI tab to choose between placeholder and real services.
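One possible shape for that factory, as a minimal sketch. `ChatService`, `PlaceholderService`, `make_service`, and the `"ollama"` backend key are assumptions made for illustration; the workbench's real factory may be organized differently. The `OllamaService` body is left unimplemented on purpose.

```python
from typing import Protocol


class ChatService(Protocol):
    """The small interface every backend is expected to expose."""

    def chat(self, system_prompt: str, user_prompt: str) -> str: ...


class PlaceholderService:
    """Stand-in that echoes the prompt instead of calling a model."""

    def chat(self, system_prompt: str, user_prompt: str) -> str:
        return f"[placeholder] {user_prompt}"


class OllamaService:
    """Real backend stub; the call to a local server is not shown here."""

    def chat(self, system_prompt: str, user_prompt: str) -> str:
        raise NotImplementedError("wire this to the local model server")


# Hypothetical registry keyed by the `backend` value in config/models.yaml.
_SERVICES = {"placeholder": PlaceholderService, "ollama": OllamaService}


def make_service(backend: str) -> ChatService:
    """Return a service for the backend, falling back to the placeholder."""
    return _SERVICES.get(backend, PlaceholderService)()
```

Falling back to the placeholder for unknown backends keeps the UI usable while a service is still being written; raising an error instead is an equally reasonable choice.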

Add A New Gradio Tab

Create ui/new_tab.py.
Add a build_new_tab(...) function.
Import it in app.py.
Add it inside the gr.Tabs() block.
Update docs/ARCHITECTURE.md.
Add a checklist item in docs/TASKS.md.
Update docs/IMPLEMENTATION_STATUS.md.

Add Field Notes Export

The next useful upgrade for field notes:

Add a button to export data/field_notes.csv to JSONL.
Add a button to upload that JSONL as a Hugging Face Dataset.
Document the dataset schema in README.md.

Suggested JSONL schema:

{"model_id":"minicpm5_1b","prompt":"...","response":"...","correction":"...","tags":["demo"]}
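The export step could look roughly like the sketch below. It assumes the CSV has columns named after the schema fields and that tags share one cell separated by `;`; neither detail is confirmed by the current data/field_notes.csv, and `csv_to_jsonl` is a hypothetical helper name.

```python
import csv
import io
import json


def csv_to_jsonl(csv_text: str) -> str:
    """Convert field-notes CSV text into JSONL, one JSON object per row."""
    reader = csv.DictReader(io.StringIO(csv_text))
    lines = []
    for row in reader:
        # Assumption: tags are stored in a single cell, separated by ";".
        tags = [t.strip() for t in (row.get("tags") or "").split(";") if t.strip()]
        record = {
            "model_id": row.get("model_id") or "",
            "prompt": row.get("prompt") or "",
            "response": row.get("response") or "",
            "correction": row.get("correction") or "",
            "tags": tags,
        }
        lines.append(json.dumps(record, ensure_ascii=False))
    # End with a newline only when there is at least one record.
    return "\n".join(lines) + ("\n" if lines else "")
```

Working on text rather than file paths keeps the function easy to call from a Gradio button handler, which can read the CSV and write the result wherever the export should go.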

Add OCR Corrections

The local OCR extension starts from prediction files rather than running an OCR engine directly. Use .csv, .jsonl, or .ndjson rows with fields like:

{"source_path":"receipt.png","text":"Tota1 12.30","confidence":0.54}

The Field Notes tab can preview uncertain rows, import them as correction tasks, and export corrected OCR rows to data/ocr_corrections.jsonl. The intended wiring is:

OCR predictions -> uncertain Field Notes -> corrected JSONL/HF Dataset -> training/evaluation
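The first hop of that pipeline, picking out uncertain predictions, can be sketched as below. The 0.6 cutoff and the `uncertain_rows` name are assumptions; this document does not say what confidence the Field Notes tab treats as uncertain.

```python
import json

# Hypothetical cutoff; the real threshold is not stated in this document.
CONFIDENCE_THRESHOLD = 0.6


def uncertain_rows(jsonl_text: str, threshold: float = CONFIDENCE_THRESHOLD) -> list[dict]:
    """Return the OCR prediction rows whose confidence is below the threshold."""
    rows = []
    for line in jsonl_text.splitlines():
        if not line.strip():
            continue  # skip blank lines between records
        row = json.loads(line)
        # Assumption: a row with no confidence value counts as uncertain.
        if row.get("confidence", 0.0) < threshold:
            rows.append(row)
    return rows
```

With the example row above (confidence 0.54) and the assumed cutoff, the `Tota1 12.30` prediction would be queued as a correction task.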

Add VINDEX Execution

The current VINDEX integration is a safety boundary, not an edit runner. It validates the eight PRD methods, builds non-executing local FastAPI call plans, and reports whether a local VINDEX package is installed or a local server responds at http://127.0.0.1:8765/health.

Before allowing execution:

Verify the local VINDEX package or FastAPI server.
Re-check the PRD bug list: GPU cache cleanup, dead-code paths, star-spread over-editing, and causal-window limits.
Keep star_spread.n_neighbors <= 5 and calibrated_edit.causal_window <= 3 until the scaling formula is validated.
Add protected-relation tests for every edit workflow.
Only then add an explicit user-triggered execute button or MCP tool.
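The parameter limits in the checklist above lend themselves to a small guard that runs before any call plan is executed. This is a sketch only: `check_edit_limits` is a hypothetical helper, and the assumption that each method receives its parameters as a flat dict is not taken from the VINDEX API.

```python
# Upper bounds taken from the checklist above.
LIMITS = {
    ("star_spread", "n_neighbors"): 5,
    ("calibrated_edit", "causal_window"): 3,
}


def check_edit_limits(method: str, params: dict) -> list[str]:
    """Return a message for every parameter that exceeds its limit."""
    violations = []
    for (limited_method, key), maximum in LIMITS.items():
        if method == limited_method and key in params and params[key] > maximum:
            violations.append(f"{method}.{key}={params[key]} exceeds limit {maximum}")
    return violations
```

An execute button or MCP tool would refuse to run whenever the returned list is non-empty.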

Add Training

Training should be added only after local inference works.

Recommended order:

Export field notes to JSONL.
Load JSONL as a dataset.
Add PEFT/TRL LoRA fine-tuning for the text model.
Add Trackio logging.
Add checkpoint output folder.
Add README instructions.
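The second step, loading the exported JSONL, might start as simply as the sketch below, which turns field notes into prompt/completion pairs using only the standard library. Preferring the correction over the raw response is an assumption about how the notes should be used, and `load_training_pairs` is a hypothetical name; check which record layout your trainer expects before relying on this one.

```python
import json


def load_training_pairs(jsonl_text: str) -> list[dict]:
    """Turn field-notes JSONL into prompt/completion records for fine-tuning."""
    pairs = []
    for line in jsonl_text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        note = json.loads(line)
        # Assumption: a non-empty correction is a better target than the response.
        target = note.get("correction") or note.get("response", "")
        if note.get("prompt") and target:
            pairs.append({"prompt": note["prompt"], "completion": target})
    return pairs
```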

Add Hugging Face Space Deployment

After the local app runs:

.venv\Scripts\python.exe scripts\plan_hf_space.py --user <hf-user-or-org>
huggingface-cli login
huggingface-cli repo create openbmb-local-ai-workbench --type space --space-sdk gradio
git remote add space https://huggingface.co/spaces/<hf-user-or-org>/openbmb-local-ai-workbench
git push space main

Never commit Hugging Face tokens.