ObjectverseDiary / docs /DEVELOPMENT_STATUS.md
qqyule's picture
Deploy live MiniCPM-V vision defaults
0cadcec verified

A newer version of the Gradio SDK is available: 6.17.3

Upgrade

Development Status

Last updated: 2026-06-09

Completed

  • Project skeleton, README, AGENTS instructions, and Gradio app entrypoint.
  • Mock MVP flow: upload/description, personality mode, object JSON, persona JSON, diary, object chat, share card, and trace saving.
  • Archive-style Gradio UI with English-first / Chinese-second copy and six stable examples.
  • Stable demo baseline:
    • example buttons replay committed sample traces before falling back to live generation
    • cached and live generation share the same UI output formatter
    • manual upload/description path still saves new runtime traces
  • Trace and dataset tooling:
    • six public mock sample traces
    • public trace JSONL export
    • deterministic SFT preview JSONL
    • initial-stage acceptance script
  • Hugging Face Space created: https://huggingface.co/spaces/build-small-hackathon/ObjectverseDiary
  • MiniCPM-V 2.6 optional vision backend wiring with mock fallback.
  • Optional llama.cpp / llama-cpp-python text runtime wiring through TEXT_MODEL_PATH, with mock fallback.
  • Space VLM validation tooling:
    • scripts/check_space_vlm.py
    • hidden /vision_runtime_probe API for non-secret MiniCPM-V diagnostics
    • probe output support in Space VLM markdown and JSON reports
    • failure-note updater for the latest Space VLM failure summary
    • failed L4 validation report at docs/SPACE_VLM_REPORT.md
    • optional --trace-output-dir evidence export for validation traces
  • Hosted MiniCPM-V 2.6 ZeroGPU validation passed on 2026-06-08 for public mug, keyboard, and shoe images after adding an HF_TOKEN Space secret for gated model access.
  • Public Space live vision target:
    • hardware: zero-a10g
    • OBJECTVERSE_VISION_BACKEND=minicpm-v
    • VISION_MODEL_ID=openbmb/MiniCPM-V-2_6
    • OBJECTVERSE_TEXT_BACKEND=mock
    • text generation intentionally remains mock for this release
  • Real VLM trace evidence from the hosted validation is available under data/traces/space-vlm/; text generation remained mock by design.
  • ZeroGPU compatibility:
    • optional src/utils/zero_gpu.py
    • Gradio generation callback wrapped with @zero_gpu(duration=180)
    • hidden /zero_gpu_probe API confirms ZeroGPU CUDA availability when run through direct hf.space URL
  • Stable submission materials:
    • Field Notes draft
    • demo video script
    • social post draft
    • stable submission guide
  • Well-Tuned evidence:
  • LoRA v2 GGUF runtime evidence:
  • Local tests and initial acceptance currently pass.

Not Completed

  • Hosted Space text runtime validation with the published GGUF. The public Space still uses mock text until this passes.
  • Real text model traces from the hosted non-mock text runtime.
  • Published Field Notes URL, recorded demo video URL, social post URL, and final public submission.

Current Runtime Defaults

Local development:

  • OBJECTVERSE_VISION_BACKEND=mock
  • OBJECTVERSE_TEXT_BACKEND=mock

Live Space:

  • OBJECTVERSE_VISION_BACKEND=minicpm-v

  • VISION_MODEL_ID=openbmb/MiniCPM-V-2_6

  • OBJECTVERSE_TEXT_BACKEND=mock

  • hardware: zero-a10g

  • No commercial model API is used.

  • GGUF files, tokens, credentials, and private images should not be committed.

Next Recommended Gate

For live vision, keep the Space on ZeroGPU with MiniCPM-V and rerun hosted validation after dependency, Space variable, or deploy changes.

Next text model gate:

Download or mount the published GGUF on the target runtime, set:

OBJECTVERSE_TEXT_BACKEND=llama-cpp
TEXT_MODEL_PATH=/absolute/path/to/objectverse-diary-qwen15b-lora-v2-q4_k_m.gguf

Then rerun the local or Space smoke path before claiming live text runtime.

Live VLM validation gate:

.venv/bin/python -B scripts/check_space_vlm.py \
  --configure-space \
  --hardware zero-a10g \
  --space-url https://huggingface.co/spaces/build-small-hackathon/ObjectverseDiary \
  --output docs/SPACE_VLM_REPORT.md \
  --json-output docs/SPACE_VLM_REPORT.json \
  --trace-output-dir data/traces/space-vlm \
  --timeout-seconds 1200

If rollback is needed, use:

.venv/bin/python -B scripts/check_space_vlm.py \
  --space-url https://huggingface.co/spaces/build-small-hackathon/ObjectverseDiary \
  --configure-space \
  --hardware zero-a10g \
  --skip-validation \
  --rollback-to-mock