Spaces:

build-small-hackathon
/

ObjectverseDiary

Running on Zero

App Files Files Community

ObjectverseDiary / docs /DEVELOPMENT_STATUS.md

qqyule

Deploy live MiniCPM-V vision defaults

0cadcec verified 3 days ago

preview code

raw

history blame contribute delete

5.09 kB

	# Development Status

	Last updated: 2026-06-09

	## Completed

	- Project skeleton, README, AGENTS instructions, and Gradio app entrypoint.
	- Mock MVP flow: upload/description, personality mode, object JSON, persona JSON, diary, object chat, share card, and trace saving.
	- Archive-style Gradio UI with English-first / Chinese-second copy and six stable examples.
	- Stable demo baseline:
	- example buttons replay committed sample traces before falling back to live generation
	- cached and live generation share the same UI output formatter
	- manual upload/description path still saves new runtime traces
	- Trace and dataset tooling:
	- six public mock sample traces
	- public trace JSONL export
	- deterministic SFT preview JSONL
	- initial-stage acceptance script
	- Hugging Face Space created: https://huggingface.co/spaces/build-small-hackathon/ObjectverseDiary
	- MiniCPM-V 2.6 optional vision backend wiring with mock fallback.
	- Optional llama.cpp / llama-cpp-python text runtime wiring through `TEXT_MODEL_PATH`, with mock fallback.
	- Space VLM validation tooling:
	- `scripts/check_space_vlm.py`
	- hidden `/vision_runtime_probe` API for non-secret MiniCPM-V diagnostics
	- probe output support in Space VLM markdown and JSON reports
	- failure-note updater for the latest Space VLM failure summary
	- failed L4 validation report at `docs/SPACE_VLM_REPORT.md`
	- optional `--trace-output-dir` evidence export for validation traces
	- Hosted MiniCPM-V 2.6 ZeroGPU validation passed on 2026-06-08 for public mug, keyboard, and shoe images after adding an `HF_TOKEN` Space secret for gated model access.
	- Public Space live vision target:
	- hardware: `zero-a10g`
	- `OBJECTVERSE_VISION_BACKEND=minicpm-v`
	- `VISION_MODEL_ID=openbmb/MiniCPM-V-2_6`
	- `OBJECTVERSE_TEXT_BACKEND=mock`
	- text generation intentionally remains mock for this release
	- Real VLM trace evidence from the hosted validation is available under `data/traces/space-vlm/`; text generation remained mock by design.
	- ZeroGPU compatibility:
	- optional `src/utils/zero_gpu.py`
	- Gradio generation callback wrapped with `@zero_gpu(duration=180)`
	- hidden `/zero_gpu_probe` API confirms ZeroGPU CUDA availability when run through direct `hf.space` URL
	- Stable submission materials:
	- Field Notes draft
	- demo video script
	- social post draft
	- stable submission guide
	- Well-Tuned evidence:
	- 200-row synthetic curated v2 SFT dataset published at https://huggingface.co/datasets/qqyule/objectverse-diary-sft-curated
	- Modal Qwen 1.5B LoRA v2 run completed with 120 steps, 180 train rows, and 20 eval rows
	- LoRA v2 adapter published at https://huggingface.co/qqyule/objectverse-diary-qwen15b-lora
	- LoRA v2 GGUF runtime evidence:
	- `scripts/check_llama_cpp_smoke.py`
	- adapter merged into `Qwen/Qwen2.5-1.5B-Instruct`
	- pinned `llama.cpp` commit: `8f83d6c271d194bde2d410145a0ce73bc42e85cd`
	- published Q4_K_M GGUF: https://huggingface.co/qqyule/objectverse-diary-qwen15b-lora/blob/main/objectverse-diary-qwen15b-lora-v2-q4_k_m.gguf
	- local smoke passed with `llama-cpp text generation`, schema-valid persona/diary, non-empty chat, and no `text-fallback-to-mock`
	- trace runtime no longer records literal `TEXT_MODEL_PATH`
	- Local tests and initial acceptance currently pass.

	## Not Completed

	- Hosted Space text runtime validation with the published GGUF. The public Space still uses mock text until this passes.
	- Real text model traces from the hosted non-mock text runtime.
	- Published Field Notes URL, recorded demo video URL, social post URL, and final public submission.

	## Current Runtime Defaults

	Local development:

	- `OBJECTVERSE_VISION_BACKEND=mock`
	- `OBJECTVERSE_TEXT_BACKEND=mock`

	Live Space:

	- `OBJECTVERSE_VISION_BACKEND=minicpm-v`
	- `VISION_MODEL_ID=openbmb/MiniCPM-V-2_6`
	- `OBJECTVERSE_TEXT_BACKEND=mock`
	- hardware: `zero-a10g`

	- No commercial model API is used.
	- GGUF files, tokens, credentials, and private images should not be committed.

	## Next Recommended Gate

	For live vision, keep the Space on ZeroGPU with MiniCPM-V and rerun hosted validation after dependency, Space variable, or deploy changes.

	Next text model gate:

	Download or mount the published GGUF on the target runtime, set:

	```bash
	OBJECTVERSE_TEXT_BACKEND=llama-cpp
	TEXT_MODEL_PATH=/absolute/path/to/objectverse-diary-qwen15b-lora-v2-q4_k_m.gguf
	```

	Then rerun the local or Space smoke path before claiming live text runtime.

	Live VLM validation gate:

	```bash
	.venv/bin/python -B scripts/check_space_vlm.py \
	--configure-space \
	--hardware zero-a10g \
	--space-url https://huggingface.co/spaces/build-small-hackathon/ObjectverseDiary \
	--output docs/SPACE_VLM_REPORT.md \
	--json-output docs/SPACE_VLM_REPORT.json \
	--trace-output-dir data/traces/space-vlm \
	--timeout-seconds 1200
	```

	If rollback is needed, use:

	```bash
	.venv/bin/python -B scripts/check_space_vlm.py \
	--space-url https://huggingface.co/spaces/build-small-hackathon/ObjectverseDiary \
	--configure-space \
	--hardware zero-a10g \
	--skip-validation \
	--rollback-to-mock
	```