ObjectverseDiary / docs /FINAL_VERIFICATION_REPORT.md
qqyule's picture
Deploy latest Objectverse Diary from fa09aac
dd6cefc verified

A newer version of the Gradio SDK is available: 6.18.0

Upgrade

Final Verification Report

  • Generated at: 2026-06-08 16:24:23 CST
  • Verified source commit: uncommitted local implementation on main
  • Branch: main
  • Verification target: mock-safe submission baseline plus local diagnostics/smoke-helper implementation
  • Local app URL: not launched during this verification update

Summary

Objectverse Diary's stable mock-safe baseline remains locally verifiable. This update adds non-secret MiniCPM-V runtime diagnostics through a hidden Gradio API, probe-aware Space VLM reporting, a latest-failure-note updater, and local llama.cpp GGUF smoke-test support. A later local run merged the LoRA v2 adapter, produced a Q4_K_M GGUF, uploaded it to the model repo, and passed local llama.cpp smoke.

This report does not claim live Space LoRA/GGUF runtime wiring, Field Notes publication, demo video publication, social post publication, or final public submission URLs are complete.

Implementation Additions

  • Hidden /vision_runtime_probe Gradio API returns sanitized backend, dependency, GPU, and MiniCPM-V load diagnostics.
  • scripts/check_space_vlm.py can include probe output in markdown/JSON reports and update the latest failure section in docs/FAILURES.md.
  • scripts/check_llama_cpp_smoke.py validates persona, diary, and chat through an externally configured GGUF without committing model files.
  • LoRA v2 GGUF tooling now covers merge, publish, and local smoke for objectverse-diary-qwen15b-lora-v2-q4_k_m.gguf.
  • Runtime status no longer records literal TEXT_MODEL_PATH; traces only record whether an external GGUF path is configured.
  • Submission docs now distinguish final-draft materials from published URLs.

Command Verification

Check Result Notes
.venv/bin/python -B -m unittest discover -s tests PASS 46 tests passed. Gradio 6.0 deprecation warnings and an asyncio ResourceWarning remain non-blocking.
.venv/bin/python -B scripts/check_initial_stage.py PASS Required files, runtime defaults, trace generation, sample traces, dataset preview, trace export, and Gradio build all passed.
.venv/bin/python -B scripts/export_traces.py PASS Exported 6 traces to data/traces/samples/objectverse_public_mock_traces.jsonl.
git diff --check PASS No whitespace errors.

Browser Verification

Not re-run in this verification update. The previous stable baseline browser verification remains useful evidence for the mock-safe UI, but the new hidden /vision_runtime_probe API was verified through unit coverage rather than a browser session.

Previous local app command:

GRADIO_SERVER_NAME=127.0.0.1 GRADIO_SERVER_PORT=7860 .venv/bin/python app.py

Browser checks:

Scenario Result Evidence
App loads at http://127.0.0.1:7860/ PASS Page rendered after Gradio load state.
English-first / Chinese-second UI appears PASS Title, subtitle, section headings, and helper text visible.
Six example buttons visible PASS OVD-001 through OVD-006 visible in the Example Objects section.
Coffee mug example PASS Object file, Secret Diary, Share Card, and trace content appeared.
Mechanical keyboard example PASS Object file, Secret Diary, Share Card, and trace content appeared.
Running shoe example PASS Object file, Secret Diary, Share Card, trace content, and saved sample path appeared.
Desk lamp example PASS Expected object term, Secret Diary, Share Card, and trace saved state appeared.
Water bottle example PASS Expected object term, Secret Diary, Share Card, and trace saved state appeared.
Notebook example PASS Expected object term, Secret Diary, Share Card, and trace saved state appeared.
Object chat PASS Message What did you see today? returned a persona-consistent Shoe Afterlight reply.
Browser console PASS No warning or error logs observed during local verification.

Trace Verification

  • Six stable public mock sample traces remain under data/traces/samples/.
  • The trace export JSONL was regenerated successfully.
  • Hosted Space VLM traces under data/traces/space-vlm/ now provide passing MiniCPM-V object-understanding evidence for mug, keyboard, and shoe. Text generation in those traces remains mock by design.
  • New runtime traces do not include literal TEXT_MODEL_PATH values.

Security Scan

Targeted safety coverage now includes unit tests and an rg scan for probe/report/trace outputs that reject or redact:

  • hf_
  • HF_TOKEN
  • HUGGINGFACE_TOKEN
  • .env

Result: PASS for the targeted diagnostic/report paths and repository scan.

Known safe hits:

  • scripts/check_space_vlm.py sensitive marker constants and auth helper names
  • tests intentionally containing fake hf_forbidden and .env strings to verify redaction
  • publish_hf_adapter filenames/imports that match the broad hf_ scan pattern but are not tokens

No GGUF file, real token, private key, credential, or .env file was added by this implementation.

Remaining External Items

  • GitHub push is not performed in this verification run.
  • Hugging Face Space hardware or environment variables are not changed in this verification run.
  • Demo video URL is still pending recording/publication.
  • Field Notes URL is still pending publication.
  • Social post URL is still pending publication.
  • Hosted Space text runtime validation with the published GGUF remains pending.
  • Live Space runtime wiring for the published LoRA/GGUF remains future work.

Verdict

PASS for the stable mock-safe local submission baseline plus local diagnostics/smoke-helper implementation.

The project is ready for explicit-confirmation external steps: push main, sync the Space, rerun probe-aware Space VLM validation if needed, validate the published GGUF in the Space runtime before claiming live text generation, record/publish the demo video, publish Field Notes/social post, and fill final submission URLs.