Spaces:
Running on Zero
Running on Zero
| # Final Verification Report | |
| - Generated at: 2026-06-08 16:24:23 CST | |
| - Verified source commit: uncommitted local implementation on `main` | |
| - Branch: `main` | |
| - Verification target: mock-safe submission baseline plus local diagnostics/smoke-helper implementation | |
| - Local app URL: not launched during this verification update | |
| ## Summary | |
| Objectverse Diary's stable mock-safe baseline remains locally verifiable. This update adds non-secret MiniCPM-V runtime diagnostics through a hidden Gradio API, probe-aware Space VLM reporting, a latest-failure-note updater, and local llama.cpp GGUF smoke-test support. A later local run merged the LoRA v2 adapter, produced a Q4_K_M GGUF, uploaded it to the model repo, and passed local llama.cpp smoke. | |
| This report does not claim live Space LoRA/GGUF runtime wiring, Field Notes publication, demo video publication, social post publication, or final public submission URLs are complete. | |
| ## Implementation Additions | |
| - Hidden `/vision_runtime_probe` Gradio API returns sanitized backend, dependency, GPU, and MiniCPM-V load diagnostics. | |
| - `scripts/check_space_vlm.py` can include probe output in markdown/JSON reports and update the latest failure section in `docs/FAILURES.md`. | |
| - `scripts/check_llama_cpp_smoke.py` validates persona, diary, and chat through an externally configured GGUF without committing model files. | |
| - LoRA v2 GGUF tooling now covers merge, publish, and local smoke for `objectverse-diary-qwen15b-lora-v2-q4_k_m.gguf`. | |
| - Runtime status no longer records literal `TEXT_MODEL_PATH`; traces only record whether an external GGUF path is configured. | |
| - Submission docs now distinguish final-draft materials from published URLs. | |
| ## Command Verification | |
| | Check | Result | Notes | | |
| | --- | --- | --- | | |
| | `.venv/bin/python -B -m unittest discover -s tests` | PASS | 46 tests passed. Gradio 6.0 deprecation warnings and an asyncio ResourceWarning remain non-blocking. | | |
| | `.venv/bin/python -B scripts/check_initial_stage.py` | PASS | Required files, runtime defaults, trace generation, sample traces, dataset preview, trace export, and Gradio build all passed. | | |
| | `.venv/bin/python -B scripts/export_traces.py` | PASS | Exported 6 traces to `data/traces/samples/objectverse_public_mock_traces.jsonl`. | | |
| | `git diff --check` | PASS | No whitespace errors. | | |
| ## Browser Verification | |
| Not re-run in this verification update. The previous stable baseline browser verification remains useful evidence for the mock-safe UI, but the new hidden `/vision_runtime_probe` API was verified through unit coverage rather than a browser session. | |
| Previous local app command: | |
| ```bash | |
| GRADIO_SERVER_NAME=127.0.0.1 GRADIO_SERVER_PORT=7860 .venv/bin/python app.py | |
| ``` | |
| Browser checks: | |
| | Scenario | Result | Evidence | | |
| | --- | --- | --- | | |
| | App loads at `http://127.0.0.1:7860/` | PASS | Page rendered after Gradio load state. | | |
| | English-first / Chinese-second UI appears | PASS | Title, subtitle, section headings, and helper text visible. | | |
| | Six example buttons visible | PASS | OVD-001 through OVD-006 visible in the Example Objects section. | | |
| | Coffee mug example | PASS | Object file, Secret Diary, Share Card, and trace content appeared. | | |
| | Mechanical keyboard example | PASS | Object file, Secret Diary, Share Card, and trace content appeared. | | |
| | Running shoe example | PASS | Object file, Secret Diary, Share Card, trace content, and saved sample path appeared. | | |
| | Desk lamp example | PASS | Expected object term, Secret Diary, Share Card, and trace saved state appeared. | | |
| | Water bottle example | PASS | Expected object term, Secret Diary, Share Card, and trace saved state appeared. | | |
| | Notebook example | PASS | Expected object term, Secret Diary, Share Card, and trace saved state appeared. | | |
| | Object chat | PASS | Message `What did you see today?` returned a persona-consistent `Shoe Afterlight` reply. | | |
| | Browser console | PASS | No warning or error logs observed during local verification. | | |
| ## Trace Verification | |
| - Six stable public mock sample traces remain under `data/traces/samples/`. | |
| - The trace export JSONL was regenerated successfully. | |
| - Hosted Space VLM traces under `data/traces/space-vlm/` now provide passing MiniCPM-V object-understanding evidence for mug, keyboard, and shoe. Text generation in those traces remains mock by design. | |
| - New runtime traces do not include literal `TEXT_MODEL_PATH` values. | |
| ## Security Scan | |
| Targeted safety coverage now includes unit tests and an `rg` scan for probe/report/trace outputs that reject or redact: | |
| - `hf_` | |
| - `HF_TOKEN` | |
| - `HUGGINGFACE_TOKEN` | |
| - `.env` | |
| Result: PASS for the targeted diagnostic/report paths and repository scan. | |
| Known safe hits: | |
| - `scripts/check_space_vlm.py` sensitive marker constants and auth helper names | |
| - tests intentionally containing fake `hf_forbidden` and `.env` strings to verify redaction | |
| - `publish_hf_adapter` filenames/imports that match the broad `hf_` scan pattern but are not tokens | |
| No GGUF file, real token, private key, credential, or `.env` file was added by this implementation. | |
| ## Remaining External Items | |
| - GitHub push is not performed in this verification run. | |
| - Hugging Face Space hardware or environment variables are not changed in this verification run. | |
| - Demo video URL is still pending recording/publication. | |
| - Field Notes URL is still pending publication. | |
| - Social post URL is still pending publication. | |
| - Hosted Space text runtime validation with the published GGUF remains pending. | |
| - Live Space runtime wiring for the published LoRA/GGUF remains future work. | |
| ## Verdict | |
| PASS for the stable mock-safe local submission baseline plus local diagnostics/smoke-helper implementation. | |
| The project is ready for explicit-confirmation external steps: push `main`, sync the Space, rerun probe-aware Space VLM validation if needed, validate the published GGUF in the Space runtime before claiming live text generation, record/publish the demo video, publish Field Notes/social post, and fill final submission URLs. | |