File size: 5,976 Bytes
1e2c036
 
d30bd8e
 
1e2c036
d30bd8e
 
1e2c036
 
 
dd6cefc
1e2c036
dd6cefc
d30bd8e
 
 
 
 
 
dd6cefc
d30bd8e
 
1e2c036
 
 
 
 
d30bd8e
1e2c036
 
 
 
 
 
d30bd8e
 
 
1e2c036
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
4a4024d
d30bd8e
1e2c036
 
 
d30bd8e
1e2c036
 
 
 
 
 
d30bd8e
1e2c036
 
 
 
d30bd8e
 
1e2c036
d30bd8e
1e2c036
 
 
 
 
 
 
 
dd6cefc
 
1e2c036
 
 
d30bd8e
1e2c036
dd6cefc
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
# Final Verification Report

- Generated at: 2026-06-08 16:24:23 CST
- Verified source commit: uncommitted local implementation on `main`
- Branch: `main`
- Verification target: mock-safe submission baseline plus local diagnostics/smoke-helper implementation
- Local app URL: not launched during this verification update

## Summary

Objectverse Diary's stable mock-safe baseline remains locally verifiable. This update adds non-secret MiniCPM-V runtime diagnostics through a hidden Gradio API, probe-aware Space VLM reporting, a latest-failure-note updater, and local llama.cpp GGUF smoke-test support. A later local run merged the LoRA v2 adapter, produced a Q4_K_M GGUF, uploaded it to the model repo, and passed local llama.cpp smoke.

This report does not claim live Space LoRA/GGUF runtime wiring, Field Notes publication, demo video publication, social post publication, or final public submission URLs are complete.

## Implementation Additions

- Hidden `/vision_runtime_probe` Gradio API returns sanitized backend, dependency, GPU, and MiniCPM-V load diagnostics.
- `scripts/check_space_vlm.py` can include probe output in markdown/JSON reports and update the latest failure section in `docs/FAILURES.md`.
- `scripts/check_llama_cpp_smoke.py` validates persona, diary, and chat through an externally configured GGUF without committing model files.
- LoRA v2 GGUF tooling now covers merge, publish, and local smoke for `objectverse-diary-qwen15b-lora-v2-q4_k_m.gguf`.
- Runtime status no longer records literal `TEXT_MODEL_PATH`; traces only record whether an external GGUF path is configured.
- Submission docs now distinguish final-draft materials from published URLs.

## Command Verification

| Check | Result | Notes |
| --- | --- | --- |
| `.venv/bin/python -B -m unittest discover -s tests` | PASS | 46 tests passed. Gradio 6.0 deprecation warnings and an asyncio ResourceWarning remain non-blocking. |
| `.venv/bin/python -B scripts/check_initial_stage.py` | PASS | Required files, runtime defaults, trace generation, sample traces, dataset preview, trace export, and Gradio build all passed. |
| `.venv/bin/python -B scripts/export_traces.py` | PASS | Exported 6 traces to `data/traces/samples/objectverse_public_mock_traces.jsonl`. |
| `git diff --check` | PASS | No whitespace errors. |

## Browser Verification

Not re-run in this verification update. The previous stable baseline browser verification remains useful evidence for the mock-safe UI, but the new hidden `/vision_runtime_probe` API was verified through unit coverage rather than a browser session.

Previous local app command:

```bash
GRADIO_SERVER_NAME=127.0.0.1 GRADIO_SERVER_PORT=7860 .venv/bin/python app.py
```

Browser checks:

| Scenario | Result | Evidence |
| --- | --- | --- |
| App loads at `http://127.0.0.1:7860/` | PASS | Page rendered after Gradio load state. |
| English-first / Chinese-second UI appears | PASS | Title, subtitle, section headings, and helper text visible. |
| Six example buttons visible | PASS | OVD-001 through OVD-006 visible in the Example Objects section. |
| Coffee mug example | PASS | Object file, Secret Diary, Share Card, and trace content appeared. |
| Mechanical keyboard example | PASS | Object file, Secret Diary, Share Card, and trace content appeared. |
| Running shoe example | PASS | Object file, Secret Diary, Share Card, trace content, and saved sample path appeared. |
| Desk lamp example | PASS | Expected object term, Secret Diary, Share Card, and trace saved state appeared. |
| Water bottle example | PASS | Expected object term, Secret Diary, Share Card, and trace saved state appeared. |
| Notebook example | PASS | Expected object term, Secret Diary, Share Card, and trace saved state appeared. |
| Object chat | PASS | Message `What did you see today?` returned a persona-consistent `Shoe Afterlight` reply. |
| Browser console | PASS | No warning or error logs observed during local verification. |

## Trace Verification

- Six stable public mock sample traces remain under `data/traces/samples/`.
- The trace export JSONL was regenerated successfully.
- Hosted Space VLM traces under `data/traces/space-vlm/` now provide passing MiniCPM-V object-understanding evidence for mug, keyboard, and shoe. Text generation in those traces remains mock by design.
- New runtime traces do not include literal `TEXT_MODEL_PATH` values.

## Security Scan

Targeted safety coverage now includes unit tests and an `rg` scan for probe/report/trace outputs that reject or redact:

- `hf_`
- `HF_TOKEN`
- `HUGGINGFACE_TOKEN`
- `.env`

Result: PASS for the targeted diagnostic/report paths and repository scan.

Known safe hits:

- `scripts/check_space_vlm.py` sensitive marker constants and auth helper names
- tests intentionally containing fake `hf_forbidden` and `.env` strings to verify redaction
- `publish_hf_adapter` filenames/imports that match the broad `hf_` scan pattern but are not tokens

No GGUF file, real token, private key, credential, or `.env` file was added by this implementation.

## Remaining External Items

- GitHub push is not performed in this verification run.
- Hugging Face Space hardware or environment variables are not changed in this verification run.
- Demo video URL is still pending recording/publication.
- Field Notes URL is still pending publication.
- Social post URL is still pending publication.
- Hosted Space text runtime validation with the published GGUF remains pending.
- Live Space runtime wiring for the published LoRA/GGUF remains future work.

## Verdict

PASS for the stable mock-safe local submission baseline plus local diagnostics/smoke-helper implementation.

The project is ready for explicit-confirmation external steps: push `main`, sync the Space, rerun probe-aware Space VLM validation if needed, validate the published GGUF in the Space runtime before claiming live text generation, record/publish the demo video, publish Field Notes/social post, and fill final submission URLs.