# Submission Guide

## Required Package

- [x] Hugging Face Space URL: https://huggingface.co/spaces/build-small-hackathon/ObjectverseDiary
- [x] GitHub Repository URL: https://github.com/qqyule/Objectverse-Diary
- [x] Demo Video Script: `docs/DEMO_VIDEO_SCRIPT.md`
- [x] Social Media Post Draft: `docs/SOCIAL_POST.md`
- [x] Fine-tuned Model URL: https://huggingface.co/qqyule/objectverse-diary-qwen15b-lora
- [x] Dataset URL: https://huggingface.co/datasets/qqyule/objectverse-diary-sft-curated
- [x] Trace Dataset: local public mock JSONL export at `data/traces/samples/objectverse_public_mock_traces.jsonl`
- [x] Field Notes Draft: `docs/FIELD_NOTES.md`
- [x] Short project description: available in README

## Local Evidence Ready

- Initial mock MVP report: `docs/INITIAL_STAGE_REPORT.md`
- Runtime boundary: `docs/RUNTIME.md`
- Dataset plan and preview workflow: `docs/DATASET.md`
- External setup checklist: `docs/EXTERNAL_SETUP.md`
- Space VLM validation report: `docs/SPACE_VLM_REPORT.md`. Validation currently passes for the public mug, keyboard, and shoe images on ZeroGPU with `OBJECTVERSE_VISION_BACKEND=minicpm-v`.
- Space VLM diagnostics: hidden `/vision_runtime_probe` API confirms Torch/Transformers, CUDA, and MiniCPM-V model load status.
- Live Space runtime: ZeroGPU `zero-a10g` with `OBJECTVERSE_VISION_BACKEND=minicpm-v`, `VISION_MODEL_ID=openbmb/MiniCPM-V-2_6`, and `OBJECTVERSE_TEXT_BACKEND=mock`.
- Space VLM trace evidence: `data/traces/space-vlm/`
- Public mock traces: `data/traces/samples/`
- Stable demo baseline: Gradio example buttons replay committed sample traces first, then fall back to the live generation pipeline if a cached trace is missing.
- Optional llama.cpp runtime wiring: `src/models/llama_cpp_runner.py`
- Published LoRA v2 Q4_K_M GGUF: https://huggingface.co/qqyule/objectverse-diary-qwen15b-lora/blob/main/objectverse-diary-qwen15b-lora-v2-q4_k_m.gguf
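
The cache-first behaviour described for the stable demo baseline can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function name `load_or_generate`, the one-JSON-file-per-example layout, and the `generate` callback are all hypothetical.

```python
import json
from pathlib import Path
from typing import Callable, Tuple


def load_or_generate(
    example_id: str,
    trace_dir: Path,
    generate: Callable[[str], dict],
) -> Tuple[dict, str]:
    """Return (trace, source), preferring a committed sample trace.

    Assumption: each cached trace lives at <trace_dir>/<example_id>.json.
    The real repository may store traces differently (for example as JSONL).
    """
    cached = trace_dir / f"{example_id}.json"
    if cached.is_file():
        with cached.open(encoding="utf-8") as fh:
            return json.load(fh), "cached"
    # No committed trace for this example: fall back to the live pipeline.
    return generate(example_id), "live"
```

Returning the source label alongside the trace lets the UI show whether an output was replayed or generated live, which keeps the cached and live paths on the same formatting code.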

## Completed Locally

- Mock MVP flow, archive-style UI, share card, trace logging, sample traces, dataset preview, and initial acceptance tooling.
- Stable local demo baseline with six replayable example outputs, shared cached/live UI formatting, chat wake state, share card, and trace panel output.
- MiniCPM-V 2.6 backend wiring with fallback markers.
- Optional llama.cpp text runtime wiring through `TEXT_MODEL_PATH`.
- Hosted Space VLM validation script, report, JSON summary, and trace evidence export.
- Hosted Space VLM probe support and support for updating the latest failure note. MiniCPM-V validation on ZeroGPU passes after adding an `HF_TOKEN` Space secret for gated model access.
- Local GGUF smoke-test helper passed with `models/objectverse-diary-qwen15b-lora-v2-q4_k_m.gguf`. The trace reported a text runtime of `llama-cpp text generation` and contained no `text-fallback-to-mock` marker.
- Synthetic curated v2 SFT dataset published to Hugging Face Datasets: 200 rows, 40 objects, 5 personality modes.
- Modal Qwen 1.5B LoRA v2 run completed and adapter published to Hugging Face Models.
- LoRA v2 adapter merged into `Qwen/Qwen2.5-1.5B-Instruct`, converted with pinned `llama.cpp`, quantized to Q4_K_M, and uploaded to the same model repo.
- Field Notes draft, demo video script, and social post draft for the stable submission package.
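
A check like the smoke test's "no `text-fallback-to-mock`" condition could be run over a JSONL trace export along these lines. This is a sketch under assumptions: the helper name `find_fallback_lines` is hypothetical, and because the trace schema is not specified here, it searches each whole record for the marker string rather than reading a particular field.

```python
import json
from pathlib import Path
from typing import List

FALLBACK_MARKER = "text-fallback-to-mock"


def find_fallback_lines(jsonl_path: Path, marker: str = FALLBACK_MARKER) -> List[int]:
    """Return 1-based line numbers of trace records that mention the marker."""
    hits: List[int] = []
    with jsonl_path.open(encoding="utf-8") as fh:
        for lineno, line in enumerate(fh, start=1):
            line = line.strip()
            if not line:
                continue  # tolerate blank lines in the export
            record = json.loads(line)  # raises ValueError on malformed JSON
            # Schema-agnostic check: look for the marker anywhere in the record.
            if marker in json.dumps(record):
                hits.append(lineno)
    return hits
```

An empty result means no record fell back to mock text. A stricter check would read the specific trace field that stores fallback markers once its name is confirmed.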

## Not Completed Yet

- Hosted Space text runtime validation with the published GGUF. The local runtime passed, but the public Space intentionally remains on mock text for the live MiniCPM-V vision release.
- Real text-model traces from the hosted Space.
- Field Notes publication URL, recorded demo video URL, social post URL, and final public submission.

## Final Checks

- [x] Space is under the official organization.
- [x] Space MiniCPM-V validation passes for mug, keyboard, and shoe.
- [x] Space is configured for live MiniCPM-V vision on ZeroGPU with mock text.
- [x] Space MiniCPM-V non-secret diagnostic probe is implemented locally.
- [x] Demo video script targets under 2 minutes.
- [x] README includes stable-baseline parameter budget and links to the model card.
- [x] No commercial cloud AI APIs are used.
- [x] Mock-safe local demo baseline is reproducible from committed sample traces.
- [x] Fine-tuned model is linked.
- [x] Dataset is linked.
- [x] Traces are linked.
- [ ] Field Notes are linked.
- [x] UI remains English-first and Chinese-second.
- [ ] Submission is complete before June 15, 2026.
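
The "live MiniCPM-V vision with mock text" configuration check could be automated with a small comparison like the one below. The variable names and values are the ones listed under Local Evidence Ready; the helper `env_mismatches` itself is a hypothetical sketch, not part of the repository.

```python
import os
from typing import Dict, Mapping, Optional, Tuple

# Expected Space settings, taken from the runtime description in this guide.
EXPECTED_SPACE_ENV: Dict[str, str] = {
    "OBJECTVERSE_VISION_BACKEND": "minicpm-v",
    "VISION_MODEL_ID": "openbmb/MiniCPM-V-2_6",
    "OBJECTVERSE_TEXT_BACKEND": "mock",
}


def env_mismatches(
    env: Mapping[str, str],
    expected: Mapping[str, str] = EXPECTED_SPACE_ENV,
) -> Dict[str, Tuple[str, Optional[str]]]:
    """Map each mismatched variable to (expected value, actual value or None)."""
    return {
        name: (want, env.get(name))
        for name, want in expected.items()
        if env.get(name) != want
    }


if __name__ == "__main__":
    # Prints an empty dict when the current environment matches.
    print(env_mismatches(os.environ))
```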