| # Submission Guide | |
| ## Required Package | |
| - [x] Hugging Face Space URL: https://huggingface.co/spaces/build-small-hackathon/ObjectverseDiary | |
| - [x] GitHub Repository URL: https://github.com/qqyule/Objectverse-Diary | |
| - [x] Demo Video Script: `docs/DEMO_VIDEO_SCRIPT.md` | |
| - [x] Social Media Post Draft: `docs/SOCIAL_POST.md` | |
| - [x] Fine-tuned Model URL: https://huggingface.co/qqyule/objectverse-diary-qwen15b-lora | |
| - [x] Dataset URL: https://huggingface.co/datasets/qqyule/objectverse-diary-sft-curated | |
| - [x] Trace Dataset: local public mock JSONL export at `data/traces/samples/objectverse_public_mock_traces.jsonl` | |
| - [x] Field Notes Draft: `docs/FIELD_NOTES.md` | |
| - [x] Short project description: available in README | |
| ## Local Evidence Ready | |
| - Initial mock MVP report: `docs/INITIAL_STAGE_REPORT.md` | |
| - Runtime boundary: `docs/RUNTIME.md` | |
| - Dataset plan and preview workflow: `docs/DATASET.md` | |
| - External setup checklist: `docs/EXTERNAL_SETUP.md` | |
| - Space VLM validation report: `docs/SPACE_VLM_REPORT.md`. Validation currently passes for public mug, keyboard, and shoe images on ZeroGPU with `OBJECTVERSE_VISION_BACKEND=minicpm-v`. | |
| - Space VLM diagnostics: hidden `/vision_runtime_probe` API confirms Torch/Transformers, CUDA, and MiniCPM-V model load status. | |
| - Live Space runtime: ZeroGPU `zero-a10g` with `OBJECTVERSE_VISION_BACKEND=minicpm-v`, `VISION_MODEL_ID=openbmb/MiniCPM-V-2_6`, and `OBJECTVERSE_TEXT_BACKEND=mock`. | |
| - Space VLM trace evidence: `data/traces/space-vlm/` | |
| - Public mock traces: `data/traces/samples/` | |
| - Stable demo baseline: Gradio example buttons replay committed sample traces first, then fall back to the live generation pipeline if a cached trace is missing. | |
| - Optional llama.cpp runtime wiring: `src/models/llama_cpp_runner.py` | |
| - Published LoRA v2 Q4_K_M GGUF: https://huggingface.co/qqyule/objectverse-diary-qwen15b-lora/blob/main/objectverse-diary-qwen15b-lora-v2-q4_k_m.gguf | |
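The stable demo baseline above replays a committed sample trace first and only runs the live pipeline when no cached trace exists. A minimal sketch of that lookup order follows. The helper names, the `example_id` field, and the one-JSON-object-per-line layout are assumptions made for illustration, not the project's actual schema or API.

```python
import json
from pathlib import Path
from typing import Callable, Optional


def load_cached_trace(trace_dir: Path, example_id: str) -> Optional[dict]:
    """Return the first JSONL record whose (assumed) 'example_id' field matches."""
    if not trace_dir.is_dir():
        return None
    for path in sorted(trace_dir.glob("*.jsonl")):
        with path.open(encoding="utf-8") as handle:
            for line in handle:
                line = line.strip()
                if not line:
                    continue  # skip blank lines between records
                record = json.loads(line)
                if record.get("example_id") == example_id:
                    return record
    return None


def run_example(
    example_id: str,
    trace_dir: Path,
    generate_live: Callable[[str], dict],
) -> dict:
    """Prefer a committed sample trace; call the live pipeline only on a miss."""
    cached = load_cached_trace(trace_dir, example_id)
    if cached is not None:
        return {"source": "cached", "trace": cached}
    return {"source": "live", "trace": generate_live(example_id)}
```

Tagging each result with its `source` keeps it visible whether an output was replayed or freshly generated, which helps when the same UI formatting is shared between cached and live results.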
| ## Completed Locally | |
| - Mock MVP flow, archive-style UI, share card, trace logging, sample traces, dataset preview, and initial acceptance tooling. | |
| - Stable local demo baseline with six replayable example outputs, shared cached/live UI formatting, chat wake state, share card, and trace panel output. | |
| - MiniCPM-V 2.6 backend wiring with fallback markers. | |
| - Optional llama.cpp text runtime wiring through `TEXT_MODEL_PATH`. | |
| - Hosted Space VLM validation script, report, JSON summary, and trace evidence export. | |
| - Hosted Space VLM probe support and latest-failure-note update support; MiniCPM-V ZeroGPU validation passes after adding an `HF_TOKEN` Space secret for gated model access. | |
| - Local GGUF smoke-test helper passed with `models/objectverse-diary-qwen15b-lora-v2-q4_k_m.gguf`: the trace recorded the text runtime as `llama-cpp text generation`, and no `text-fallback-to-mock` marker was present. | |
| - Synthetic curated v2 SFT dataset published to Hugging Face Datasets: 200 rows, 40 objects, 5 personality modes. | |
| - Qwen 1.5B LoRA v2 run on Modal completed, and the adapter was published to Hugging Face Models. | |
| - LoRA v2 adapter merged into `Qwen/Qwen2.5-1.5B-Instruct`, converted with pinned `llama.cpp`, quantized to Q4_K_M, and uploaded to the same model repo. | |
| - Field Notes draft, demo video script, and social post draft for the stable submission package. | |
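The GGUF smoke test above passes when the trace shows the `llama-cpp text generation` runtime and no `text-fallback-to-mock` marker. A rough sketch of that acceptance check is shown below. It scans raw JSONL lines for the two strings so that it does not depend on a trace schema, which is not documented here; the function name and the pass criteria are assumptions, not the project's actual helper.

```python
from typing import Iterable

# Strings quoted from the smoke-test description.
FALLBACK_MARKER = "text-fallback-to-mock"
EXPECTED_RUNTIME = "llama-cpp text generation"


def summarize_trace_lines(lines: Iterable[str]) -> dict:
    """Count records, fallback markers, and llama.cpp runtime labels in JSONL lines."""
    records = 0
    fallbacks = 0
    llama_cpp = 0
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # ignore blank lines
        records += 1
        if FALLBACK_MARKER in line:
            fallbacks += 1
        if EXPECTED_RUNTIME in line:
            llama_cpp += 1
    return {
        "records": records,
        "fallbacks": fallbacks,
        "llama_cpp": llama_cpp,
        # Assumed criteria: at least one record, no fallbacks,
        # and every record reports the llama.cpp runtime.
        "passed": records > 0 and fallbacks == 0 and llama_cpp == records,
    }
```

A file can be checked by passing an open handle, for example `summarize_trace_lines(open(path, encoding="utf-8"))`, where `path` points at whichever trace export is under review.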
| ## Not Completed Yet | |
| - Hosted Space text runtime validation with the published GGUF. The local runtime passed, but the public Space intentionally remains on mock text for the live MiniCPM-V vision release. | |
| - Real text-model traces from the hosted Space. | |
| - Field Notes publication URL, recorded demo video URL, social post URL, and final public submission. | |
| ## Final Checks | |
| - [x] Space is under the official organization. | |
| - [x] Space MiniCPM-V validation passes for mug, keyboard, and shoe. | |
| - [x] Space is configured for live MiniCPM-V vision on ZeroGPU with mock text. | |
| - [x] Space MiniCPM-V non-secret diagnostic probe is implemented locally. | |
| - [x] Demo video script targets under 2 minutes. | |
| - [x] README includes stable-baseline parameter budget and links to the model card. | |
| - [x] No commercial cloud AI APIs are used. | |
| - [x] Mock-safe local demo baseline is reproducible from committed sample traces. | |
| - [x] Fine-tuned model is linked. | |
| - [x] Dataset is linked. | |
| - [x] Traces are linked. | |
| - [ ] Field Notes are linked. | |
| - [x] UI remains English-first and Chinese-second. | |
| - [ ] Submission is complete before June 15, 2026. | |
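The check that the Space runs live MiniCPM-V vision with mock text could be scripted by comparing the environment against the values listed for the live Space runtime. The sketch below is one way to do that. The variable names and expected values come from this guide, while the function name and the idea of running it as a pre-submission check are assumptions.

```python
import os
from typing import Dict, Mapping, Optional, Tuple

# Expected values as listed for the live Space runtime in this guide.
EXPECTED_RUNTIME_ENV = {
    "OBJECTVERSE_VISION_BACKEND": "minicpm-v",
    "VISION_MODEL_ID": "openbmb/MiniCPM-V-2_6",
    "OBJECTVERSE_TEXT_BACKEND": "mock",
}


def find_config_mismatches(
    env: Mapping[str, str],
) -> Dict[str, Tuple[str, Optional[str]]]:
    """Map each mismatched variable to (expected, actual); actual is None if unset."""
    mismatches = {}
    for name, expected in EXPECTED_RUNTIME_ENV.items():
        actual = env.get(name)
        if actual != expected:
            mismatches[name] = (expected, actual)
    return mismatches


if __name__ == "__main__":
    problems = find_config_mismatches(os.environ)
    for name, (expected, actual) in problems.items():
        print(f"{name}: expected {expected!r}, found {actual!r}")
    print("runtime config OK" if not problems else "runtime config mismatch")
```

Taking the environment as a mapping argument rather than reading `os.environ` inside the function keeps the check easy to exercise with a plain dictionary.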