Spaces:

build-small-hackathon
/

small-functional-movement-screening

Running on Zero

Model	Params	License	GGUF	ZeroGPU	Status
YOLO26l-Pose (primary)	0.026B	AGPL-3.0	n/a	✓ (6.5ms T4)	ready
YOLO26x-Pose (HQ alt)	0.058B	AGPL-3.0	n/a	✓ (12.2ms T4)	ready
SAM 3.1 base (sam2.1_hiera_base_plus)	~0.85B	SAM License	n/a	✓	access accepted
SAM 3D Body (facebook/sam-3d-body-dinov3)	0.84B (DINOv3-H+)	SAM License	n/a	✓	INTEGRATED
Sapiens2 Pose (noahcao/sapiens-pose-coco)	~0.6B	CC-BY-NC-4.0	n/a	✓	access accepted
ST-GCN (pyskl)	~0.03B	Apache-2.0	n/a	✓	ready
Qwen3-VL-8B-Instruct	8B	Apache-2.0	✓	llama.cpp	ready
Qwen3-VL-Embedding-8B	8B	Apache-2.0	✓	llama.cpp	ready

Param Sum

~17.63B — well under 32B limit.

Primary pose: YOLO11x-Pose (fastest, well-tested)
Fallback pose: Sapiens2 (more keypoints, slower)
3D body: INTEGRATED — uses setup_sam_3d_body() from notebook.utils, outputs MHR joints
- API: estimator.process_one_image(rgb_image) — single RGB np.ndarray
- Model variants: DINOv3-H+ (840M) default, ViT-H (631M) smaller
- Temporal smoothing via EMA (alpha=0.3) to reduce single-frame jitter
- config.enable_3d=False by default; flipped when checkpoint verified on Space
VLM: Qwen3-VL-8B via llama.cpp (Judge + Classifier)
Embeddings: Qwen3-VL-Embedding-8B via llama.cpp (Retrieval)