--- license: other license_name: apple-amlr license_link: LICENSE tags: - gaussian-splatting - 3d-reconstruction - onnx - tortuise pipeline_tag: image-to-3d --- # SHARP ONNX — Apple's Single-Image 3D Gaussian Splatting ONNX export of [Apple's SHARP model](https://github.com/apple/ml-sharp) for use with [tortuise](https://github.com/buildoak/tortuise), a terminal-native 3D Gaussian Splatting viewer. ## Files | File | Size | Description | |------|------|-------------| | `sharp.onnx` | 1.9 MB | Model structure (ONNX graph) | | `sharp.onnx.data` | 2.6 GB | Model weights (external data) | Both files are required. The model exceeds protobuf's 2GB limit, so weights are stored separately. ## Usage These files are automatically downloaded by tortuise when you run: ```bash cargo install tortuise --features sharp tortuise photo.jpg ``` Or manually place both files in `~/.tortuise/models/`. ## Model Details - **Architecture:** DINOv2 ViT-Large encoder + Sliding Pyramid Network + DPT decoders - **Parameters:** 702M (340M trainable) - **Input:** Single RGB image (resized to 1536×1536 internally) - **Output:** ~1.2M 3D Gaussians (positions, scales, rotations, colors, opacities) - **ONNX opset:** 17 - **Source checkpoint:** `sharp_2572gikvuh.pt` from [apple/Sharp](https://huggingface.co/apple/Sharp) ## License The model weights are licensed under the [Apple Machine Learning Research Model License](LICENSE). This is a **research-only, non-commercial** license. See the LICENSE file for full terms. This ONNX conversion is a format transformation of Apple's original PyTorch checkpoint. No architectural modifications were made. ## Attribution Based on Apple SHARP model. Copyright (C) 2025 Apple Inc. Licensed under the Apple Machine Learning Research Model License Agreement. Paper: [SHARP: Monocular View Synthesis in Less Than a Second](https://arxiv.org/abs/2512.10685)