Upload README.md with huggingface_hub
Browse files
README.md
ADDED
|
@@ -0,0 +1,44 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# SHARP ONNX — Apple's Single-Image 3D Gaussian Splatting
|
| 2 |
+
|
| 3 |
+
ONNX export of [Apple's SHARP model](https://github.com/apple/ml-sharp) for use with [tortuise](https://github.com/buildoak/tortuise), a terminal-native 3D Gaussian Splatting viewer.
|
| 4 |
+
|
| 5 |
+
## Files
|
| 6 |
+
|
| 7 |
+
| File | Size | Description |
|
| 8 |
+
|------|------|-------------|
|
| 9 |
+
| `sharp.onnx` | 1.9 MB | Model structure (ONNX graph) |
|
| 10 |
+
| `sharp.onnx.data` | 2.6 GB | Model weights (external data) |
|
| 11 |
+
|
| 12 |
+
Both files are required. The model exceeds protobuf's 2GB limit, so weights are stored separately.
|
| 13 |
+
|
| 14 |
+
## Usage
|
| 15 |
+
|
| 16 |
+
These files are automatically downloaded by tortuise when you run:
|
| 17 |
+
|
| 18 |
+
```bash
|
| 19 |
+
cargo install tortuise --features sharp
|
| 20 |
+
tortuise photo.jpg
|
| 21 |
+
```
|
| 22 |
+
|
| 23 |
+
Or manually place both files in `~/.tortuise/models/`.
|
| 24 |
+
|
| 25 |
+
## Model Details
|
| 26 |
+
|
| 27 |
+
- **Architecture:** DINOv2 ViT-Large encoder + Sliding Pyramid Network + DPT decoders
|
| 28 |
+
- **Parameters:** 702M (340M trainable)
|
| 29 |
+
- **Input:** Single RGB image (resized to 1536×1536 internally)
|
| 30 |
+
- **Output:** ~1.2M 3D Gaussians (positions, scales, rotations, colors, opacities)
|
| 31 |
+
- **ONNX opset:** 17
|
| 32 |
+
- **Source checkpoint:** `sharp_2572gikvuh.pt` from [apple/Sharp](https://huggingface.co/apple/Sharp)
|
| 33 |
+
|
| 34 |
+
## License
|
| 35 |
+
|
| 36 |
+
The model weights are licensed under the [Apple Machine Learning Research Model License](LICENSE). This is a **research-only, non-commercial** license. See the LICENSE file for full terms.
|
| 37 |
+
|
| 38 |
+
This ONNX conversion is a format transformation of Apple's original PyTorch checkpoint. No architectural modifications were made.
|
| 39 |
+
|
| 40 |
+
## Attribution
|
| 41 |
+
|
| 42 |
+
Based on Apple SHARP model. Copyright (C) 2025 Apple Inc. Licensed under the Apple Machine Learning Research Model License Agreement.
|
| 43 |
+
|
| 44 |
+
Paper: [SHARP: Monocular View Synthesis in Less Than a Second](https://arxiv.org/abs/2512.10685)
|