File size: 1,909 Bytes
2472439
 
 
 
 
 
 
 
 
 
 
 
c792340
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
---
license: other
license_name: apple-amlr
license_link: LICENSE
tags:
  - gaussian-splatting
  - 3d-reconstruction
  - onnx
  - tortuise
pipeline_tag: image-to-3d
---

# SHARP ONNX — Apple's Single-Image 3D Gaussian Splatting

ONNX export of [Apple's SHARP model](https://github.com/apple/ml-sharp) for use with [tortuise](https://github.com/buildoak/tortuise), a terminal-native 3D Gaussian Splatting viewer.

## Files

| File | Size | Description |
|------|------|-------------|
| `sharp.onnx` | 1.9 MB | Model structure (ONNX graph) |
| `sharp.onnx.data` | 2.6 GB | Model weights (external data) |

Both files are required. The model exceeds protobuf's 2GB limit, so weights are stored separately.

## Usage

These files are automatically downloaded by tortuise when you run:

```bash
cargo install tortuise --features sharp
tortuise photo.jpg
```

Or manually place both files in `~/.tortuise/models/`.

## Model Details

- **Architecture:** DINOv2 ViT-Large encoder + Sliding Pyramid Network + DPT decoders
- **Parameters:** 702M (340M trainable)
- **Input:** Single RGB image (resized to 1536×1536 internally)
- **Output:** ~1.2M 3D Gaussians (positions, scales, rotations, colors, opacities)
- **ONNX opset:** 17
- **Source checkpoint:** `sharp_2572gikvuh.pt` from [apple/Sharp](https://huggingface.co/apple/Sharp)

## License

The model weights are licensed under the [Apple Machine Learning Research Model License](LICENSE). This is a **research-only, non-commercial** license. See the LICENSE file for full terms.

This ONNX conversion is a format transformation of Apple's original PyTorch checkpoint. No architectural modifications were made.

## Attribution

Based on Apple SHARP model. Copyright (C) 2025 Apple Inc. Licensed under the Apple Machine Learning Research Model License Agreement.

Paper: [SHARP: Monocular View Synthesis in Less Than a Second](https://arxiv.org/abs/2512.10685)