buildoak commited on
Commit
c792340
·
verified ·
1 Parent(s): 4ea5fb1

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +44 -0
README.md ADDED
@@ -0,0 +1,44 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # SHARP ONNX — Apple's Single-Image 3D Gaussian Splatting
2
+
3
+ ONNX export of [Apple's SHARP model](https://github.com/apple/ml-sharp) for use with [tortuise](https://github.com/buildoak/tortuise), a terminal-native 3D Gaussian Splatting viewer.
4
+
5
+ ## Files
6
+
7
+ | File | Size | Description |
8
+ |------|------|-------------|
9
+ | `sharp.onnx` | 1.9 MB | Model structure (ONNX graph) |
10
+ | `sharp.onnx.data` | 2.6 GB | Model weights (external data) |
11
+
12
+ Both files are required. The model exceeds protobuf's 2GB limit, so weights are stored separately.
13
+
14
+ ## Usage
15
+
16
+ These files are automatically downloaded by tortuise when you run:
17
+
18
+ ```bash
19
+ cargo install tortuise --features sharp
20
+ tortuise photo.jpg
21
+ ```
22
+
23
+ Or manually place both files in `~/.tortuise/models/`.
24
+
25
+ ## Model Details
26
+
27
+ - **Architecture:** DINOv2 ViT-Large encoder + Sliding Pyramid Network + DPT decoders
28
+ - **Parameters:** 702M (340M trainable)
29
+ - **Input:** Single RGB image (resized to 1536×1536 internally)
30
+ - **Output:** ~1.2M 3D Gaussians (positions, scales, rotations, colors, opacities)
31
+ - **ONNX opset:** 17
32
+ - **Source checkpoint:** `sharp_2572gikvuh.pt` from [apple/Sharp](https://huggingface.co/apple/Sharp)
33
+
34
+ ## License
35
+
36
+ The model weights are licensed under the [Apple Machine Learning Research Model License](LICENSE). This is a **research-only, non-commercial** license. See the LICENSE file for full terms.
37
+
38
+ This ONNX conversion is a format transformation of Apple's original PyTorch checkpoint. No architectural modifications were made.
39
+
40
+ ## Attribution
41
+
42
+ Based on Apple SHARP model. Copyright (C) 2025 Apple Inc. Licensed under the Apple Machine Learning Research Model License Agreement.
43
+
44
+ Paper: [SHARP: Monocular View Synthesis in Less Than a Second](https://arxiv.org/abs/2512.10685)