bag100
/

triangulang

@@ -6,8 +6,6 @@ tags:
 - segmentation
 - language-grounding
 - pose-free
-datasets:
-- scannetpp
 pipeline_tag: image-segmentation
 ---
@@ -25,25 +23,22 @@ TrianguLang is a feed-forward, pose-free method for language-guided 3D localizat
 ## Checkpoints
-| Checkpoint | Description | Eval mIoU |
-|---|---|---|
-| `mo_v11/best.pt` | Multi-object (text + spatial), 230 scenes, 8 views, 100 epochs | Best MO |
-| `fullscale_no_qp/best.pt` | Single-object (text-only), 230 scenes, 100 epochs | Best SO |
-## Usage
-```python
-from triangulang.training.train import TrianguLangModel
-checkpoint = torch.load("best.pt", map_location="cpu")
-model = TrianguLangModel(...)
-model.load_state_dict(checkpoint["model_state_dict"])
-```
-## Architecture
-- **Frozen:** SAM3 (848M) + DA3 (335M) = ~1.2B params
-- **Trainable:** GASA Decoder (~10-12M params)
 ## Citation

 - segmentation
 - language-grounding
 - pose-free
 pipeline_tag: image-segmentation
 ---
 ## Checkpoints
+| Checkpoint | Description |
+|---|---|
+| `mo_v11/best.pt` | Multi-object (text + spatial), 230 scenes, 8 views, 100 epochs |
+| `fullscale_no_qp/best.pt` | Single-object (text-only), 230 scenes, 100 epochs |
+## Architecture
+- **Frozen:** SAM3 (841M) + DA3-NESTED-GIANT-LARGE (1.69B) = ~2.5B params
+- **Trainable:** GASA Decoder (~13.5M params)
+## Results (ScanNet++)
+| Setting | mIoU | mAcc |
+|---|---|---|
+| Text-only (single-object) | **62.4%** | **77.4%** |
+| Text-only + CRF | **65.2%** | - |
 ## Citation