Image Segmentation
Transformers
PyTorch
vision
instance-segmentation
neikos00 commited on
Commit
38e88b4
·
verified ·
1 Parent(s): 84e3dd9

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +50 -0
README.md ADDED
@@ -0,0 +1,50 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ license: mit
4
+ tags:
5
+ - vision
6
+ - image-segmentation
7
+ - instance-segmentation
8
+ - pytorch
9
+ pipeline_tag: image-segmentation
10
+ datasets:
11
+ - coco
12
+ base_model:
13
+ - tue-mps/coco_instance_pmt_large_1280_dinov3
14
+ ---
15
+
16
+ # PMT-DINOv3 (Large, 1280px) for COCO Instance Segmentation
17
+
18
+ <div class="flex flex-wrap space-x-1">
19
+ <img alt="PyTorch" src="https://img.shields.io/badge/PyTorch-DE3412?style=flat&logo=pytorch&logoColor=white">
20
+ <img alt="Transformers" src="https://img.shields.io/badge/Transformers-yellow?style=flat&logo=huggingface&logoColor=white">
21
+ </div>
22
+
23
+ ## Overview
24
+
25
+ This is the **large** variant of the PMT-DINOv3 model trained for **instance segmentation** on COCO at **1280x1280** resolution.
26
+
27
+ ## Model Details
28
+
29
+ | Property | Value |
30
+ |----------|-------|
31
+ | Backbone | DINOv3 ViT-L/16 |
32
+ | Input Resolution | 1280x1280 |
33
+ | Task | Instance Segmentation |
34
+ | Dataset | COCO |
35
+
36
+ ## Citation
37
+
38
+ ```bibtex
39
+ @inproceedings{cavagnero2026pmt,
40
+ author = {Cavagnero, Niccolò and Norouzi, Narges and Dubbelman, Gijs and de Geus, Daan},
41
+ title = {PMT: Plain Mask Transformer for Image and Video Segmentation with Frozen Vision Encoders},
42
+ booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)},
43
+ year = {2026},
44
+ }
45
+ ```
46
+
47
+ ## Acknowledgements
48
+
49
+ - Original implementation: [tue-mps/pmt](https://github.com/tue-mps/pmt)
50
+ - Paper: [arXiv:2503.19108](https://arxiv.org/abs/2603.25398)