| | --- |
| | license: apache-2.0 |
| | tags: |
| | - mask2former |
| | - instance-segmentation |
| | - panoptic-segmentation |
| | - semantic-segmentation |
| | - image-segmentation |
| | datasets: |
| | - custom |
| | pipeline_tag: image-segmentation |
| | --- |
| | |
| | # Mask2Former for Segmentation |
| |
|
| | This model is fine-tuned to detect and segment regions across 3 classes. |
| |
|
| | ## Model description |
| |
|
| | This is a Mask2Former model fine-tuned on a custom dataset with polygon annotations in COCO format. It has 3 classes: |
| | - Background (ID: 0) |
| | - Normal (ID: 1) |
| | - Abnormal (ID: 2) |
| |
|
| | ## Intended uses & limitations |
| |
|
| | This model is intended for universal segmentation tasks to identify the specified region types in images. Mask2Former supports instance, semantic, and panoptic segmentation. |
| |
|
| | ### How to use in CVAT |
| |
|
| | 1. In CVAT, go to Models → Add Model |
| | 2. Select Hugging Face as the source |
| | 3. Enter the model path: "{your-username}/mask2former-segmentation" |
| | 4. Configure the appropriate mapping for your labels |
| |
|
| | ### Usage in Python |
| |
|
| | ```python |
| | from transformers import Mask2FormerForUniversalSegmentation, Mask2FormerImageProcessor |
| | import torch |
| | from PIL import Image |
| | |
| | # Load model and processor |
| | model = Mask2FormerForUniversalSegmentation.from_pretrained("{your-username}/mask2former-segmentation") |
| | processor = Mask2FormerImageProcessor.from_pretrained("{your-username}/mask2former-segmentation") |
| | |
| | # Prepare image |
| | image = Image.open("your_image.jpg") |
| | inputs = processor(images=image, return_tensors="pt") |
| | |
| | # Make prediction |
| | with torch.no_grad(): |
| | outputs = model(**inputs) |
| | |
| | # Process outputs for visualization |
| | # (see example code in model repository) |
| | ``` |
| |
|