Image Segmentation
File size: 3,979 Bytes
57bc8d8
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
6136d87
 
57bc8d8
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
ef74bcc
57bc8d8
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
ef74bcc
 
57bc8d8
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
---
license: apache-2.0
pipeline_tag: image-segmentation
---
# Model Card for Model ID

<!-- YOLO‑based single‑class photocard detector trained on 156 hand‑annotated images. Designed to detect selfie‑style photocards (~55×85 mm) in real‑world scenes. Achieved 97.4% mAP@50, 94.4% precision, 94.8% recall -->

This modelcard aims to be a base template for new models. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/modelcard_template.md?plain=1).

## Model Details
Architecture: YOLO object detector (Roboflow hosted training)

Task: Single‑class object detection

Output: Bounding boxes around photocards

Intended use: Stage 1 of a photocard processing pipeline (cataloging, classification, pricing)

https://app.roboflow.com/priyas-workspace-nntro/bdata-497-final-project/models/bdata-497-final-project/1
### Model Description

- **Developed by:** Priya Rasal
- **Model type:** YOLO object detector (bounding box detection)
- **Finetuned from model:** Pretrained YOLO backbone provided by Roboflow

## Uses
### Direct Use
Detecting photocards in images

Cropping photocards for downstream tasks

Preprocessing for classification, cataloging, or pricing systems

Identifying photocards in cluttered or real‑world scenes
### Downstream Use
Photocard classification (e.g., identifying the idol or version)

Marketplace automation (auto‑detecting cards in listings)

Inventory management for collectors

Dataset creation for future models
### Out-of-Scope Use
Detecting people or faces

Identifying which idol is on the photocard

Detecting non‑photocard rectangular objects (phones, books, receipts)

Any biometric or identity recognition
## Bias, Risks, and Limitations
May detect rectangular objects as photocards in rare cases

May miss photocards with extreme glare, reflections, or occlusion

Performance depends on lighting and background diversity

Only trained on 156 images — limited exposure to rare edge cases

Not suitable for identity recognition or personal data analysis
### Recommendations
Use a confidence threshold appropriate for your application

Validate predictions manually in high‑stakes use cases

Retrain or fine‑tune with more diverse data for improved robustness
## How to Get Started with the Model
Use the code below to get started with the model.

from ultralytics import YOLO

model = YOLO("path/to/your/model.pt")
results = model("your_image.jpg")
results.show()
## Training Details
### Training Data
Total images: 156

Source: Personal photocard collection + marketplace images

Annotation method: Foundation model auto‑labeling + manual correction

Class definition: Photocard = selfie‑style, rectangular, ~55×85 mm

Excluded: postcards, album inclusions, sleeves without cards, binder pockets
### Training Procedure
Training platform: Roboflow Hosted Training

Training approach: Transfer learning from pretrained YOLO backbone

Image size: 640×640

Epochs: ~50–100 (auto‑selected)

Batch size: Auto‑selected

Learning rate: Warmup + cosine decay

Optimizer: AdamW or SGD (Roboflow default)

Precision: Mixed precision (fp16)
#### Speeds, Sizes, Times [optional]
Training time: ~10–20 minutes (depending on GPU)

Model size: Depends on YOLO variant exported (typically 10–40 MB)
## Evaluation
### Testing Data, Factors & Metrics
#### Testing Data
Validation and test splits generated automatically by Roboflow (70/20/10)
#### Factors
Lighting variation

Background clutter

Card orientation

Sleeve reflections

Flash glare

Occluding shadows

Overlaps
#### Metrics
mAP@50: 97.4%

Precision: 94.4%

Recall: 94.8%
### Results
#### Summary
The model demonstrates strong localization accuracy and generalization across diverse real‑world scenes. 

High precision and recall indicate low false positives and low missed detections.

Photocard class annotated/identified 1023 times
## Model Card Author

Priya Rasal