File size: 1,131 Bytes
91a6bb1
 
 
 
 
 
 
 
 
 
733228b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
---
license: mit
language:
- en
tags:
- image-classification
- multi-label
- resnet
- pytorch
---
# Multi-Label Object Classification using ResNet

## Model Description
ResNet18 and ResNet50 models fine-tuned for multi-label object classification, 
capable of detecting 12 objects simultaneously in a single image.

## Classes
`backpack`, `book`, `bottle`, `calculator`, `chair`, `clock`, 
`desk`, `keychain`, `laptop`, `paper`, `pen`, `phone`

## Usage
Download all 10 `.pth` files and use with the inference script from the 
[GitHub repository](https://github.com/pranav1233/multi-label-object-classification).

## Model Files
- `resnet18_fold1.pth` through `resnet18_fold5.pth` — ResNet18 ensemble
- `resnet50_fold1.pth` through `resnet50_fold5.pth` — ResNet50 ensemble

## Performance

| Model    | Exact Match | Micro F1 | Mean IOU |
|----------|-------------|----------|----------|
| ResNet18 | 52.29%      | 76.92%   | 0.7200   |
| ResNet50 | 68.78%      | 86.18%   | 0.8301   |

## Training
- 5-Fold Stratified Cross Validation
- Test Time Augmentation (TTA) with 10 augmented views
- Optimal prediction threshold: 0.40