aitfindonesia commited on
Commit
783ff3e
·
verified ·
1 Parent(s): 15e570e

upload ViT gambling fine-tuned model

Browse files
Files changed (5) hide show
  1. README.md +112 -0
  2. config.json +131 -0
  3. model.safetensors +3 -0
  4. preprocessor_config.json +47 -0
  5. training_args.bin +3 -0
README.md ADDED
@@ -0,0 +1,112 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ license: apache-2.0
4
+ base_model: PekingU/rtdetr_r50vd_coco_o365
5
+ tags:
6
+ - generated_from_trainer
7
+ model-index:
8
+ - name: v3-rtdetr-r50-gambling-finetune
9
+ results: []
10
+ ---
11
+
12
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
+ should probably proofread and complete it, then remove this comment. -->
14
+
15
+ # v3-rtdetr-r50-gambling-finetune
16
+
17
+ This model is a fine-tuned version of [PekingU/rtdetr_r50vd_coco_o365](https://huggingface.co/PekingU/rtdetr_r50vd_coco_o365) on an unknown dataset.
18
+ It achieves the following results on the evaluation set:
19
+ - Loss: 6.8580
20
+ - Map: 0.7154
21
+ - Map 50: 0.86
22
+ - Map 75: 0.7982
23
+ - Map Small: 0.4818
24
+ - Map Medium: 0.4823
25
+ - Map Large: 0.5059
26
+ - Mar 1: 0.6019
27
+ - Mar 10: 0.8463
28
+ - Mar 100: 0.876
29
+ - Mar Small: 0.8241
30
+ - Mar Medium: 0.8693
31
+ - Mar Large: 0.8723
32
+ - Map Banner Promo: 0.8704
33
+ - Mar 100 Banner Promo: 0.9604
34
+ - Map Cta Button: 0.7422
35
+ - Mar 100 Cta Button: 0.905
36
+ - Map Game Thumbnail: 0.6783
37
+ - Mar 100 Game Thumbnail: 0.9073
38
+ - Map Logo: 0.7334
39
+ - Mar 100 Logo: 0.848
40
+ - Map Menu Nav: 0.5527
41
+ - Mar 100 Menu Nav: 0.7593
42
+
43
+ ## Model description
44
+
45
+ More information needed
46
+
47
+ ## Intended uses & limitations
48
+
49
+ More information needed
50
+
51
+ ## Training and evaluation data
52
+
53
+ More information needed
54
+
55
+ ## Training procedure
56
+
57
+ ### Training hyperparameters
58
+
59
+ The following hyperparameters were used during training:
60
+ - learning_rate: 5e-05
61
+ - train_batch_size: 8
62
+ - eval_batch_size: 8
63
+ - seed: 42
64
+ - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
65
+ - lr_scheduler_type: linear
66
+ - lr_scheduler_warmup_steps: 300
67
+ - num_epochs: 10
68
+
69
+ ### Training results
70
+
71
+ | Training Loss | Epoch | Step | Validation Loss | Map | Map 50 | Map 75 | Map Small | Map Medium | Map Large | Mar 1 | Mar 10 | Mar 100 | Mar Small | Mar Medium | Mar Large | Map Banner Promo | Mar 100 Banner Promo | Map Cta Button | Mar 100 Cta Button | Map Game Thumbnail | Mar 100 Game Thumbnail | Map Logo | Mar 100 Logo | Map Menu Nav | Mar 100 Menu Nav |
72
+ |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:----------:|:---------:|:------:|:------:|:-------:|:---------:|:----------:|:---------:|:----------------:|:--------------------:|:--------------:|:------------------:|:------------------:|:----------------------:|:--------:|:------------:|:------------:|:----------------:|
73
+ | No log | 1.0 | 107 | 20.5763 | 0.0922 | 0.1237 | 0.1082 | 0.1056 | 0.0185 | 0.2108 | 0.1468 | 0.331 | 0.4918 | 0.301 | 0.4038 | 0.5959 | 0.399 | 0.9118 | 0.0195 | 0.4808 | 0.0367 | 0.5013 | 0.0013 | 0.2806 | 0.0046 | 0.2846 |
74
+ | No log | 2.0 | 214 | 9.0251 | 0.5324 | 0.6626 | 0.6192 | 0.3219 | 0.4183 | 0.6377 | 0.462 | 0.7685 | 0.8228 | 0.7076 | 0.7962 | 0.8858 | 0.7688 | 0.9316 | 0.6739 | 0.8749 | 0.5019 | 0.8567 | 0.4994 | 0.7971 | 0.2179 | 0.6538 |
75
+ | No log | 3.0 | 321 | 7.7306 | 0.6155 | 0.7736 | 0.6929 | 0.3994 | 0.4662 | 0.7004 | 0.5238 | 0.796 | 0.8365 | 0.6789 | 0.8151 | 0.8954 | 0.831 | 0.9507 | 0.672 | 0.8703 | 0.6238 | 0.8584 | 0.6296 | 0.7878 | 0.3213 | 0.7154 |
76
+ | No log | 4.0 | 428 | 7.1577 | 0.6898 | 0.8521 | 0.7639 | 0.4178 | 0.5517 | 0.8467 | 0.5838 | 0.813 | 0.8497 | 0.7595 | 0.8232 | 0.9338 | 0.8822 | 0.9676 | 0.7291 | 0.8863 | 0.6569 | 0.8385 | 0.6631 | 0.8165 | 0.5179 | 0.7396 |
77
+ | 29.4032 | 5.0 | 535 | 6.8177 | 0.7202 | 0.8795 | 0.7981 | 0.4606 | 0.552 | 0.8828 | 0.5851 | 0.83 | 0.8689 | 0.7771 | 0.8316 | 0.9249 | 0.9107 | 0.9699 | 0.743 | 0.9005 | 0.6694 | 0.8797 | 0.665 | 0.8266 | 0.6131 | 0.7681 |
78
+ | 29.4032 | 6.0 | 642 | 6.4833 | 0.7517 | 0.9161 | 0.8315 | 0.4888 | 0.6263 | 0.9139 | 0.5966 | 0.8393 | 0.877 | 0.7423 | 0.8503 | 0.9425 | 0.9318 | 0.975 | 0.792 | 0.9114 | 0.721 | 0.8935 | 0.6903 | 0.8216 | 0.6235 | 0.7835 |
79
+ | 29.4032 | 7.0 | 749 | 6.7079 | 0.7452 | 0.9086 | 0.8268 | 0.4626 | 0.5953 | 0.9113 | 0.5999 | 0.8343 | 0.8751 | 0.7833 | 0.8383 | 0.9446 | 0.927 | 0.9691 | 0.7828 | 0.9087 | 0.7239 | 0.8987 | 0.6797 | 0.8144 | 0.6126 | 0.7846 |
80
+ | 29.4032 | 8.0 | 856 | 6.7679 | 0.743 | 0.8989 | 0.8209 | 0.4715 | 0.6187 | 0.9201 | 0.6016 | 0.8354 | 0.8729 | 0.7509 | 0.837 | 0.9499 | 0.9227 | 0.9699 | 0.78 | 0.91 | 0.7224 | 0.8909 | 0.6738 | 0.8158 | 0.616 | 0.778 |
81
+ | 29.4032 | 9.0 | 963 | 6.5281 | 0.7457 | 0.8999 | 0.8317 | 0.4659 | 0.6117 | 0.9221 | 0.5949 | 0.8343 | 0.8674 | 0.69 | 0.8338 | 0.9453 | 0.9264 | 0.9706 | 0.7669 | 0.9023 | 0.7098 | 0.8719 | 0.6927 | 0.8129 | 0.6326 | 0.7791 |
82
+ | 9.6433 | 10.0 | 1070 | 6.5610 | 0.7537 | 0.9065 | 0.8402 | 0.4672 | 0.6283 | 0.926 | 0.6029 | 0.8364 | 0.8735 | 0.7052 | 0.8386 | 0.9521 | 0.9314 | 0.9743 | 0.7786 | 0.9082 | 0.7256 | 0.8792 | 0.6937 | 0.8158 | 0.6392 | 0.7901 |
83
+
84
+
85
+ ### Framework versions
86
+
87
+ - Transformers 5.0.0.dev0
88
+ - Pytorch 2.8.0+cu126
89
+ - Datasets 4.0.0
90
+ - Tokenizers 0.22.1
91
+
92
+
93
+ ### BibTeX entry and citation info
94
+
95
+ ```bibtex
96
+ @misc{lv2023detrs,
97
+ title={DETRs Beat YOLOs on Real-time Object Detection},
98
+ author={Yian Zhao and Wenyu Lv and Shangliang Xu and Jinman Wei and Guanzhong Wang and Qingqing Dang and Yi Liu and Jie Chen},
99
+ year={2023},
100
+ eprint={2304.08069},
101
+ archivePrefix={arXiv},
102
+ primaryClass={cs.CV}
103
+ ```
104
+
105
+ ```bibtex
106
+ @misc{rogge2025transformerstutorials,
107
+ author = {Rogge, Niels},
108
+ title = {Transformers Tutorials},
109
+ year = {2025},
110
+ howpublished = {\url{https://github.com/NielsRogge/Transformers-Tutorials}}
111
+ }
112
+ ```
config.json ADDED
@@ -0,0 +1,131 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "activation_dropout": 0.0,
3
+ "activation_function": "silu",
4
+ "anchor_image_size": null,
5
+ "architectures": [
6
+ "RTDetrForObjectDetection"
7
+ ],
8
+ "attention_dropout": 0.0,
9
+ "auxiliary_loss": true,
10
+ "backbone": null,
11
+ "backbone_config": {
12
+ "depths": [
13
+ 3,
14
+ 4,
15
+ 6,
16
+ 3
17
+ ],
18
+ "downsample_in_bottleneck": false,
19
+ "downsample_in_first_stage": false,
20
+ "dtype": "float32",
21
+ "embedding_size": 64,
22
+ "hidden_act": "relu",
23
+ "hidden_sizes": [
24
+ 256,
25
+ 512,
26
+ 1024,
27
+ 2048
28
+ ],
29
+ "layer_type": "bottleneck",
30
+ "model_type": "rt_detr_resnet",
31
+ "num_channels": 3,
32
+ "out_features": [
33
+ "stage2",
34
+ "stage3",
35
+ "stage4"
36
+ ],
37
+ "out_indices": [
38
+ 2,
39
+ 3,
40
+ 4
41
+ ],
42
+ "stage_names": [
43
+ "stem",
44
+ "stage1",
45
+ "stage2",
46
+ "stage3",
47
+ "stage4"
48
+ ]
49
+ },
50
+ "backbone_kwargs": null,
51
+ "batch_norm_eps": 1e-05,
52
+ "box_noise_scale": 1.0,
53
+ "d_model": 256,
54
+ "decoder_activation_function": "relu",
55
+ "decoder_attention_heads": 8,
56
+ "decoder_ffn_dim": 1024,
57
+ "decoder_in_channels": [
58
+ 256,
59
+ 256,
60
+ 256
61
+ ],
62
+ "decoder_layers": 6,
63
+ "decoder_n_points": 4,
64
+ "disable_custom_kernels": true,
65
+ "dropout": 0.0,
66
+ "dtype": "float32",
67
+ "encode_proj_layers": [
68
+ 2
69
+ ],
70
+ "encoder_activation_function": "gelu",
71
+ "encoder_attention_heads": 8,
72
+ "encoder_ffn_dim": 1024,
73
+ "encoder_hidden_dim": 256,
74
+ "encoder_in_channels": [
75
+ 512,
76
+ 1024,
77
+ 2048
78
+ ],
79
+ "encoder_layers": 1,
80
+ "eos_coefficient": 0.0001,
81
+ "eval_size": null,
82
+ "feat_strides": [
83
+ 8,
84
+ 16,
85
+ 32
86
+ ],
87
+ "focal_loss_alpha": 0.75,
88
+ "focal_loss_gamma": 2.0,
89
+ "freeze_backbone_batch_norms": true,
90
+ "hidden_expansion": 1.0,
91
+ "id2label": {
92
+ "0": "banner_promo",
93
+ "1": "cta_button",
94
+ "2": "game_thumbnail",
95
+ "3": "logo",
96
+ "4": "menu_nav"
97
+ },
98
+ "initializer_bias_prior_prob": null,
99
+ "initializer_range": 0.01,
100
+ "is_encoder_decoder": true,
101
+ "label2id": {
102
+ "banner_promo": 0,
103
+ "cta_button": 1,
104
+ "game_thumbnail": 2,
105
+ "logo": 3,
106
+ "menu_nav": 4
107
+ },
108
+ "label_noise_ratio": 0.5,
109
+ "layer_norm_eps": 1e-05,
110
+ "learn_initial_query": false,
111
+ "matcher_alpha": 0.25,
112
+ "matcher_bbox_cost": 5.0,
113
+ "matcher_class_cost": 2.0,
114
+ "matcher_gamma": 2.0,
115
+ "matcher_giou_cost": 2.0,
116
+ "model_type": "rt_detr",
117
+ "normalize_before": false,
118
+ "num_denoising": 100,
119
+ "num_feature_levels": 3,
120
+ "num_queries": 300,
121
+ "positional_encoding_temperature": 10000,
122
+ "transformers_version": "5.0.0.dev0",
123
+ "use_cache": false,
124
+ "use_focal_loss": true,
125
+ "use_pretrained_backbone": false,
126
+ "use_timm_backbone": false,
127
+ "weight_loss_bbox": 5.0,
128
+ "weight_loss_giou": 2.0,
129
+ "weight_loss_vfl": 1.0,
130
+ "with_box_refine": true
131
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:13f3dee282d43d423cce6e3b9b38d879cb044b7eaee7580c4becaaed59998c3a
3
+ size 171559340
preprocessor_config.json ADDED
@@ -0,0 +1,47 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_valid_processor_keys": [
3
+ "images",
4
+ "annotations",
5
+ "return_segmentation_masks",
6
+ "masks_path",
7
+ "do_resize",
8
+ "size",
9
+ "resample",
10
+ "do_rescale",
11
+ "rescale_factor",
12
+ "do_normalize",
13
+ "do_convert_annotations",
14
+ "image_mean",
15
+ "image_std",
16
+ "do_pad",
17
+ "pad_size",
18
+ "format",
19
+ "return_tensors",
20
+ "data_format",
21
+ "input_data_format"
22
+ ],
23
+ "do_convert_annotations": true,
24
+ "do_normalize": false,
25
+ "do_pad": false,
26
+ "do_rescale": true,
27
+ "do_resize": true,
28
+ "format": "coco_detection",
29
+ "image_mean": [
30
+ 0.485,
31
+ 0.456,
32
+ 0.406
33
+ ],
34
+ "image_processor_type": "RTDetrImageProcessor",
35
+ "image_std": [
36
+ 0.229,
37
+ 0.224,
38
+ 0.225
39
+ ],
40
+ "pad_size": null,
41
+ "resample": 2,
42
+ "rescale_factor": 0.00392156862745098,
43
+ "size": {
44
+ "height": 480,
45
+ "width": 480
46
+ }
47
+ }
training_args.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6196e526411eface89ece58798b6148f0a1b20e0674627f25587e5867e6143fa
3
+ size 5201