ooliverz committed on
Commit 172abab · verified · 1 Parent(s): da30e57

End of training

Files changed (2)
  1. README.md +61 -45
  2. model.safetensors +1 -1
README.md CHANGED
@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [microsoft/git-large-r-coco](https://huggingface.co/microsoft/git-large-r-coco) on the imagefolder dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.1340
22
- - Meteor Score: {'meteor': 0.5103395242652348}
23
 
24
  ## Model description
25
 
@@ -46,53 +46,69 @@ The following hyperparameters were used during training:
46
  - total_train_batch_size: 1024
47
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
48
  - lr_scheduler_type: cosine
49
- - num_epochs: 200
50
  - mixed_precision_training: Native AMP
51
 
52
  ### Training results
53
 
54
- | Training Loss | Epoch | Step | Validation Loss | Meteor Score |
55
- |:-------------:|:-----:|:----:|:---------------:|:-------------------------------:|
56
- | 4.8755 | 5.0 | 5 | 4.6208 | {'meteor': 0.41234022080924504} |
57
- | 4.577 | 10.0 | 10 | 4.2632 | {'meteor': 0.462553330633559} |
58
- | 4.228 | 15.0 | 15 | 3.9351 | {'meteor': 0.4637035445872813} |
59
- | 3.8968 | 20.0 | 20 | 3.6117 | {'meteor': 0.4716545164583618} |
60
- | 3.5731 | 25.0 | 25 | 3.2943 | {'meteor': 0.4775515760416854} |
61
- | 3.2551 | 30.0 | 30 | 2.9844 | {'meteor': 0.4827646049987953} |
62
- | 2.9433 | 35.0 | 35 | 2.6819 | {'meteor': 0.4820540318646651} |
63
- | 2.6406 | 40.0 | 40 | 2.3893 | {'meteor': 0.48387008867521647} |
64
- | 2.348 | 45.0 | 45 | 2.1093 | {'meteor': 0.48688764217538394} |
65
- | 2.0685 | 50.0 | 50 | 1.8438 | {'meteor': 0.48840003275775357} |
66
- | 1.8052 | 55.0 | 55 | 1.5954 | {'meteor': 0.49229450416352066} |
67
- | 1.5592 | 60.0 | 60 | 1.3681 | {'meteor': 0.49462336346473573} |
68
- | 1.3335 | 65.0 | 65 | 1.1642 | {'meteor': 0.4943789886645904} |
69
- | 1.1308 | 70.0 | 70 | 0.9838 | {'meteor': 0.4932081022324161} |
70
- | 0.9511 | 75.0 | 75 | 0.8281 | {'meteor': 0.4949580448605414} |
71
- | 0.7953 | 80.0 | 80 | 0.6959 | {'meteor': 0.4945236890902709} |
72
- | 0.6629 | 85.0 | 85 | 0.5849 | {'meteor': 0.49613363555493917} |
73
- | 0.5518 | 90.0 | 90 | 0.4970 | {'meteor': 0.49476521946537905} |
74
- | 0.4599 | 95.0 | 95 | 0.4203 | {'meteor': 0.49694213467111825} |
75
- | 0.3856 | 100.0 | 100 | 0.3609 | {'meteor': 0.5023234677593583} |
76
- | 0.3248 | 105.0 | 105 | 0.3137 | {'meteor': 0.49291655975794224} |
77
- | 0.2756 | 110.0 | 110 | 0.2757 | {'meteor': 0.49187607478517975} |
78
- | 0.236 | 115.0 | 115 | 0.2432 | {'meteor': 0.4999076360911653} |
79
- | 0.2039 | 120.0 | 120 | 0.2196 | {'meteor': 0.5054381333125716} |
80
- | 0.1782 | 125.0 | 125 | 0.2006 | {'meteor': 0.4998272338605217} |
81
- | 0.1576 | 130.0 | 130 | 0.1864 | {'meteor': 0.5095656338179543} |
82
- | 0.1411 | 135.0 | 135 | 0.1755 | {'meteor': 0.5030929069103355} |
83
- | 0.1279 | 140.0 | 140 | 0.1653 | {'meteor': 0.5097532440481348} |
84
- | 0.1173 | 145.0 | 145 | 0.1576 | {'meteor': 0.5126165420799782} |
85
- | 0.1088 | 150.0 | 150 | 0.1516 | {'meteor': 0.5168283983418568} |
86
- | 0.1023 | 155.0 | 155 | 0.1462 | {'meteor': 0.5145210432669091} |
87
- | 0.097 | 160.0 | 160 | 0.1424 | {'meteor': 0.5135483205500848} |
88
- | 0.0929 | 165.0 | 165 | 0.1399 | {'meteor': 0.5099977164420265} |
89
- | 0.0899 | 170.0 | 170 | 0.1384 | {'meteor': 0.5093303675700068} |
90
- | 0.0876 | 175.0 | 175 | 0.1369 | {'meteor': 0.5097771482308939} |
91
- | 0.086 | 180.0 | 180 | 0.1357 | {'meteor': 0.5080664663529372} |
92
- | 0.085 | 185.0 | 185 | 0.1347 | {'meteor': 0.5101483486776783} |
93
- | 0.0843 | 190.0 | 190 | 0.1342 | {'meteor': 0.5110798690668398} |
94
- | 0.0839 | 195.0 | 195 | 0.1340 | {'meteor': 0.5102824562761434} |
95
- | 0.0838 | 200.0 | 200 | 0.1340 | {'meteor': 0.5103395242652348} |
96
 
97
 
98
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [microsoft/git-large-r-coco](https://huggingface.co/microsoft/git-large-r-coco) on the imagefolder dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.0718
22
+ - Meteor Score: {'meteor': 0.666916447209016}
23
 
24
  ## Model description
25
 
 
46
  - total_train_batch_size: 1024
47
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
48
  - lr_scheduler_type: cosine
49
+ - num_epochs: 280
50
  - mixed_precision_training: Native AMP
51
 
52
  ### Training results
53
 
54
+ | Training Loss | Epoch | Step | Validation Loss | Meteor Score |
55
+ |:-------------:|:-----:|:----:|:---------------:|:------------------------------:|
56
+ | 0.1391 | 5.0 | 5 | 0.1066 | {'meteor': 0.5464308622024807} |
57
+ | 0.0701 | 10.0 | 10 | 0.0890 | {'meteor': 0.5291953574710693} |
58
+ | 0.0468 | 15.0 | 15 | 0.0789 | {'meteor': 0.5496036528506629} |
59
+ | 0.0325 | 20.0 | 20 | 0.0711 | {'meteor': 0.5439550126260527} |
60
+ | 0.0225 | 25.0 | 25 | 0.0672 | {'meteor': 0.552830292712962} |
61
+ | 0.0161 | 30.0 | 30 | 0.0679 | {'meteor': 0.5428427974008356} |
62
+ | 0.0123 | 35.0 | 35 | 0.0667 | {'meteor': 0.5293045890136272} |
63
+ | 0.0114 | 40.0 | 40 | 0.0650 | {'meteor': 0.5470594759810721} |
64
+ | 0.0084 | 45.0 | 45 | 0.0643 | {'meteor': 0.5561520014815091} |
65
+ | 0.0064 | 50.0 | 50 | 0.0651 | {'meteor': 0.5610482415310057} |
66
+ | 0.0051 | 55.0 | 55 | 0.0667 | {'meteor': 0.5487886186676766} |
67
+ | 0.0041 | 60.0 | 60 | 0.0685 | {'meteor': 0.5616454798881054} |
68
+ | 0.0037 | 65.0 | 65 | 0.0692 | {'meteor': 0.5699718193151979} |
69
+ | 0.0032 | 70.0 | 70 | 0.0685 | {'meteor': 0.5580309821952526} |
70
+ | 0.0029 | 75.0 | 75 | 0.0683 | {'meteor': 0.5748316918656275} |
71
+ | 0.0026 | 80.0 | 80 | 0.0684 | {'meteor': 0.5895555620949836} |
72
+ | 0.0024 | 85.0 | 85 | 0.0687 | {'meteor': 0.5857286795606523} |
73
+ | 0.0022 | 90.0 | 90 | 0.0695 | {'meteor': 0.5846091691510659} |
74
+ | 0.0021 | 95.0 | 95 | 0.0697 | {'meteor': 0.594178001025322} |
75
+ | 0.002 | 100.0 | 100 | 0.0694 | {'meteor': 0.6081664555245014} |
76
+ | 0.0019 | 105.0 | 105 | 0.0696 | {'meteor': 0.6221380770749247} |
77
+ | 0.0018 | 110.0 | 110 | 0.0697 | {'meteor': 0.6033596220663302} |
78
+ | 0.0017 | 115.0 | 115 | 0.0699 | {'meteor': 0.5934573428451106} |
79
+ | 0.0016 | 120.0 | 120 | 0.0697 | {'meteor': 0.6100068434120042} |
80
+ | 0.0016 | 125.0 | 125 | 0.0701 | {'meteor': 0.6226574997552852} |
81
+ | 0.0015 | 130.0 | 130 | 0.0704 | {'meteor': 0.6266141282552855} |
82
+ | 0.0015 | 135.0 | 135 | 0.0708 | {'meteor': 0.6266624596822102} |
83
+ | 0.0014 | 140.0 | 140 | 0.0713 | {'meteor': 0.6253640811501537} |
84
+ | 0.0014 | 145.0 | 145 | 0.0715 | {'meteor': 0.6268835998646377} |
85
+ | 0.0013 | 150.0 | 150 | 0.0716 | {'meteor': 0.6391957882900023} |
86
+ | 0.0013 | 155.0 | 155 | 0.0716 | {'meteor': 0.6372778085384403} |
87
+ | 0.0013 | 160.0 | 160 | 0.0714 | {'meteor': 0.6420748904347257} |
88
+ | 0.0012 | 165.0 | 165 | 0.0712 | {'meteor': 0.6542694795600709} |
89
+ | 0.0012 | 170.0 | 170 | 0.0711 | {'meteor': 0.6640970636042774} |
90
+ | 0.0012 | 175.0 | 175 | 0.0712 | {'meteor': 0.6581755350563626} |
91
+ | 0.0012 | 180.0 | 180 | 0.0715 | {'meteor': 0.6563782816038787} |
92
+ | 0.0012 | 185.0 | 185 | 0.0717 | {'meteor': 0.6575711357356673} |
93
+ | 0.0012 | 190.0 | 190 | 0.0719 | {'meteor': 0.6615976015516674} |
94
+ | 0.0011 | 195.0 | 195 | 0.0719 | {'meteor': 0.6664084419111367} |
95
+ | 0.0011 | 200.0 | 200 | 0.0719 | {'meteor': 0.6714641579289897} |
96
+ | 0.0011 | 205.0 | 205 | 0.0719 | {'meteor': 0.6733748542723833} |
97
+ | 0.0011 | 210.0 | 210 | 0.0718 | {'meteor': 0.6716577609340998} |
98
+ | 0.0011 | 215.0 | 215 | 0.0718 | {'meteor': 0.6718891332503508} |
99
+ | 0.0011 | 220.0 | 220 | 0.0718 | {'meteor': 0.6705874088889952} |
100
+ | 0.0011 | 225.0 | 225 | 0.0718 | {'meteor': 0.6687440927433674} |
101
+ | 0.0011 | 230.0 | 230 | 0.0718 | {'meteor': 0.6683625041894395} |
102
+ | 0.0011 | 235.0 | 235 | 0.0717 | {'meteor': 0.667993183281943} |
103
+ | 0.0011 | 240.0 | 240 | 0.0717 | {'meteor': 0.6684321600001021} |
104
+ | 0.0011 | 245.0 | 245 | 0.0718 | {'meteor': 0.668594259646557} |
105
+ | 0.0011 | 250.0 | 250 | 0.0718 | {'meteor': 0.6675028779539088} |
106
+ | 0.0011 | 255.0 | 255 | 0.0718 | {'meteor': 0.6677654410135724} |
107
+ | 0.0011 | 260.0 | 260 | 0.0718 | {'meteor': 0.6664467133271365} |
108
+ | 0.0011 | 265.0 | 265 | 0.0718 | {'meteor': 0.6667295014161946} |
109
+ | 0.0011 | 270.0 | 270 | 0.0718 | {'meteor': 0.6671952775082015} |
110
+ | 0.0011 | 275.0 | 275 | 0.0718 | {'meteor': 0.6672252794210247} |
111
+ | 0.0011 | 280.0 | 280 | 0.0718 | {'meteor': 0.666916447209016} |
112
 
113
 
114
  ### Framework versions
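
Reviewer note on the new training-results table: the final row is not the best row on either metric. Validation loss bottoms out at 0.0643 (step 45) and then drifts up to 0.0718, while the METEOR score peaks at 0.6734 (step 205) against 0.6669 at step 280. The sketch below shows one way to pull those rows out of the table programmatically. It is a minimal illustration, not part of the commit: `ROWS` is a three-row excerpt copied from the table above, and `parse_row` is a hypothetical helper name.

```python
import ast

# Three rows copied verbatim from the new training-results table in this diff.
# Paste the full table here to scan every logged evaluation step.
ROWS = """
| 0.0084 | 45.0 | 45 | 0.0643 | {'meteor': 0.5561520014815091} |
| 0.0011 | 205.0 | 205 | 0.0719 | {'meteor': 0.6733748542723833} |
| 0.0011 | 280.0 | 280 | 0.0718 | {'meteor': 0.666916447209016} |
"""


def parse_row(line):
    """Split one Markdown table row into typed fields (hypothetical helper)."""
    cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
    train_loss, epoch, step, val_loss, metric = cells
    return {
        "train_loss": float(train_loss),
        "epoch": float(epoch),
        "step": int(step),
        "val_loss": float(val_loss),
        # The metric cell is a Python dict literal such as {'meteor': 0.55}.
        "meteor": ast.literal_eval(metric)["meteor"],
    }


rows = [parse_row(line) for line in ROWS.strip().splitlines()]
best_meteor = max(rows, key=lambda r: r["meteor"])
best_loss = min(rows, key=lambda r: r["val_loss"])

print("best METEOR:", best_meteor["step"], best_meteor["meteor"])
print("lowest validation loss:", best_loss["step"], best_loss["val_loss"])
```

If this run used the `transformers` `Trainer` (the card format suggests so, but the diff does not confirm it), the `load_best_model_at_end` and `metric_for_best_model` options of `TrainingArguments` are the usual way to keep the best checkpoint rather than the last one.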
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a6ddbe82843aa67130ca66c8286b8561015c5fdf6e2b2dc7254b0666e43d55d6
3
  size 1576851440
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7e75c4f9c315681a0cf5c21850e4eb24ce6e78a0d8317c4518cc6d62ff5624d8
3
  size 1576851440
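
The `model.safetensors` change replaces only the `oid` line of the Git LFS pointer; the `size` is unchanged at 1576851440 bytes, as expected when the weights change but the architecture does not. A downloaded copy can be checked against the new pointer roughly as follows. This is a minimal sketch: `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, and the local file path in the final comment is an assumption.

```python
import hashlib
import os

# New pointer contents, copied from the diff above.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:7e75c4f9c315681a0cf5c21850e4eb24ce6e78a0d8317c4518cc6d62ff5624d8
size 1576851440
"""


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the pointer's size and SHA-256."""
    if pointer["algo"] != "sha256":
        raise ValueError("unsupported oid algorithm: " + pointer["algo"])
    # Cheap size check first, so a wrong file is rejected without hashing it.
    if os.path.getsize(path) != pointer["size"]:
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Hash in chunks so a ~1.5 GB file is never loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(NEW_POINTER)
# Example (path is an assumption): matches_pointer("model.safetensors", pointer)
```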