| Visible device: cuda | |
| Seed used: 1 | |
| Batch size: 64 | |
| Epochs: 40 | |
| Learning rate: 1e-05 | |
| Entropy weight: 0.01 | |
| Regularization weight: 0.0 | |
| Only use multiwoz like domains: False | |
| We use: 100.0% of the data | |
| Dialogue order used: 0 | |
| Vectorizer: Data set used is multiwoz21 | |
| We filter state by active domains: True | |
| Vectorizer: Data set used is multiwoz21 | |
| Embedding semantic descriptions: True | |
| Embedded descriptions successfully. Size: torch.Size([338, 768]) | |
| Data set used for descriptions: multiwoz21 | |
| We use Roberta to embed actions. | |
| Didnt load a model | |
| Start training | |
| Epoch: 0 | |
| Average actions: 1.957058072090149 | |
| Average target actions: 2.669339895248413 | |
| Precision: 0.13822525597269625 | |
| Recall: 0.10146667362597213 | |
| F1: 0.11702736056346508 | |
| <<dialog policy>> epoch 0: saved network to mdl | |
| Best Precision: 0.13822525597269625 | |
| Best Recall: 0.10146667362597213 | |
| Best F1: 0.11702736056346508 | |
| Epoch: 1 | |
| Precision: 0.13822525597269625 | |
| Recall: 0.10146667362597213 | |
| F1: 0.11702736056346508 | |
| Best Precision: 0.13822525597269625 | |
| Best Recall: 0.10146667362597213 | |
| Best F1: 0.11702736056346508 | |
| Epoch: 2 | |
| Average actions: 2.0794308185577393 | |
| Average target actions: 2.6675729751586914 | |
| Precision: 0.22303363258743134 | |
| Recall: 0.1737564591053813 | |
| F1: 0.19533519143318176 | |
| <<dialog policy>> epoch 2: saved network to mdl | |
| Best Precision: 0.22303363258743134 | |
| Best Recall: 0.1737564591053813 | |
| Best F1: 0.19533519143318176 | |
| Epoch: 3 | |
| Precision: 0.22303363258743134 | |
| Recall: 0.1737564591053813 | |
| F1: 0.19533519143318176 | |
| Best Precision: 0.22303363258743134 | |
| Best Recall: 0.1737564591053813 | |
| Best F1: 0.19533519143318176 | |
| Epoch: 4 | |
| Average actions: 2.0110926628112793 | |
| Average target actions: 2.665806293487549 | |
| Precision: 0.26409084614319345 | |
| Recall: 0.19907093272091445 | |
| F1: 0.22701705306389688 | |
| <<dialog policy>> epoch 4: saved network to mdl | |
| Best Precision: 0.26409084614319345 | |
| Best Recall: 0.19907093272091445 | |
| Best F1: 0.22701705306389688 | |
| Epoch: 5 | |
| Precision: 0.26409084614319345 | |
| Recall: 0.19907093272091445 | |
| F1: 0.22701705306389688 | |
| Best Precision: 0.26409084614319345 | |
| Best Recall: 0.19907093272091445 | |
| Best F1: 0.22701705306389688 | |
| Epoch: 6 | |
| Average actions: 1.9673057794570923 | |
| Average target actions: 2.667219877243042 | |
| Precision: 0.2910210146465719 | |
| Recall: 0.21467717521791324 | |
| F1: 0.2470863871200288 | |
| <<dialog policy>> epoch 6: saved network to mdl | |
| Best Precision: 0.2910210146465719 | |
| Best Recall: 0.21467717521791324 | |
| Best F1: 0.2470863871200288 | |
| Epoch: 7 | |
| Precision: 0.2910210146465719 | |
| Recall: 0.21467717521791324 | |
| F1: 0.2470863871200288 | |
| Best Precision: 0.2910210146465719 | |
| Best Recall: 0.21467717521791324 | |
| Best F1: 0.2470863871200288 | |
| Epoch: 8 | |
| Average actions: 1.8258512020111084 | |
| Average target actions: 2.667926549911499 | |
| Precision: 0.30450038138825325 | |
| Recall: 0.20836160551176994 | |
| F1: 0.24742012457776819 | |
| <<dialog policy>> epoch 8: saved network to mdl | |
| Best Precision: 0.30450038138825325 | |
| Best Recall: 0.21467717521791324 | |
| Best F1: 0.24742012457776819 | |
| Epoch: 9 | |
| Precision: 0.30450038138825325 | |
| Recall: 0.20836160551176994 | |
| F1: 0.24742012457776819 | |
| Best Precision: 0.30450038138825325 | |
| Best Recall: 0.21467717521791324 | |
| Best F1: 0.24742012457776819 | |
| Epoch: 10 | |
| Average actions: 1.7796674966812134 | |
| Average target actions: 2.66333270072937 | |
| Precision: 0.3297132588483475 | |
| Recall: 0.2202620178506185 | |
| F1: 0.2640966268227048 | |
| <<dialog policy>> epoch 10: saved network to mdl | |
| Best Precision: 0.3297132588483475 | |
| Best Recall: 0.2202620178506185 | |
| Best F1: 0.2640966268227048 | |
| Epoch: 11 | |
| Precision: 0.3297132588483475 | |
| Recall: 0.2202620178506185 | |
| F1: 0.2640966268227048 | |
| Best Precision: 0.3297132588483475 | |
| Best Recall: 0.2202620178506185 | |
| Best F1: 0.2640966268227048 | |
| Epoch: 12 | |
| Average actions: 1.8398014307022095 | |
| Average target actions: 2.67004656791687 | |
| Precision: 0.34064769975786924 | |
| Recall: 0.23498094890129964 | |
| F1: 0.27811583011583013 | |
| <<dialog policy>> epoch 12: saved network to mdl | |
| Best Precision: 0.34064769975786924 | |
| Best Recall: 0.23498094890129964 | |
| Best F1: 0.27811583011583013 | |
| Epoch: 13 | |
| Precision: 0.34064769975786924 | |
| Recall: 0.23498094890129964 | |
| F1: 0.27811583011583013 | |
| Best Precision: 0.34064769975786924 | |
| Best Recall: 0.23498094890129964 | |
| Best F1: 0.27811583011583013 | |
| Epoch: 14 | |
| Average actions: 1.7070426940917969 | |
| Average target actions: 2.667219877243042 | |
| Precision: 0.35462034091835903 | |
| Recall: 0.22694295109348087 | |
| F1: 0.2767663908338638 | |
| Best Precision: 0.35462034091835903 | |
| Best Recall: 0.23498094890129964 | |
| Best F1: 0.27811583011583013 | |
| Epoch: 15 | |
| Precision: 0.35462034091835903 | |
| Recall: 0.22694295109348087 | |
| F1: 0.2767663908338638 | |
| Best Precision: 0.35462034091835903 | |
| Best Recall: 0.23498094890129964 | |
| Best F1: 0.27811583011583013 | |
| Epoch: 16 | |
| Average actions: 1.6812468767166138 | |
| Average target actions: 2.6643927097320557 | |
| Precision: 0.34859650575474044 | |
| Recall: 0.21974006994101988 | |
| F1: 0.2695607632219234 | |
| Best Precision: 0.35462034091835903 | |
| Best Recall: 0.23498094890129964 | |
| Best F1: 0.27811583011583013 | |
| Epoch: 17 | |
| Precision: 0.34859650575474044 | |
| Recall: 0.21974006994101988 | |
| F1: 0.2695607632219234 | |
| Best Precision: 0.35462034091835903 | |
| Best Recall: 0.23498094890129964 | |
| Best F1: 0.27811583011583013 | |
| Epoch: 18 | |
| Average actions: 1.675270438194275 | |
| Average target actions: 2.6640396118164062 | |
| Precision: 0.35976419794088343 | |
| Recall: 0.22616002922908293 | |
| F1: 0.27772970547703746 | |
| Best Precision: 0.35976419794088343 | |
| Best Recall: 0.23498094890129964 | |
| Best F1: 0.27811583011583013 | |
| Epoch: 19 | |
| Precision: 0.35976419794088343 | |
| Recall: 0.22616002922908293 | |
| F1: 0.27772970547703746 | |
| Best Precision: 0.35976419794088343 | |
| Best Recall: 0.23498094890129964 | |
| Best F1: 0.27811583011583013 | |
| Epoch: 20 | |
| Average actions: 1.5666790008544922 | |
| Average target actions: 2.6647462844848633 | |
| Precision: 0.3769442716203004 | |
| Recall: 0.2213581084607756 | |
| F1: 0.27892140743176586 | |
| <<dialog policy>> epoch 20: saved network to mdl | |
| Best Precision: 0.3769442716203004 | |
| Best Recall: 0.23498094890129964 | |
| Best F1: 0.27892140743176586 | |
| Epoch: 21 | |
| Precision: 0.3769442716203004 | |
| Recall: 0.2213581084607756 | |
| F1: 0.27892140743176586 | |
| Best Precision: 0.3769442716203004 | |
| Best Recall: 0.23498094890129964 | |
| Best F1: 0.27892140743176586 | |
| Epoch: 22 | |
| Average actions: 1.6693706512451172 | |
| Average target actions: 2.6661596298217773 | |
| Precision: 0.3716379382130069 | |
| Recall: 0.23294535205386502 | |
| F1: 0.2863834702258727 | |
| <<dialog policy>> epoch 22: saved network to mdl | |
| Best Precision: 0.3769442716203004 | |
| Best Recall: 0.23498094890129964 | |
| Best F1: 0.2863834702258727 | |
| Epoch: 23 | |
| Precision: 0.3716379382130069 | |
| Recall: 0.23294535205386502 | |
| F1: 0.2863834702258727 | |
| Best Precision: 0.3769442716203004 | |
| Best Recall: 0.23498094890129964 | |
| Best F1: 0.2863834702258727 | |
| Epoch: 24 | |
| Average actions: 1.6701388359069824 | |
| Average target actions: 2.6643927097320557 | |
| Precision: 0.3714618714618715 | |
| Recall: 0.23289315726290516 | |
| F1: 0.2862917455327067 | |
| Best Precision: 0.3769442716203004 | |
| Best Recall: 0.23498094890129964 | |
| Best F1: 0.2863834702258727 | |
| Epoch: 25 | |
| Precision: 0.3714618714618715 | |
| Recall: 0.23289315726290516 | |
| F1: 0.2862917455327067 | |
| Best Precision: 0.3769442716203004 | |
| Best Recall: 0.23498094890129964 | |
| Best F1: 0.2863834702258727 | |
| Epoch: 26 | |
| Average actions: 1.6909722089767456 | |
| Average target actions: 2.665099620819092 | |
| Precision: 0.3781160016454134 | |
| Recall: 0.2398872592515267 | |
| F1: 0.2935428242958421 | |
| <<dialog policy>> epoch 26: saved network to mdl | |
| Best Precision: 0.3781160016454134 | |
| Best Recall: 0.2398872592515267 | |
| Best F1: 0.2935428242958421 | |
| Epoch: 27 | |
| Precision: 0.3781160016454134 | |
| Recall: 0.2398872592515267 | |
| F1: 0.2935428242958421 | |
| Best Precision: 0.3781160016454134 | |
| Best Recall: 0.2398872592515267 | |
| Best F1: 0.2935428242958421 | |
| Epoch: 28 | |
| Average actions: 1.8047566413879395 | |
| Average target actions: 2.6643927097320557 | |
| Precision: 0.3654779326811985 | |
| Recall: 0.24766428310454616 | |
| F1: 0.29525231783958683 | |
| <<dialog policy>> epoch 28: saved network to mdl | |
| Best Precision: 0.3781160016454134 | |
| Best Recall: 0.24766428310454616 | |
| Best F1: 0.29525231783958683 | |
| Epoch: 29 | |
| Precision: 0.3654779326811985 | |
| Recall: 0.24766428310454616 | |
| F1: 0.29525231783958683 | |
| Best Precision: 0.3781160016454134 | |
| Best Recall: 0.24766428310454616 | |
| Best F1: 0.29525231783958683 | |
| Epoch: 30 | |
| Average actions: 1.680601716041565 | |
| Average target actions: 2.6640396118164062 | |
| Precision: 0.37665562913907286 | |
| Recall: 0.23748629886737305 | |
| F1: 0.2913025384935497 | |
| Best Precision: 0.3781160016454134 | |
| Best Recall: 0.24766428310454616 | |
| Best F1: 0.29525231783958683 | |
| Epoch: 31 | |
| Precision: 0.37665562913907286 | |
| Recall: 0.23748629886737305 | |
| F1: 0.2913025384935497 | |
| Best Precision: 0.3781160016454134 | |
| Best Recall: 0.24766428310454616 | |
| Best F1: 0.29525231783958683 | |
| Epoch: 32 | |
| Average actions: 1.7778853178024292 | |
| Average target actions: 2.667219877243042 | |
| Precision: 0.3660120491354354 | |
| Recall: 0.2441672321102354 | |
| F1: 0.2929242329367564 | |
| Best Precision: 0.3781160016454134 | |
| Best Recall: 0.24766428310454616 | |
| Best F1: 0.29525231783958683 | |
| Epoch: 33 | |
| Precision: 0.3660120491354354 | |
| Recall: 0.2441672321102354 | |
| F1: 0.2929242329367564 | |
| Best Precision: 0.3781160016454134 | |
| Best Recall: 0.24766428310454616 | |
| Best F1: 0.29525231783958683 | |
| Epoch: 34 | |
| Average actions: 1.726846694946289 | |
| Average target actions: 2.66333270072937 | |
| Precision: 0.3723121526938874 | |
| Recall: 0.24129651860744297 | |
| F1: 0.29281732961743095 | |
| Best Precision: 0.3781160016454134 | |
| Best Recall: 0.24766428310454616 | |
| Best F1: 0.29525231783958683 | |
| Epoch: 35 | |
| Precision: 0.3723121526938874 | |
| Recall: 0.24129651860744297 | |
| F1: 0.29281732961743095 | |
| Best Precision: 0.3781160016454134 | |
| Best Recall: 0.24766428310454616 | |
| Best F1: 0.29525231783958683 | |
| Epoch: 36 | |
| Average actions: 1.8067078590393066 | |
| Average target actions: 2.6675729751586914 | |
| Precision: 0.37099753694581283 | |
| Recall: 0.2515788924265358 | |
| F1: 0.29983515287238344 | |
| <<dialog policy>> epoch 36: saved network to mdl | |
| Best Precision: 0.3781160016454134 | |
| Best Recall: 0.2515788924265358 | |
| Best F1: 0.29983515287238344 | |
| Epoch: 37 | |
| Precision: 0.37099753694581283 | |
| Recall: 0.2515788924265358 | |
| F1: 0.29983515287238344 | |
| Best Precision: 0.3781160016454134 | |
| Best Recall: 0.2515788924265358 | |
| Best F1: 0.29983515287238344 | |
| Epoch: 38 | |
| Average actions: 1.7964909076690674 | |
| Average target actions: 2.6647462844848633 | |
| Precision: 0.36536823356307596 | |
| Recall: 0.2462550237486299 | |
| F1: 0.2942130207034173 | |
| Best Precision: 0.3781160016454134 | |
| Best Recall: 0.2515788924265358 | |
| Best F1: 0.29983515287238344 | |
| Epoch: 39 | |
| Precision: 0.36536823356307596 | |
| Recall: 0.2462550237486299 | |
| F1: 0.2942130207034173 | |
| Best Precision: 0.3781160016454134 | |
| Best Recall: 0.2515788924265358 | |
| Best F1: 0.29983515287238344 | |