pradeep4321/valve_model

@@ -14,8 +14,8 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 3.1831
-- Validation Loss: 5.9072
 - Epoch: 99
 ## Model description
@@ -35,113 +35,113 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'class_name': 'WarmUp', 'config': {'initial_learning_rate': 5e-05, 'decay_schedule_fn': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 5e-05, 'decay_steps': -999, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}, '__passive_serialization__': True}, 'warmup_steps': 1000, 'power': 1.0, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.01}
 - training_precision: mixed_float16
 ### Training results
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
-| 6.4756     | 6.5449          | 0     |
-| 6.5200     | 6.5443          | 1     |
-| 6.6058     | 6.5432          | 2     |
-| 6.5443     | 6.5416          | 3     |
-| 6.6271     | 6.5395          | 4     |
-| 6.5184     | 6.5369          | 5     |
-| 6.4858     | 6.5337          | 6     |
-| 6.4394     | 6.5301          | 7     |
-| 6.5547     | 6.5259          | 8     |
-| 6.4873     | 6.5211          | 9     |
-| 6.4768     | 6.5160          | 10    |
-| 6.5326     | 6.5102          | 11    |
-| 6.4251     | 6.5041          | 12    |
-| 6.4698     | 6.4974          | 13    |
-| 6.4125     | 6.4903          | 14    |
-| 6.3133     | 6.4827          | 15    |
-| 6.3633     | 6.4748          | 16    |
-| 6.3025     | 6.4665          | 17    |
-| 6.3038     | 6.4579          | 18    |
-| 6.2443     | 6.4490          | 19    |
-| 6.2891     | 6.4398          | 20    |
-| 6.1881     | 6.4304          | 21    |
-| 6.0868     | 6.4208          | 22    |
-| 6.1109     | 6.4110          | 23    |
-| 6.1498     | 6.4010          | 24    |
-| 5.9289     | 6.3908          | 25    |
-| 6.0533     | 6.3804          | 26    |
-| 6.0183     | 6.3699          | 27    |
-| 5.8904     | 6.3593          | 28    |
-| 5.8638     | 6.3486          | 29    |
-| 5.8532     | 6.3379          | 30    |
-| 5.8279     | 6.3273          | 31    |
-| 5.7185     | 6.3167          | 32    |
-| 5.7309     | 6.3062          | 33    |
-| 5.6457     | 6.2958          | 34    |
-| 5.5979     | 6.2855          | 35    |
-| 5.5968     | 6.2750          | 36    |
-| 5.6015     | 6.2646          | 37    |
-| 5.4416     | 6.2542          | 38    |
-| 5.5024     | 6.2441          | 39    |
-| 5.4739     | 6.2342          | 40    |
-| 5.3524     | 6.2245          | 41    |
-| 5.3214     | 6.2149          | 42    |
-| 5.2997     | 6.2052          | 43    |
-| 5.2619     | 6.1955          | 44    |
-| 5.2155     | 6.1859          | 45    |
-| 5.2030     | 6.1765          | 46    |
-| 5.1632     | 6.1672          | 47    |
-| 5.1386     | 6.1581          | 48    |
-| 5.0821     | 6.1492          | 49    |
-| 5.0143     | 6.1406          | 50    |
-| 5.0254     | 6.1318          | 51    |
-| 4.9244     | 6.1235          | 52    |
-| 4.8945     | 6.1151          | 53    |
-| 4.9138     | 6.1066          | 54    |
-| 4.8516     | 6.0985          | 55    |
-| 4.8212     | 6.0899          | 56    |
-| 4.6900     | 6.0817          | 57    |
-| 4.7051     | 6.0738          | 58    |
-| 4.6742     | 6.0657          | 59    |
-| 4.6304     | 6.0577          | 60    |
-| 4.5756     | 6.0498          | 61    |
-| 4.5728     | 6.0420          | 62    |
-| 4.5041     | 6.0341          | 63    |
-| 4.5266     | 6.0264          | 64    |
-| 4.4364     | 6.0193          | 65    |
-| 4.3653     | 6.0125          | 66    |
-| 4.3792     | 6.0054          | 67    |
-| 4.3870     | 5.9979          | 68    |
-| 4.2865     | 5.9911          | 69    |
-| 4.2443     | 5.9845          | 70    |
-| 4.2388     | 5.9783          | 71    |
-| 4.2063     | 5.9723          | 72    |
-| 4.1534     | 5.9667          | 73    |
-| 4.0811     | 5.9616          | 74    |
-| 4.1064     | 5.9565          | 75    |
-| 4.1281     | 5.9512          | 76    |
-| 4.0124     | 5.9467          | 77    |
-| 3.9703     | 5.9430          | 78    |
-| 3.8858     | 5.9389          | 79    |
-| 3.9194     | 5.9351          | 80    |
-| 3.8257     | 5.9309          | 81    |
-| 3.8251     | 5.9270          | 82    |
-| 3.8499     | 5.9234          | 83    |
-| 3.7903     | 5.9206          | 84    |
-| 3.7851     | 5.9190          | 85    |
-| 3.7319     | 5.9174          | 86    |
-| 3.6612     | 5.9169          | 87    |
-| 3.6404     | 5.9162          | 88    |
-| 3.5339     | 5.9162          | 89    |
-| 3.5073     | 5.9158          | 90    |
-| 3.4569     | 5.9157          | 91    |
-| 3.5231     | 5.9153          | 92    |
-| 3.3952     | 5.9152          | 93    |
-| 3.3774     | 5.9144          | 94    |
-| 3.3776     | 5.9126          | 95    |
-| 3.2881     | 5.9112          | 96    |
-| 3.2130     | 5.9099          | 97    |
-| 3.2514     | 5.9088          | 98    |
-| 3.1831     | 5.9072          | 99    |
 ### Framework versions

 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 0.4860
+- Validation Loss: 6.0810
 - Epoch: 99
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'class_name': 'WarmUp', 'config': {'initial_learning_rate': 2e-05, 'decay_schedule_fn': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 800, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}, '__passive_serialization__': True}, 'warmup_steps': 200, 'power': 1.0, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.01}
 - training_precision: mixed_float16
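
The new optimizer entry nests a `PolynomialDecay` schedule inside a `WarmUp` wrapper, which is hard to read from the serialized dict. The sketch below is a pure-Python approximation of how that pair usually behaves, using the values from the added line (`initial_learning_rate` 2e-05, `warmup_steps` 200, `decay_steps` 800, `end_learning_rate` 0.0, both `power` values 1.0, `cycle` False). The function name `warmup_polynomial_lr` is hypothetical, and the assumption that the decay schedule counts steps from the end of warm-up is mine, not something stated in this commit.

```python
def warmup_polynomial_lr(step, initial_lr=2e-05, warmup_steps=200,
                         decay_steps=800, end_lr=0.0, power=1.0):
    """Approximate learning rate at a given optimizer step (illustrative sketch)."""
    if step < warmup_steps:
        # Warm-up phase: ramp from 0 towards initial_lr.
        return initial_lr * (step / warmup_steps) ** power
    # Decay phase (assumption): the decay schedule sees steps counted from the
    # end of warm-up, clamped at decay_steps because cycle is False.
    decay_step = min(step - warmup_steps, decay_steps)
    remaining = 1.0 - decay_step / decay_steps
    return (initial_lr - end_lr) * remaining ** power + end_lr


for step in (0, 100, 200, 600, 1000, 5000):
    print(step, warmup_polynomial_lr(step))
```

Under these assumptions the rate peaks at step 200 and reaches zero at step 1000 (200 warm-up steps plus 800 decay steps), then stays at zero for any later step.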
 ### Training results
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
+| 3.1291     | 5.9072          | 0     |
+| 3.1205     | 5.9071          | 1     |
+| 3.0615     | 5.9070          | 2     |
+| 3.1662     | 5.9069          | 3     |
+| 3.1011     | 5.9068          | 4     |
+| 3.1374     | 5.9066          | 5     |
+| 3.1472     | 5.9065          | 6     |
+| 3.0926     | 5.9066          | 7     |
+| 3.1436     | 5.9065          | 8     |
+| 3.1321     | 5.9065          | 9     |
+| 3.1027     | 5.9065          | 10    |
+| 2.9848     | 5.9068          | 11    |
+| 2.9544     | 5.9069          | 12    |
+| 3.0212     | 5.9066          | 13    |
+| 3.0448     | 5.9066          | 14    |
+| 3.0455     | 5.9063          | 15    |
+| 3.0294     | 5.9063          | 16    |
+| 2.9529     | 5.9058          | 17    |
+| 2.8377     | 5.9054          | 18    |
+| 2.8682     | 5.9054          | 19    |
+| 2.9745     | 5.9050          | 20    |
+| 2.9680     | 5.9049          | 21    |
+| 2.9270     | 5.9046          | 22    |
+| 2.8955     | 5.9039          | 23    |
+| 2.9627     | 5.9031          | 24    |
+| 2.8304     | 5.9020          | 25    |
+| 2.8542     | 5.9009          | 26    |
+| 2.8008     | 5.8999          | 27    |
+| 2.8067     | 5.8992          | 28    |
+| 2.7471     | 5.8987          | 29    |
+| 2.7494     | 5.8983          | 30    |
+| 2.7467     | 5.8990          | 31    |
+| 2.6482     | 5.9001          | 32    |
+| 2.7226     | 5.9006          | 33    |
+| 2.6202     | 5.9003          | 34    |
+| 2.6576     | 5.9005          | 35    |
+| 2.6144     | 5.9010          | 36    |
+| 2.6040     | 5.9015          | 37    |
+| 2.4523     | 5.9022          | 38    |
+| 2.4589     | 5.9023          | 39    |
+| 2.4796     | 5.9028          | 40    |
+| 2.4962     | 5.9027          | 41    |
+| 2.4251     | 5.9029          | 42    |
+| 2.3685     | 5.9031          | 43    |
+| 2.3015     | 5.9034          | 44    |
+| 2.3080     | 5.9035          | 45    |
+| 2.2066     | 5.9039          | 46    |
+| 2.1621     | 5.9061          | 47    |
+| 2.1354     | 5.9088          | 48    |
+| 2.1527     | 5.9112          | 49    |
+| 2.1650     | 5.9115          | 50    |
+| 2.1298     | 5.9117          | 51    |
+| 2.0993     | 5.9106          | 52    |
+| 2.0044     | 5.9099          | 53    |
+| 1.9764     | 5.9102          | 54    |
+| 1.9662     | 5.9116          | 55    |
+| 1.9702     | 5.9145          | 56    |
+| 1.9012     | 5.9152          | 57    |
+| 1.8061     | 5.9175          | 58    |
+| 1.7831     | 5.9211          | 59    |
+| 1.8015     | 5.9253          | 60    |
+| 1.7642     | 5.9298          | 61    |
+| 1.7484     | 5.9328          | 62    |
+| 1.5452     | 5.9342          | 63    |
+| 1.5996     | 5.9369          | 64    |
+| 1.4831     | 5.9396          | 65    |
+| 1.4367     | 5.9421          | 66    |
+| 1.4981     | 5.9435          | 67    |
+| 1.4513     | 5.9475          | 68    |
+| 1.3897     | 5.9532          | 69    |
+| 1.3108     | 5.9603          | 70    |
+| 1.3337     | 5.9664          | 71    |
+| 1.2564     | 5.9728          | 72    |
+| 1.2671     | 5.9770          | 73    |
+| 1.1286     | 5.9814          | 74    |
+| 1.1349     | 5.9843          | 75    |
+| 1.1645     | 5.9842          | 76    |
+| 1.1462     | 5.9806          | 77    |
+| 1.1028     | 5.9791          | 78    |
+| 0.9843     | 5.9770          | 79    |
+| 0.9734     | 5.9768          | 80    |
+| 0.9831     | 5.9795          | 81    |
+| 1.0021     | 5.9823          | 82    |
+| 0.8903     | 5.9826          | 83    |
+| 0.8244     | 5.9837          | 84    |
+| 0.8597     | 5.9863          | 85    |
+| 0.8703     | 5.9907          | 86    |
+| 0.7864     | 5.9996          | 87    |
+| 0.7394     | 6.0086          | 88    |
+| 0.6764     | 6.0188          | 89    |
+| 0.7007     | 6.0278          | 90    |
+| 0.6247     | 6.0355          | 91    |
+| 0.6640     | 6.0430          | 92    |
+| 0.6407     | 6.0498          | 93    |
+| 0.5903     | 6.0565          | 94    |
+| 0.6226     | 6.0614          | 95    |
+| 0.5934     | 6.0662          | 96    |
+| 0.5140     | 6.0713          | 97    |
+| 0.5300     | 6.0766          | 98    |
+| 0.4860     | 6.0810          | 99    |
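
In the added table the train loss falls from 3.1291 to 0.4860 while the validation loss ends higher than it started, a pattern that may indicate overfitting. A small script can make that visible. The sketch below uses a subset of rows copied from the table above, and the perplexity line assumes the reported losses are mean per-token cross-entropy in nats, which the model card does not state.

```python
import math

# (epoch, train_loss, validation_loss) rows copied from the added table;
# only a subset is listed to keep the sketch short.
rows = [
    (0, 3.1291, 5.9072),
    (15, 3.0455, 5.9063),
    (30, 2.7494, 5.8983),
    (45, 2.3080, 5.9035),
    (60, 1.8015, 5.9253),
    (75, 1.1349, 5.9843),
    (90, 0.7007, 6.0278),
    (99, 0.4860, 6.0810),
]

# Row with the lowest validation loss among the sampled epochs.
best_epoch, _, best_val = min(rows, key=lambda r: r[2])
last_epoch, last_train, last_val = rows[-1]

print(f"lowest validation loss in sample: {best_val} at epoch {best_epoch}")
print(f"final gap (validation - train): {last_val - last_train:.4f}")
# Assumption: loss is mean per-token cross-entropy in nats, so exp(loss) is perplexity.
print(f"validation perplexity, best vs final: "
      f"{math.exp(best_val):.0f} vs {math.exp(last_val):.0f}")
```

Running the same check over every row of the table, or keeping the checkpoint with the lowest validation loss, would be a natural extension; neither is something this commit does.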
 ### Framework versions

tf_model.h5 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6311b4e3a92717f413ce150c3791712ecebed73ff280364d1b10eaf8e2f7226a
 size 345588840

 version https://git-lfs.github.com/spec/v1
+oid sha256:3f6868ac2c89d7bbe7fdccfd267a5f4429b7e93801b24fd0499ee4b5fb912ede
 size 345588840