| --- |
| license: apache-2.0 |
| base_model: google/flan-t5-base |
| tags: |
| - generated_from_keras_callback |
| model-index: |
| - name: kaytoo2022/t5_technical_qa_with_react_2 |
| results: [] |
| inference: true |
| library_name: transformers |
| pipeline_tag: text2text-generation |
| --- |
| |
| <!-- This model card has been generated automatically according to the information Keras had access to. You should |
| probably proofread and complete it, then remove this comment. --> |
|
|
| # kaytoo2022/t5_technical_qa_with_react_2 |
| |
| This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on an unknown dataset. |
| It achieves the following results on the evaluation set: |
| - Train Loss: 1.5754 |
| - Validation Loss: 1.8786 |
| - Epoch: 9 |
| |
| ## Model description |
| |
| More information needed |
| |
| ## Intended uses & limitations |
| |
| More information needed |
| |
| ## Training and evaluation data |
| |
| More information needed |
| |
| ## Training procedure |
| |
| ### Training hyperparameters |
| |
| The following hyperparameters were used during training: |
| - optimizer: {'name': 'AdamWeightDecay', 'learning_rate': 2e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, 'weight_decay_rate': 0.01} |
| - training_precision: float32 |
| |
| ### Training results |
| |
| | Train Loss | Validation Loss | Epoch | |
| |:----------:|:---------------:|:-----:| |
| | 2.5639 | 2.2487 | 0 | |
| | 2.2752 | 2.1552 | 1 | |
| | 2.1253 | 2.0893 | 2 | |
| | 2.0155 | 2.0500 | 3 | |
| | 1.9230 | 2.0040 | 4 | |
| | 1.8434 | 1.9731 | 5 | |
| | 1.7722 | 1.9408 | 6 | |
| | 1.7102 | 1.9300 | 7 | |
| | 1.6478 | 1.8975 | 8 | |
| | 1.5754 | 1.8786 | 9 | |
| |
| |
| ### Framework versions |
| |
| - Transformers 4.42.4 |
| - TensorFlow 2.17.0 |
| - Datasets 2.20.0 |
| - Tokenizers 0.19.1 |
| |