Training in progress epoch 0

Browse files

Files changed (17) hide show

README.md +57 -0
config.json +23 -0
logs/train/events.out.tfevents.1664041052.d3bafb248f11.18.0.v2 +3 -0
logs/train/events.out.tfevents.1664041068.d3bafb248f11.profile-empty +3 -0
logs/train/plugins/profile/2022_09_24_17_37_48/d3bafb248f11.input_pipeline.pb +3 -0
logs/train/plugins/profile/2022_09_24_17_37_48/d3bafb248f11.kernel_stats.pb +3 -0
logs/train/plugins/profile/2022_09_24_17_37_48/d3bafb248f11.memory_profile.json.gz +3 -0
logs/train/plugins/profile/2022_09_24_17_37_48/d3bafb248f11.overview_page.pb +3 -0
logs/train/plugins/profile/2022_09_24_17_37_48/d3bafb248f11.tensorflow_stats.pb +3 -0
logs/train/plugins/profile/2022_09_24_17_37_48/d3bafb248f11.trace.json.gz +3 -0
logs/train/plugins/profile/2022_09_24_17_37_48/d3bafb248f11.xplane.pb +3 -0
logs/validation/events.out.tfevents.1664043273.d3bafb248f11.18.1.v2 +3 -0
special_tokens_map.json +7 -0
tf_model.h5 +3 -0
tokenizer.json +0 -0
tokenizer_config.json +14 -0
vocab.txt +0 -0

README.md ADDED Viewed

	@@ -0,0 +1,57 @@

+---
+license: apache-2.0
+tags:
+- generated_from_keras_callback
+model-index:
+- name: kevinbram/testarbara
+  results: []
+---
+<!-- This model card has been generated automatically according to the information Keras had access to. You should
+probably proofread and complete it, then remove this comment. -->
+# kevinbram/testarbara
+This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Train Loss: 1.4900
+- Train End Logits Accuracy: 0.6129
+- Train Start Logits Accuracy: 0.5735
+- Validation Loss: 1.1335
+- Validation End Logits Accuracy: 0.6908
+- Validation Start Logits Accuracy: 0.6545
+- Epoch: 0
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- optimizer: {'name': 'Adam', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 11064, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False}
+- training_precision: float32
+### Training results
+| Train Loss | Train End Logits Accuracy | Train Start Logits Accuracy | Validation Loss | Validation End Logits Accuracy | Validation Start Logits Accuracy | Epoch |
+|:----------:|:-------------------------:|:---------------------------:|:---------------:|:------------------------------:|:--------------------------------:|:-----:|
+| 1.4900     | 0.6129                    | 0.5735                      | 1.1335          | 0.6908                         | 0.6545                           | 0     |
+### Framework versions
+- Transformers 4.20.1
+- TensorFlow 2.6.4
+- Datasets 2.1.0
+- Tokenizers 0.12.1

config.json ADDED Viewed

	@@ -0,0 +1,23 @@

+{
+  "_name_or_path": "distilbert-base-uncased",
+  "activation": "gelu",
+  "architectures": [
+    "DistilBertForQuestionAnswering"
+  ],
+  "attention_dropout": 0.1,
+  "dim": 768,
+  "dropout": 0.1,
+  "hidden_dim": 3072,
+  "initializer_range": 0.02,
+  "max_position_embeddings": 512,
+  "model_type": "distilbert",
+  "n_heads": 12,
+  "n_layers": 6,
+  "pad_token_id": 0,
+  "qa_dropout": 0.1,
+  "seq_classif_dropout": 0.2,
+  "sinusoidal_pos_embds": false,
+  "tie_weights_": true,
+  "transformers_version": "4.20.1",
+  "vocab_size": 30522
+}

logs/train/events.out.tfevents.1664041052.d3bafb248f11.18.0.v2 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:01d1749ef2c7d14b3de3fceb5beaa9a14435edf4802e332278f9a3297548d77f
+size 1498022

logs/train/events.out.tfevents.1664041068.d3bafb248f11.profile-empty ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9632f92137db07347afcac16d989b04a5728a2c4b991e28d1d03af9f8b7b04c2
+size 40

logs/train/plugins/profile/2022_09_24_17_37_48/d3bafb248f11.input_pipeline.pb ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:97d1779a300861a4edaab9121771cca84cdc414f02ab4aac5f8b22f9547913c6
+size 2680

logs/train/plugins/profile/2022_09_24_17_37_48/d3bafb248f11.kernel_stats.pb ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6fb958f67afa60e77b683fa0ae0a598992ef05857874c40f9cd1b2b7e3388e95
+size 314505

logs/train/plugins/profile/2022_09_24_17_37_48/d3bafb248f11.memory_profile.json.gz ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5cd2b3ca4e0a72ebec63b9498bbe9b9f8e6164c17a441d88cba608f308d1546e
+size 35348

logs/train/plugins/profile/2022_09_24_17_37_48/d3bafb248f11.overview_page.pb ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c92c7763218b39c5000a31863d05ca5578a83ffe569cda9124d5c1512013e746
+size 5847

logs/train/plugins/profile/2022_09_24_17_37_48/d3bafb248f11.tensorflow_stats.pb ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:3f6264a851c7bfae28610c2f8df7cbbd9dd071f1cc876c15f91e8484c1f67b3e
+size 189737

logs/train/plugins/profile/2022_09_24_17_37_48/d3bafb248f11.trace.json.gz ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6bc6773d12eb4d710da59a48716dfc7864e281382fd7b50eaad4e63b521a4d54
+size 150606

logs/train/plugins/profile/2022_09_24_17_37_48/d3bafb248f11.xplane.pb ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5a14dfd73994bf72a168e7c80ef853527a32803fec0144f71c3bcd1156d284d7
+size 1153228

logs/validation/events.out.tfevents.1664043273.d3bafb248f11.18.1.v2 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f818bd76daa3ab4ea821040f47587c71d07438ed20f9703db665a7d2a6761ff5
+size 566

special_tokens_map.json ADDED Viewed

	@@ -0,0 +1,7 @@

+{
+  "cls_token": "[CLS]",
+  "mask_token": "[MASK]",
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "unk_token": "[UNK]"
+}

tf_model.h5 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9b9ee787195c78e53c05d810c0688b9255ee6d3804900622d6a8571f8d6726de
+size 265583592

tokenizer.json ADDED Viewed

The diff for this file is too large to render. See raw diff

tokenizer_config.json ADDED Viewed

	@@ -0,0 +1,14 @@

+{
+  "cls_token": "[CLS]",
+  "do_lower_case": true,
+  "mask_token": "[MASK]",
+  "model_max_length": 512,
+  "name_or_path": "distilbert-base-uncased",
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "special_tokens_map_file": null,
+  "strip_accents": null,
+  "tokenize_chinese_chars": true,
+  "tokenizer_class": "DistilBertTokenizer",
+  "unk_token": "[UNK]"
+}

vocab.txt ADDED Viewed

The diff for this file is too large to render. See raw diff