ayshi commited on
Commit
1cc6e28
·
1 Parent(s): 30a7e13

Training in progress epoch 0

Browse files
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
  license: apache-2.0
3
- base_model: distilbert-base-multilingual-cased
4
  tags:
5
  - generated_from_keras_callback
6
  model-index:
@@ -13,12 +13,12 @@ probably proofread and complete it, then remove this comment. -->
13
 
14
  # ayshi/basic_distil
15
 
16
- This model is a fine-tuned version of [distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Train Loss: 0.3221
19
- - Validation Loss: 0.8711
20
- - Train Accuracy: 0.7511
21
- - Epoch: 9
22
 
23
  ## Model description
24
 
@@ -44,16 +44,7 @@ The following hyperparameters were used during training:
44
 
45
  | Train Loss | Validation Loss | Train Accuracy | Epoch |
46
  |:----------:|:---------------:|:--------------:|:-----:|
47
- | 1.3876 | 1.1603 | 0.6667 | 0 |
48
- | 1.1107 | 1.0802 | 0.6667 | 1 |
49
- | 0.9684 | 0.9907 | 0.6933 | 2 |
50
- | 0.8068 | 0.9187 | 0.7244 | 3 |
51
- | 0.6642 | 0.8437 | 0.7556 | 4 |
52
- | 0.5446 | 0.8595 | 0.7644 | 5 |
53
- | 0.4424 | 0.8628 | 0.76 | 6 |
54
- | 0.3908 | 0.8741 | 0.76 | 7 |
55
- | 0.3276 | 0.8693 | 0.76 | 8 |
56
- | 0.3221 | 0.8711 | 0.7511 | 9 |
57
 
58
 
59
  ### Framework versions
 
1
  ---
2
  license: apache-2.0
3
+ base_model: ayshi/basic_distil
4
  tags:
5
  - generated_from_keras_callback
6
  model-index:
 
13
 
14
  # ayshi/basic_distil
15
 
16
+ This model is a fine-tuned version of [ayshi/basic_distil](https://huggingface.co/ayshi/basic_distil) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Train Loss: 0.3419
19
+ - Validation Loss: 0.9411
20
+ - Train Accuracy: 0.7556
21
+ - Epoch: 0
22
 
23
  ## Model description
24
 
 
44
 
45
  | Train Loss | Validation Loss | Train Accuracy | Epoch |
46
  |:----------:|:---------------:|:--------------:|:-----:|
47
+ | 0.3419 | 0.9411 | 0.7556 | 0 |
 
 
 
 
 
 
 
 
 
48
 
49
 
50
  ### Framework versions
config.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "_name_or_path": "distilbert-base-multilingual-cased",
3
  "activation": "gelu",
4
  "architectures": [
5
  "DistilBertForSequenceClassification"
 
1
  {
2
+ "_name_or_path": "ayshi/basic_distil",
3
  "activation": "gelu",
4
  "architectures": [
5
  "DistilBertForSequenceClassification"
special_tokens_map.json CHANGED
@@ -1,4 +1,11 @@
1
  {
 
 
 
 
 
 
 
2
  "cls_token": "[CLS]",
3
  "mask_token": "[MASK]",
4
  "pad_token": "[PAD]",
 
1
  {
2
+ "additional_special_tokens": [
3
+ "[PAD]",
4
+ "[UNK]",
5
+ "[CLS]",
6
+ "[SEP]",
7
+ "[MASK]"
8
+ ],
9
  "cls_token": "[CLS]",
10
  "mask_token": "[MASK]",
11
  "pad_token": "[PAD]",
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:db2205cda38961df29041306b74f3058608f551a62bbb5c17b54dfd16ae9ab88
3
- size 541452232
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ec5f0e6cd04299ffdc7164a2413da47492990753d8a0f2c81419b9378f55cf3f
3
+ size 541448896
tokenizer.json CHANGED
@@ -1,6 +1,11 @@
1
  {
2
  "version": "1.0",
3
- "truncation": null,
 
 
 
 
 
4
  "padding": null,
5
  "added_tokens": [
6
  {
 
1
  {
2
  "version": "1.0",
3
+ "truncation": {
4
+ "direction": "Right",
5
+ "max_length": 512,
6
+ "strategy": "LongestFirst",
7
+ "stride": 0
8
+ },
9
  "padding": null,
10
  "added_tokens": [
11
  {
tokenizer_config.json CHANGED
@@ -41,7 +41,13 @@
41
  "special": true
42
  }
43
  },
44
- "additional_special_tokens": [],
 
 
 
 
 
 
45
  "clean_up_tokenization_spaces": true,
46
  "cls_token": "[CLS]",
47
  "do_lower_case": false,
 
41
  "special": true
42
  }
43
  },
44
+ "additional_special_tokens": [
45
+ "[PAD]",
46
+ "[UNK]",
47
+ "[CLS]",
48
+ "[SEP]",
49
+ "[MASK]"
50
+ ],
51
  "clean_up_tokenization_spaces": true,
52
  "cls_token": "[CLS]",
53
  "do_lower_case": false,