DunnBC22 commited on
Commit
3c450c2
·
1 Parent(s): 7960a45

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -8
README.md CHANGED
@@ -5,28 +5,31 @@ tags:
5
  model-index:
6
  - name: bert-base-uncased-Masked_Language_Modeling-Reddit_Comments
7
  results: []
 
 
 
 
8
  ---
9
 
10
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
11
- should probably proofread and complete it, then remove this comment. -->
12
-
13
  # bert-base-uncased-Masked_Language_Modeling-Reddit_Comments
14
 
15
- This model is a fine-tuned version of [bert-base-uncased](https://huggingface.co/bert-base-uncased) on the None dataset.
16
  It achieves the following results on the evaluation set:
17
  - Loss: 2.5415
18
 
19
  ## Model description
20
 
21
- More information needed
 
 
22
 
23
  ## Intended uses & limitations
24
 
25
- More information needed
26
 
27
  ## Training and evaluation data
28
 
29
- More information needed
30
 
31
  ## Training procedure
32
 
@@ -49,10 +52,11 @@ The following hyperparameters were used during training:
49
  | 2.6818 | 2.0 | 21624 | 2.5699 |
50
  | 2.6103 | 3.0 | 32436 | 2.5402 |
51
 
 
52
 
53
  ### Framework versions
54
 
55
  - Transformers 4.27.0
56
  - Pytorch 1.13.1+cu116
57
  - Datasets 2.10.1
58
- - Tokenizers 0.13.2
 
5
  model-index:
6
  - name: bert-base-uncased-Masked_Language_Modeling-Reddit_Comments
7
  results: []
8
+ language:
9
+ - en
10
+ metrics:
11
+ - perplexity
12
  ---
13
 
 
 
 
14
  # bert-base-uncased-Masked_Language_Modeling-Reddit_Comments
15
 
16
+ This model is a fine-tuned version of [bert-base-uncased](https://huggingface.co/bert-base-uncased).
17
  It achieves the following results on the evaluation set:
18
  - Loss: 2.5415
19
 
20
  ## Model description
21
 
22
+ This is a masked language modeling project.
23
+
24
+ For more information on how it was created, check out the following link: https://github.com/DunnBC22/NLP_Projects/blob/main/Masked%20Language%20Model/Datasets%20for%20NLP%20-%20Reddit%20Comments/Datasets_for_NLP_MLM.ipynb
25
 
26
  ## Intended uses & limitations
27
 
28
+ This model is intended to demonstrate my ability to solve a complex problem using technology.
29
 
30
  ## Training and evaluation data
31
 
32
+ Dataset Source: https://www.kaggle.com/datasets/toygarr/datasets-for-natural-language-processing
33
 
34
  ## Training procedure
35
 
 
52
  | 2.6818 | 2.0 | 21624 | 2.5699 |
53
  | 2.6103 | 3.0 | 32436 | 2.5402 |
54
 
55
+ Perplexity: 12.70
56
 
57
  ### Framework versions
58
 
59
  - Transformers 4.27.0
60
  - Pytorch 1.13.1+cu116
61
  - Datasets 2.10.1
62
+ - Tokenizers 0.13.2