tomhodemon commited on
Commit
4633b94
·
1 Parent(s): 630a3b6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -13
README.md CHANGED
@@ -1,18 +1,19 @@
1
- # Model Card for t5-small-wikitext
2
- ## Model Description
3
 
4
- # Training Details
5
 
6
- ## Training Data
7
- ## Training Procedure
8
- ### Preprocessing
9
- ### Speeds, Sizes, Times
10
 
11
  ---
12
- license: apache-2.0
13
- datasets:
14
- - wikitext
15
- language:
16
- - en
17
- pipeline_tag: text2text-generation
 
 
18
  ---
 
1
+ # t5-small-wikitext
 
2
 
3
+ t5-small trained on [wikitext/wikitest-103-raw-v1](wikitext/wikitest-103-raw-v1) over 50k steps (around 2 hours of training) following [T5 paper](https://arxiv.org/pdf/1910.10683.pdf) training procedure.
4
 
5
+ * batch_size: 32
6
+ * max_seq_length: 128
7
+ * optim: Adafactor
8
+ * sheduler: inverse square root (10k warm-up steps)
9
 
10
  ---
11
+ - license:
12
+ apache-2.0
13
+ - datasets:
14
+ wikitext
15
+ - language:
16
+ en
17
+ - pipeline_tag:
18
+ text2text-generation
19
  ---