euhidaman commited on
Commit
f209f3a
·
verified ·
1 Parent(s): f5153ad

BitMar 100M tokens - Epoch 10 - 996,862,184 tokens processed

Browse files
Files changed (3) hide show
  1. README.md +6 -6
  2. pytorch_model.bin +1 -1
  3. training_metadata.json +4 -4
README.md CHANGED
@@ -19,9 +19,9 @@ This model was trained on exactly 100 million tokens as part of the BabyLM chall
19
 
20
  ## Training Details
21
  - Total tokens: 100,000,000
22
- - Epochs completed: 9
23
- - Tokens processed: 897,175,985
24
- - Cross-modal similarity: 0.4368
25
 
26
  ## Model Architecture
27
  - Text encoder: 4 layers, 128 hidden size
@@ -38,6 +38,6 @@ tokenizer = AutoTokenizer.from_pretrained("euhidaman/bitmar-attention-multimodal
38
 
39
 
40
  ## Training Status
41
- - **Status**: In Progress (Epoch 9)
42
- - **Tokens Processed**: 897,175,985
43
- - **Best Cross-modal Similarity**: 0.4368
 
19
 
20
  ## Training Details
21
  - Total tokens: 100,000,000
22
+ - Epochs completed: 10
23
+ - Tokens processed: 996,862,184
24
+ - Cross-modal similarity: 0.4552
25
 
26
  ## Model Architecture
27
  - Text encoder: 4 layers, 128 hidden size
 
38
 
39
 
40
  ## Training Status
41
+ - **Status**: In Progress (Epoch 10)
42
+ - **Tokens Processed**: 996,862,184
43
+ - **Best Cross-modal Similarity**: 0.4552
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b3ca5993fbf9f2896ae5f6f9d90fbbdc635f7161c756da042fe21d52786c3b2e
3
  size 86128991
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d05b530305b4708c0680d672e825ad410d74cd906b96385d8d6e9ec67e4b15eb
3
  size 86128991
training_metadata.json CHANGED
@@ -1,9 +1,9 @@
1
  {
2
- "epoch": 8,
3
- "global_step": 895482,
4
- "tokens_processed": 897175985,
5
  "target_tokens": 100000000,
6
- "best_similarity": 0.4368094205856323,
7
  "training_config": {
8
  "model": {
9
  "vocab_size": 50257,
 
1
  {
2
+ "epoch": 9,
3
+ "global_step": 994980,
4
+ "tokens_processed": 996862184,
5
  "target_tokens": 100000000,
6
+ "best_similarity": 0.4551965296268463,
7
  "training_config": {
8
  "model": {
9
  "vocab_size": 50257,