Commit c22fa7e (verified), committed by estebancarlin
1 Parent(s): 1ea057b

BitMar 100M tokens (no memory) - Epoch 8 - 797,458,002 tokens processed

Files changed (3)
  1. README.md +6 -6
  2. pytorch_model.bin +1 -1
  3. training_metadata.json +4 -4
README.md CHANGED
@@ -20,9 +20,9 @@ This model was trained on exactly 100 million tokens as part of the BabyLM chall
 
  ## Training Details
  - Total tokens: 100,000,000
- - Epochs completed: 7
- - Tokens processed: 697,775,898
- - Cross-modal similarity: 0.3340
+ - Epochs completed: 8
+ - Tokens processed: 797,458,002
+ - Cross-modal similarity: 0.3342
  - Episodic memory: Disabled
 
  ## Model Architecture
@@ -40,6 +40,6 @@ tokenizer = AutoTokenizer.from_pretrained("estebancarlin/bitmar-no-memory")
 
 
  ## Training Status
- - **Status**: In Progress (Epoch 7)
- - **Tokens Processed**: 697,775,898
- - **Best Cross-modal Similarity**: 0.3340
+ - **Status**: In Progress (Epoch 8)
+ - **Tokens Processed**: 797,458,002
+ - **Best Cross-modal Similarity**: 0.3342
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:f7c0255243f07e53cb5ae469ff79780ff268d73271e3456d411686e5933b0c40
+ oid sha256:85d8b2614e03b5b9cf65e84a6e7cc5d4b0ad52e581f076a0c041c1f250fc3300
  size 85226595
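
Because `pytorch_model.bin` is stored as a Git LFS pointer, the diff above shows only a new `oid` with an unchanged `size`. A minimal sketch of checking a downloaded checkpoint against that pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the sketch assumes the pointer uses the `sha256` hash shown in this commit.

```python
import hashlib
import os

# Pointer text for the new checkpoint, copied from the diff above.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:85d8b2614e03b5b9cf65e84a6e7cc5d4b0ad52e581f076a0c041c1f250fc3300
size 85226595
"""


def parse_lfs_pointer(text):
    """Split each 'key value' line of a Git LFS pointer into a dict (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path, pointer_text):
    """Return True if the file at `path` has the size and SHA-256 named by the pointer."""
    fields = parse_lfs_pointer(pointer_text)
    algo, _, expected = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported hash algorithm: {algo}")
    # Cheap check first: a size mismatch means the hash cannot match.
    if os.path.getsize(path) != int(fields["size"]):
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in 1 MiB chunks so a large checkpoint is never fully in memory.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected
```

Calling `matches_pointer("pytorch_model.bin", POINTER)` on a local copy would then report whether that copy is the Epoch 8 checkpoint from this commit; the local path is an assumption.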
training_metadata.json CHANGED
@@ -1,9 +1,9 @@
  {
- "epoch": 6,
- "global_step": 116067,
- "tokens_processed": 697775898,
  "target_tokens": 100000000,
- "best_similarity": 0.33396559953689575,
+ "epoch": 7,
+ "global_step": 132648,
+ "tokens_processed": 797458002,
+ "target_tokens": 100000000,
+ "best_similarity": 0.33421099185943604,
  "training_config": {
  "model": {
  "vocab_size": 50257,
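
The change in `training_metadata.json` can be summarized as per-epoch deltas. The sketch below copies the before and after values from the diff above. The dictionary names and the `delta` helper are illustrative, not part of the repository.

```python
# Values copied from the removed (-) and added (+) lines of training_metadata.json.
before = {
    "epoch": 6,
    "global_step": 116067,
    "tokens_processed": 697775898,
    "best_similarity": 0.33396559953689575,
}
after = {
    "epoch": 7,
    "global_step": 132648,
    "tokens_processed": 797458002,
    "best_similarity": 0.33421099185943604,
}
TARGET_TOKENS = 100_000_000  # "target_tokens" is unchanged by this commit


def delta(old, new):
    """Return new[k] - old[k] for every key in old (hypothetical helper)."""
    return {key: new[key] - old[key] for key in old}


d = delta(before, after)
tokens_per_step = d["tokens_processed"] / d["global_step"]
passes_over_data = after["tokens_processed"] / TARGET_TOKENS

print(f"steps this epoch:  {d['global_step']}")        # 16581
print(f"tokens this epoch: {d['tokens_processed']}")   # 99682104
print(f"tokens per step:   {tokens_per_step:.1f}")
print(f"passes over the 100M-token set: {passes_over_data:.2f}")
print(f"similarity gain:   {d['best_similarity']:.6f}")
```

The epoch added 16,581 steps and 99,682,104 tokens, just under the 100,000,000-token target, and the best similarity rose by about 0.00025. The `epoch` field here reads 7 while the README reports 8 completed epochs, which would be consistent with a zero-based counter, though the diff does not confirm that.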