jnjj commited on
Commit
83c0ebd
·
verified ·
1 Parent(s): 5092f60

Update training progress and artifacts

Browse files
Files changed (2) hide show
  1. README.md +5 -5
  2. model.safetensors +1 -1
README.md CHANGED
@@ -19,13 +19,13 @@ The model files (merged weights and tokenizer) are stored at the root of this re
19
  - **Diverse Datasets:** The script iterates through datasets available on the Hugging Face Hub, attempting to train on each.
20
  - **Short Training Iterations:** Each training run per dataset configuration is currently set to a small number of steps (`max_steps=1`) to allow for rapid iteration across many datasets.
21
  ## Training Progress
22
- - **Datasets Processed (Successfully trained on at least one config):** 6
23
- - **Text Examples Streamed (Total):** 36
24
- - **Tokens Processed (Total):** 18432
25
- - **Last Successful Model Update:** 2025-05-07 14:25:07 UTC
26
  ### Evaluation Metrics
27
 
28
- - **Overall Perplexity (on a small fixed dataset):** 143214.66
29
 
30
  #### Generated Examples (Qualitative Assessment)
31
 
 
19
  - **Diverse Datasets:** The script iterates through datasets available on the Hugging Face Hub, attempting to train on each.
20
  - **Short Training Iterations:** Each training run per dataset configuration is currently set to a small number of steps (`max_steps=1`) to allow for rapid iteration across many datasets.
21
  ## Training Progress
22
+ - **Datasets Processed (Successfully trained on at least one config):** 8
23
+ - **Text Examples Streamed (Total):** 48
24
+ - **Tokens Processed (Total):** 24576
25
+ - **Last Successful Model Update:** 2025-05-07 14:27:09 UTC
26
  ### Evaluation Metrics
27
 
28
+ - **Overall Perplexity (on a small fixed dataset):** 143114.84
29
 
30
  #### Generated Examples (Qualitative Assessment)
31
 
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:998802bb2719e88839e658c6669416e9c9b2aef47b4ea62c81fc799f97d49026
3
  size 80000008
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:39cb03b5008417eff14aa3cb8196faf959572a7ab871ce654172bd059ee0193d
3
  size 80000008