jnjj commited on
Commit
5587e8e
·
verified ·
1 Parent(s): 83c0ebd

Update training progress and artifacts

Browse files
Files changed (2) hide show
  1. README.md +5 -5
  2. model.safetensors +1 -1
README.md CHANGED
@@ -19,13 +19,13 @@ The model files (merged weights and tokenizer) are stored at the root of this re
19
  - **Diverse Datasets:** The script iterates through datasets available on the Hugging Face Hub, attempting to train on each.
20
  - **Short Training Iterations:** Each training run per dataset configuration is currently set to a small number of steps (`max_steps=1`) to allow for rapid iteration across many datasets.
21
  ## Training Progress
22
- - **Datasets Processed (Successfully trained on at least one config):** 8
23
- - **Text Examples Streamed (Total):** 48
24
- - **Tokens Processed (Total):** 24576
25
- - **Last Successful Model Update:** 2025-05-07 14:27:09 UTC
26
  ### Evaluation Metrics
27
 
28
- - **Overall Perplexity (on a small fixed dataset):** 143114.84
29
 
30
  #### Generated Examples (Qualitative Assessment)
31
 
 
19
  - **Diverse Datasets:** The script iterates through datasets available on the Hugging Face Hub, attempting to train on each.
20
  - **Short Training Iterations:** Each training run per dataset configuration is currently set to a small number of steps (`max_steps=1`) to allow for rapid iteration across many datasets.
21
  ## Training Progress
22
+ - **Datasets Processed (Successfully trained on at least one config):** 9
23
+ - **Text Examples Streamed (Total):** 54
24
+ - **Tokens Processed (Total):** 27648
25
+ - **Last Successful Model Update:** 2025-05-07 14:28:37 UTC
26
  ### Evaluation Metrics
27
 
28
+ - **Overall Perplexity (on a small fixed dataset):** 143030.67
29
 
30
  #### Generated Examples (Qualitative Assessment)
31
 
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:39cb03b5008417eff14aa3cb8196faf959572a7ab871ce654172bd059ee0193d
3
  size 80000008
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e635dfb0e375a4055d47994b1bcd0f76dc8bc7c8f4c7ae490bc79f0ae9b283eb
3
  size 80000008