jnjj committed
Commit 06dd723 · verified · 1 Parent(s): 7e3686e

Update README.md via script

Files changed (1): README.md (+7 -7)
README.md CHANGED
@@ -23,14 +23,14 @@ The fully merged model weights and tokenizer are updated periodically at the roo
  - **Dynamic Dataset Source:** The script iterates through a wide array of Hugging Face Hub datasets.
  - **Rapid Iteration Strategy:** Training per dataset configuration is brief (`max_steps=1`), prioritizing breadth of exposure over depth on any single dataset.
  ## Training Progress
- - **Datasets Processed (Successfully trained on at least one config):** 4
- - **Text Examples Streamed (Total):** 24
- - **Tokens Processed (Total):** 12288
- - **Last Successful Model Update:** 2025-05-08 15:40:28 UTC
+ - **Datasets Processed (Successfully trained on at least one config):** 5
+ - **Text Examples Streamed (Total):** 30
+ - **Tokens Processed (Total):** 15360
+ - **Last Successful Model Update:** 2025-05-08 15:42:00 UTC
  ### Evaluation Snapshot (Approximate)

- - **Current Perplexity (wikitext Subset):** 286.11
- - **Perplexity Change:** `-0.92` ⬇️ (vs previous cycle's perplexity)
+ - **Current Perplexity (wikitext Subset):** 284.82
+ - **Perplexity Change:** `-1.29` ⬇️ (vs previous cycle's perplexity)

  #### Generated Examples (Qualitative Assessment)
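The update script itself is not part of this commit, so the following is only a minimal sketch of the loop the two bullets above describe, assuming `transformers` and `datasets`; the model id, dataset list, and learning rate are placeholders, not the script's actual values. The counters in this diff are consistent with 6 examples of 512 tokens per dataset config (4 × 6 = 24 → 5 × 6 = 30 examples; 24 × 512 = 12288 → 30 × 512 = 15360 tokens), which is what the constants below assume.

```python
import itertools

import torch
from datasets import load_dataset
from torch.optim import AdamW
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "gpt2"  # placeholder; the repo's own checkpoint would be used instead
DATASET_CONFIGS = [  # placeholder (dataset, config) pairs
    ("wikitext", "wikitext-2-raw-v1"),
]
EXAMPLES_PER_CONFIG = 6  # consistent with the example counters above
MAX_LENGTH = 512         # consistent with the token counters above

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(MODEL_ID)
optimizer = AdamW(model.parameters(), lr=5e-5)  # assumed learning rate

model.train()
for name, config in DATASET_CONFIGS:
    # Stream so only the handful of examples needed is ever downloaded.
    stream = load_dataset(name, config, split="train", streaming=True)
    texts = [row["text"] for row in itertools.islice(stream, EXAMPLES_PER_CONFIG)
             if row.get("text")]
    if not texts:
        continue
    enc = tokenizer(texts, truncation=True, padding="max_length",
                    max_length=MAX_LENGTH, return_tensors="pt")
    labels = enc["input_ids"].clone()
    labels[enc["attention_mask"] == 0] = -100  # don't score padding
    loss = model(**enc, labels=labels).loss
    loss.backward()
    optimizer.step()  # the single update step per config (`max_steps=1`)
    optimizer.zero_grad()
```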
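For the perplexity snapshot above, the usual recipe is mean next-token cross-entropy over a fixed wikitext slice, exponentiated. Here is a sketch under that assumption; the slice size, context length, and model id are guesses, not the script's settings.

```python
import math

import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "gpt2"  # placeholder for the repo checkpoint

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID)
model.eval()

# A small, fixed slice keeps the number comparable across cycles.
rows = load_dataset("wikitext", "wikitext-2-raw-v1", split="test[:64]")
text = "\n\n".join(r for r in rows["text"] if r.strip())
enc = tokenizer(text, return_tensors="pt", truncation=True, max_length=1024)

with torch.no_grad():
    # Passing input_ids as labels makes the model compute the shifted
    # next-token cross-entropy internally.
    loss = model(**enc, labels=enc["input_ids"]).loss

print(f"perplexity ~= {math.exp(loss.item()):.2f}")
```

Because the evaluation slice is fixed, the cycle-over-cycle delta (`-1.29` here) is more informative than the noisy absolute value.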
 
@@ -41,7 +41,7 @@ The fully merged model weights and tokenizer are updated periodically at the roo
  | Creative Prompt | `Describe a friendly robot that love...` | `We are pleased to announce the launch of... ` |
  | Question Answering (Basic) | `What is the main color of a ripe ba...` | `As an example we've been using the same ... ` |
  | Code Generation (Simple Python) | `Write a Python function that takes ...` | `We are looking forward to seeing us in t... ` |
- | Reasoning (Simple) | `If a train leaves station A at 10:0...` | `The time of day we were trying to get ou... ` |
+ | Reasoning (Simple) | `If a train leaves station A at 10:0...` | `This is a big task force to get ready fo... ` |

  #### Standard Benchmarks (via `lighteval`)
  _Note: Running standard benchmarks requires a dedicated setup using the `lighteval` harness. The table below shows scores if available in `evaluation_stats.json`, otherwise `N/A`._
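The qualitative table is presumably produced by decoding a fixed prompt list and truncating both prompt and completion to a display width, which matches the truncated cells above. A sketch of that step; the prompt text and decoding parameters here are illustrative stand-ins, not the script's own.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "gpt2"  # placeholder for the repo checkpoint

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID)
model.eval()

prompt = "What is the main color of a ripe banana?"  # illustrative stand-in
inputs = tokenizer(prompt, return_tensors="pt")
with torch.no_grad():
    out = model.generate(
        **inputs,
        max_new_tokens=40,
        do_sample=False,  # greedy decoding keeps snapshots reproducible
        pad_token_id=tokenizer.eos_token_id,
    )
# Keep only the newly generated tokens, then truncate for the table cell.
completion = tokenizer.decode(out[0][inputs["input_ids"].shape[1]:],
                              skip_special_tokens=True)
print(f"| Question Answering (Basic) | `{prompt[:35]}...` | `{completion[:40]}... ` |")
```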
 
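Finally, the "scores if available, otherwise `N/A`" behaviour for the benchmarks table reduces to a lookup with a fallback. A sketch, assuming `evaluation_stats.json` holds a mapping of task names to scores; the key layout and task names are guesses (only the filename comes from the note above), and actually producing scores would go through the `lighteval` harness.

```python
import json
from pathlib import Path

stats_path = Path("evaluation_stats.json")  # filename from the README note
stats = json.loads(stats_path.read_text()) if stats_path.exists() else {}
benchmarks = stats.get("benchmarks", {})  # assumed key

# Illustrative task names only; real rows would use lighteval task ids.
for task in ("hellaswag", "arc_easy", "winogrande"):
    print(f"| {task} | {benchmarks.get(task, 'N/A')} |")
```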