jasonacox
/

jojo-124M

Text Generation

Eval Results (legacy)

text-generation-inference

Model card Files Files and versions

jasonacox commited on Jul 11, 2025

Commit

bde826d

·

verified ·

1 Parent(s): 1be10e0

Upload Jojo LLM model

Files changed (2) hide show

README.md +6 -6
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -9,7 +9,7 @@ tags:
 - jojo-llm
 - pytorch
 datasets:
-- tinystories
 metrics:
 - perplexity
 model-index:
@@ -19,8 +19,8 @@ model-index:
       type: text-generation
       name: Text Generation
     dataset:
-      type: tinystories
-      name: Tinystories
     metrics:
     - type: perplexity
       value: N/A
@@ -31,7 +31,7 @@ model-index:
 ## Model Description
-jasonacox/jojo-124M is a GPT-style language model trained using the Jojo LLM training framework. This model was fine-tuned on the tinystories dataset and is designed for text generation tasks.
 ## Model Details
@@ -46,14 +46,14 @@ jasonacox/jojo-124M is a GPT-style language model trained using the Jojo LLM tra
 - **Hidden Size**: 768
 - **Attention Heads**: 12
 - **Context Length**: 1024 tokens
-- **Vocabulary Size**: 50,304
 - **Total Parameters**: 219.6M
 ## Training Details
 ### Training Data
-The model was trained on the **tinystories** dataset.
 ### Training Procedure

 - jojo-llm
 - pytorch
 datasets:
+- TinyStoriesV2
 metrics:
 - perplexity
 model-index:
       type: text-generation
       name: Text Generation
     dataset:
+      type: TinyStoriesV2
+      name: Tinystoriesv2
     metrics:
     - type: perplexity
       value: N/A
 ## Model Description
+jasonacox/jojo-124M is a GPT-style language model trained using the Jojo LLM training framework. This model was fine-tuned on the TinyStoriesV2 dataset and is designed for text generation tasks.
 ## Model Details
 - **Hidden Size**: 768
 - **Attention Heads**: 12
 - **Context Length**: 1024 tokens
+- **Vocabulary Size**: 50,304 tokens
 - **Total Parameters**: 219.6M
 ## Training Details
 ### Training Data
+The model was trained on the **TinyStoriesV2** dataset.
 ### Training Procedure

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6756ae678821b3e1bfd37fb8957e5677a7080eeeb93db5db4d8978125e06d27e
 size 497918592

 version https://git-lfs.github.com/spec/v1
+oid sha256:97659e5ce53bd703510b76936433f5901477f0ed2b43163d573d6e5f40b45650
 size 497918592