Erik committed on
Update README.md

README.md CHANGED
@@ -22,9 +22,9 @@ metrics:
 - accuracy
 ---
 
-# 💀
+# 💀 SkullLLM-125M
 
-**
+**SkullLLM-125M** is a lightweight, experimental multilingual language model fine-tuned from GPT-2. This project, part of the **SkullLLM** series, demonstrates that AI training is possible on highly constrained consumer hardware (3GB VRAM) using advanced optimization techniques.
 
 ### 🚀 Model Details
 - **Developed by:** Erik22TY
@@ -60,7 +60,7 @@ Nebulos was trained on a high-quality multilingual stream:
 - **Final Loss:** 4.0898
 
 ### ⚠️ Limitations & Behavior
-As a 125M parameter model trained for 500 steps,
+As a 125M parameter model trained for 500 steps, SkullLLM-125M is a **Proof of Concept**.
 - **Repetitions:** May occasionally loop phrases (e.g., "metic"). Use `repetition_penalty=1.5`.
 - **Language Blending:** Due to its size, it may mix Romance languages (Spanish/French/Portuguese) in complex responses.
 - **Coherence:** Best used for short-form explanations or creative experiments.
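The `repetition_penalty=1.5` recommendation above can be made concrete: the standard penalty (as used by `transformers` generation) rescales the logit of every token already present in the output, dividing positive logits by the penalty and multiplying negative ones, so seen tokens always become less likely when the penalty exceeds 1. A minimal sketch (the function name `apply_repetition_penalty` is illustrative, not part of the model's API):

```python
def apply_repetition_penalty(logits, generated_ids, penalty=1.5):
    """Down-weight tokens that already appear in the generated sequence.

    Positive logits are divided by `penalty`; negative logits are
    multiplied by it. For penalty > 1 both moves push the token's
    score down, discouraging loops like the "metic" repetition.
    """
    out = list(logits)
    for tok in set(generated_ids):
        out[tok] = out[tok] / penalty if out[tok] > 0 else out[tok] * penalty
    return out

# Tokens 0 and 2 were already generated, so their logits shrink;
# token 1 is untouched.
print(apply_repetition_penalty([2.0, 1.0, -1.0], [0, 2]))
```

With `penalty=1.5`, the logit `2.0` becomes `2.0 / 1.5 ≈ 1.33` and `-1.0` becomes `-1.5`, which is usually enough to break short loops without distorting fluent text too much.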