Sweaterdog/Andy-4-base-DEPRECATED

Sweaterdog committed on Apr 14, 2025

Commit 5107cc9 (verified), 1 parent: 866cdd7

Update README.md

Files changed (1)

README.md +2 -2

@@ -20,9 +20,9 @@ This AI is crafted to push the boundaries of gameplay, reasoning, and multi-lang
 # Overview
-Andy-4-base is an 8B parameter model built on the efficient Llama3.1 8B DeepSeek-R1 distill architecture.
-It has been meticulously trained over three weeks on a single RTX 3090 using two carefully curated datasets.
+Andy-4-base is an 8B parameter model tuned from Llama3.1 8B.
+It has been trained over three weeks on a single RTX 3090 using two carefully curated datasets.
 The model underwent 2 epochs on the first dataset with a higher learning rate and 4 epochs on the second dataset with a much lower learning rate, ensuring a balanced and robust learning process.
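
For readers who want the two-stage schedule from the Overview in concrete form, here is a minimal sketch. Only the epoch counts (2, then 4) and the ordering of the learning rates (higher, then much lower) come from the README. The dataset labels, the numeric learning rates, and the `train_one_epoch` callback are hypothetical placeholders, not values or code from the author.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Stage:
    dataset: str          # hypothetical label; the README does not name the datasets
    epochs: int           # epoch count stated in the README
    learning_rate: float  # placeholder value; the README gives no numbers


# Assumption: the learning rates below are illustrative only. The README says
# just that stage 1 uses a higher rate and stage 2 a much lower one.
SCHEDULE: List[Stage] = [
    Stage(dataset="dataset_1", epochs=2, learning_rate=2e-4),
    Stage(dataset="dataset_2", epochs=4, learning_rate=2e-5),
]


def total_epochs(schedule: List[Stage]) -> int:
    """Sum the epochs across all stages."""
    return sum(stage.epochs for stage in schedule)


def run(
    schedule: List[Stage],
    train_one_epoch: Callable[[str, float], None],
) -> List[Tuple[str, int, float]]:
    """Run the stages in order, calling the (hypothetical) training callback
    once per epoch, and return a log of (dataset, epoch, learning_rate)."""
    log = []
    for stage in schedule:
        for epoch in range(1, stage.epochs + 1):
            train_one_epoch(stage.dataset, stage.learning_rate)
            log.append((stage.dataset, epoch, stage.learning_rate))
    return log
```

Passing a no-op callback such as `lambda dataset, lr: None` to `run` walks the schedule without training anything, which is enough to check the stage ordering.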