ArliAI
/

Mistral-Nemo-12B-ArliAI-RPMax-v1.2

Model card Files Files and versions

OwenArli commited on Oct 12, 2024

Commit

1c9d0e5

·

verified ·

1 Parent(s): 6275a12

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -4,6 +4,8 @@ license: apache-2.0
 # ArliAI-RPMax-12B-v1.2
 =====================================
 ## RPMax Series Overview
 | [2B](https://huggingface.co/ArliAI/Gemma-2-2B-ArliAI-RPMax-v1.1) |
@@ -38,7 +40,7 @@ v1.2 update completely removes non-creative/RP examples in the dataset and is al
 * **Sequence Length**: 8192
 * **Training Duration**: Approximately 2 days on 2x3090Ti
 * **Epochs**: 1 epoch training for minimized repetition sickness
-* **QLORA**: 64-rank 128-alpha, resulting in ~2% trainable weights
 * **Learning Rate**: 0.00001
 * **Gradient accumulation**: Very low 32 for better learning.

 # ArliAI-RPMax-12B-v1.2
 =====================================
+## UPDATE: Merged it wrongly to base after LORA training. Working to reupload. The 8B version meanwhile is working fine.
 ## RPMax Series Overview
 | [2B](https://huggingface.co/ArliAI/Gemma-2-2B-ArliAI-RPMax-v1.1) |
 * **Sequence Length**: 8192
 * **Training Duration**: Approximately 2 days on 2x3090Ti
 * **Epochs**: 1 epoch training for minimized repetition sickness
+* **LORA**: 64-rank 128-alpha, resulting in ~2% trainable weights
 * **Learning Rate**: 0.00001
 * **Gradient accumulation**: Very low 32 for better learning.