AnonSOB
/

crysis-mlx

4-bit precision

Model card Files Files and versions

AnonSOB commited on Apr 20, 2025

Commit

0714121

·

verified ·

1 Parent(s): 671fda8

Update README.md

Files changed (1) hide show

README.md +16 -1

README.md CHANGED Viewed

@@ -9,16 +9,31 @@ base_model:
 tags:
 - meme
 - crysis
 ---
 This model is a fine-tuned version of [google/gemma-3-1b-it-qat-q4_0-unquantized](https://huggingface.co/google/gemma-3-1b-it-qat-q4_0-unquantized) on [AnonSOB/Crysis](https://huggingface.co/datasets/AnonSOB/Crysis).
 The following hyperparameters were used during training:
 - learning_rate: 1e-5
 - train_batch_size: 5
 - seed: 0
 - num_epochs: 6000
 Trained with mlx-lm.
 Thank you Alex Ziskind for original TinyLlama Crysis ❤️
-This model is unquantized. I dont know why I trained it on Gemma 3 QAT because it acts weird when quantized.

 tags:
 - meme
 - crysis
+library_name: mlx
 ---
 This model is a fine-tuned version of [google/gemma-3-1b-it-qat-q4_0-unquantized](https://huggingface.co/google/gemma-3-1b-it-qat-q4_0-unquantized) on [AnonSOB/Crysis](https://huggingface.co/datasets/AnonSOB/Crysis).
+### Training hyperparameters
 The following hyperparameters were used during training:
 - learning_rate: 1e-5
 - train_batch_size: 5
 - seed: 0
 - num_epochs: 6000
+### MLX.
 Trained with mlx-lm.
+### Thank you
 Thank you Alex Ziskind for original TinyLlama Crysis ❤️
+This model is unquantized. I dont know why I trained it on Gemma 3 QAT because it acts weird when quantized.
+## Suggested parameters
+- Temperature: `0.8`
+- Max Tokens: `32`
+- Max seq len: `64`