Update README.md
README.md
CHANGED
@@ -7,10 +7,14 @@ tags:
 - llama
 - trl
 - dpo
+- roleplay
+- math
+- code
 license: llama3.2
 language:
 - en
 pipeline_tag: text-generation
+library_name: transformers
 ---
 😁:```Hi Fijik!```

@@ -24,7 +28,7 @@ After merging, we used a custom dataset mix meant for this model, to improve its
 - **Step 2 for the fine-tuning via unsloth:** DPO for 2 epochs for even better instruction following.
 After these two steps, we got a powerful model that has fewer parameters than Llama 3.1 8B yet performs just as well, if not better. Note that unlike our other models, it is not a thinking model. Our theory behind this model is that a smaller yet deeper model can outperform for its size.

-Meta states that LLAMA 3.2 was pre-trained on up to 9 trillion high quality tokens.
+Meta states that LLAMA 3.2 was pre-trained on up to 9 trillion high quality tokens, with a knowledge cutoff date of December 2023.

 # What should Fijik be used for?
 Fijik
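For readers curious what the DPO step above could look like in practice, here is a minimal sketch of a preference-tuning run with Unsloth and TRL. The checkpoint path, dataset name, LoRA settings, and most hyperparameters are illustrative assumptions; only the "2 epochs" detail comes from the card itself.

```python
# Minimal sketch of a DPO fine-tune with Unsloth + TRL (assumed setup;
# model path, dataset, and most hyperparameters are placeholders).
from unsloth import FastLanguageModel
from datasets import load_dataset
from trl import DPOConfig, DPOTrainer

# Load the merged checkpoint in 4-bit to keep memory use low.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="path/to/merged-fijik-checkpoint",  # hypothetical path
    max_seq_length=2048,
    load_in_4bit=True,
)

# Attach LoRA adapters so only a small fraction of weights is updated.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)

# DPO expects a preference dataset with "prompt", "chosen", "rejected" columns.
dataset = load_dataset("path/to/preference-mix", split="train")  # hypothetical

trainer = DPOTrainer(
    model=model,
    ref_model=None,  # with LoRA adapters, TRL can derive the reference model implicitly
    args=DPOConfig(
        num_train_epochs=2,            # the "2 epochs" mentioned in the card
        per_device_train_batch_size=2,
        learning_rate=5e-6,
        beta=0.1,                      # standard DPO temperature
        output_dir="fijik-dpo",
    ),
    train_dataset=dataset,
    processing_class=tokenizer,  # older TRL releases name this argument `tokenizer`
)
trainer.train()
```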
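And since the front matter now declares `library_name: transformers` with `pipeline_tag: text-generation`, loading the model should follow the standard pipeline pattern below; the repository id is a placeholder, as this diff does not show the model's actual Hub id.

```python
# Standard transformers text-generation usage implied by the card metadata.
# The repo id is a placeholder; substitute the model's actual Hub id.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="your-org/Fijik",  # placeholder repo id
    torch_dtype="auto",      # pick an appropriate dtype automatically
    device_map="auto",       # place the model on available GPU(s)/CPU
)

print(generator("Hi Fijik!", max_new_tokens=64)[0]["generated_text"])
```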
|