Pinkstack
/

Llama-3.2-3B-o1

Text Generation

text-generation-inference

Model card Files Files and versions

Pinkstack commited on Dec 15, 2024

Commit

b11d31f

·

verified ·

1 Parent(s): 7fba4a2

Update README.md

Files changed (1) hide show

README.md +5 -1

README.md CHANGED Viewed

@@ -15,10 +15,14 @@ datasets:
 # Uploaded  model
 - **Developed by:** PinkStack
 - **License:** apache-2.0
 - **Finetuned from model :** unsloth/llama-3.2-3b-instruct-bnb-4bit
-This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
 [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 # Uploaded  model
+Further trained Llama 3.2 3B with enhanced reasoning, aiming for deepseek r1 quality.
+Trained with Nvidia Tesla t4
+⚠️ This is the first model we've trained as a test. Phi 3.5 soon.
 - **Developed by:** PinkStack
 - **License:** apache-2.0
 - **Finetuned from model :** unsloth/llama-3.2-3b-instruct-bnb-4bit
+This llama model was trained using [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
 [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)