Uploaded model
- Developed by: arman1o1
- License: apache-2.0
- Finetuned from model : unsloth/Meta-Llama-3.1-8B-Instruct
This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.
- Downloads last month
- 2
Model tree for arman1o1/Llama3_1_8B_GRPO
Base model
meta-llama/Llama-3.1-8B Finetuned
meta-llama/Llama-3.1-8B-Instruct Finetuned
unsloth/Meta-Llama-3.1-8B-Instruct