Uploaded model
- Developed by: chriswhpang
- License: apache-2.0
- Finetuned from model : unsloth/SmolLM2-1.7B-Instruct
This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.
- Downloads last month
- 3
Hardware compatibility
Log In to add your hardware
8-bit
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support
Model tree for chriswhpang/SmolLM2-1.7B-Instruct-GRPO-GGUF
Base model
HuggingFaceTB/SmolLM2-1.7B Quantized
HuggingFaceTB/SmolLM2-1.7B-Instruct Finetuned
unsloth/SmolLM2-1.7B-Instruct