Uploaded model

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.

GGUF

Model size

2B params

Architecture

llama

Hardware compatibility

8-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for chriswhpang/SmolLM2-1.7B-Instruct-GRPO-GGUF

Base model

Quantized

Finetuned

Quantized

(21)

this model