shopifyinterngrinder/sidekick-autocomplete-06b-clm-shopping

Fine-tuned from Qwen/Qwen3-0.6B using TRL SFT.
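
The card does not include a usage snippet, so the following is a minimal sketch of loading the checkpoint with Transformers and greedily completing a typed prefix. It assumes the model accepts the raw prefix as plain text (suggested by the "clm" in the name, but not confirmed by the card). The `complete` helper, its default `max_new_tokens`, and the sample query are hypothetical.

```python
MODEL_ID = "shopifyinterngrinder/sidekick-autocomplete-06b-clm-shopping"


def complete(prefix: str, max_new_tokens: int = 16) -> str:
    """Greedily continue `prefix` and return only the newly generated text.

    Assumption: the model takes the raw prefix with no chat template.
    """
    # Imported lazily so the module can be inspected without Transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    inputs = tokenizer(prefix, return_tensors="pt")
    output_ids = model.generate(
        **inputs, max_new_tokens=max_new_tokens, do_sample=False
    )
    # Drop the prompt tokens so only the completion is decoded.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `complete("wireless head")` downloads the weights on first use. If the training prompts were wrapped in a template, the prefix would need the same wrapping to get useful completions.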

Training Details

| Parameter | Value |
|---|---|
| Base Model | Qwen/Qwen3-0.6B |
| Dataset | shopifyinterngrinder/sidekick-autocomplete-data-shopping @ main |
| Training Examples | 69,780 |
| Validation Examples | 7,754 |
| Epochs | 3 |
| Learning Rate | 2e-05 |
| Batch Size (per device) | 1 |
| Gradient Accumulation | 4 |
| Max Sequence Length | 512 |
| Precision | bf16 |
| Optimizer | adamw_torch_fused |
| Warmup Steps | 100 |
| Weight Decay | 0.01 |
| LR Scheduler | cosine |
| Packing | Enabled |
| Dataset Format | prompt_completion |
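
The hyperparameters above could map onto TRL's `SFTConfig` roughly as follows. This is a sketch, not the actual training script: the `output_dir` value and the single-device default are assumptions, and the step estimate ignores packing, which usually merges short examples and lowers the real step count.

```python
import math

# Values copied from the table above; keys follow SFTConfig field names.
SFT_KWARGS = dict(
    output_dir="sidekick-autocomplete-sft",  # hypothetical path, not from the card
    num_train_epochs=3,
    learning_rate=2e-5,
    per_device_train_batch_size=1,
    gradient_accumulation_steps=4,
    max_length=512,
    bf16=True,
    optim="adamw_torch_fused",
    warmup_steps=100,
    weight_decay=0.01,
    lr_scheduler_type="cosine",
    packing=True,
)


def effective_batch_size(kwargs: dict, num_devices: int = 1) -> int:
    """Sequences consumed per optimizer step (device count is an assumption)."""
    return (
        kwargs["per_device_train_batch_size"]
        * kwargs["gradient_accumulation_steps"]
        * num_devices
    )


def estimated_optimizer_steps(
    num_examples: int, kwargs: dict, num_devices: int = 1
) -> int:
    """Rough step count over all epochs, treating each example as one sequence."""
    per_epoch = math.ceil(num_examples / effective_batch_size(kwargs, num_devices))
    return per_epoch * kwargs["num_train_epochs"]


def build_config():
    """Build the SFTConfig; imported lazily so the arithmetic runs without TRL."""
    from trl import SFTConfig

    return SFTConfig(**SFT_KWARGS)
```

On one device the effective batch size is 1 × 4 = 4 sequences per optimizer step, so the 100 warmup steps cover about 400 sequences before the cosine decay begins.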

Framework Versions

| Library | Version |
|---|---|
| Transformers | 4.57.6 |
| TRL | 0.29.0 |
| PyTorch | 2.8.0+cu128 |
| Datasets | 3.6.0 |
| Accelerate | 1.13.0 |