Solo

Model Details

Base Model Qwen/Qwen3-0.6B
Method LoRA (PEFT)
Parameters 0.6B

Training Hyperparameters

Epochs 2
Max Steps 100
Batch Size 2
Gradient Accumulation 4
Learning Rate 0.0002
LoRA r 4
LoRA Alpha 4
Max Sequence Length 2048
Training Duration 4m 20s

Dataset

GetSoloTech/Code-Reasoning


Trained with Solo

Downloads last month
3
Safetensors
Model size
0.6B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for zeeshaan-ai/solo-tune-test68

Finetuned
Qwen/Qwen3-0.6B
Adapter
(359)
this model

Dataset used to train zeeshaan-ai/solo-tune-test68