NeuronSpark V3 - 1.1B SFT NoThink (step 2000)

Non-think SFT (V2.5 dataset, 292,525 samples, 0 think tags). bs=12 × max_len=1024 × 8 GPU, eff_batch=768. Base = pretrain step 108000.
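
The batch figures above can be cross-checked with a little arithmetic. The sketch below is only a sanity check, not the training configuration: the card does not say where the gap between 12 × 8 = 96 and eff_batch=768 comes from, so the gradient-accumulation factor is an inferred assumption, as is reading "step 2000" as 2000 optimizer steps at the effective batch size. All variable names are illustrative.

```python
# Values quoted in the card.
per_gpu_batch = 12       # bs
num_gpus = 8
max_len = 1024           # maximum sequence length in tokens
effective_batch = 768    # eff_batch
num_samples = 292_525    # size of the V2.5 SFT dataset
sft_steps = 2000         # "step 2000" in the title

# ASSUMPTION: the factor between 12 * 8 = 96 and 768 is gradient accumulation.
# The card does not state this; it is simply the ratio that makes the numbers agree.
implied_accum = effective_batch // (per_gpu_batch * num_gpus)
assert implied_accum * per_gpu_batch * num_gpus == effective_batch

# Upper bound on tokens per optimizer step (every sequence padded or packed to max_len).
max_tokens_per_step = effective_batch * max_len

# ASSUMPTION: each step consumes one full effective batch.
samples_seen = effective_batch * sft_steps
approx_epochs = samples_seen / num_samples

print(f"implied accumulation factor: {implied_accum}")
print(f"max tokens per step:         {max_tokens_per_step:,}")
print(f"samples seen by step {sft_steps}:  {samples_seen:,}")
print(f"approx. passes over data:    {approx_epochs:.2f}")
```

Under these assumptions the checkpoint would correspond to roughly five passes over the SFT data; if the step counter or batch composition is defined differently, the estimate changes accordingly.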

Safetensors. Model size: 1B params. Tensor types: F32, BF16.