pantelis-ninja/unsloth-Qwen2.5-3B-Instruct-10_r-8_lr-0.0005_ms-50_gas-2_batch-size-8 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct-9_r-8_lr-0.0005_ms-50_gas-1_batch-size-8 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct-8_r-8_lr-0.0005_ms-25_gas-4_batch-size-16 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct-7_r-8_lr-0.0005_ms-25_gas-3_batch-size-16 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct-6_r-8_lr-0.0005_ms-25_gas-2_batch-size-16 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct-5_r-8_lr-0.0005_ms-25_gas-1_batch-size-16 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct-4_r-8_lr-0.0005_ms-25_gas-4_batch-size-8 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct-3_r-8_lr-0.0005_ms-25_gas-3_batch-size-8 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct-2_r-8_lr-0.0005_ms-25_gas-2_batch-size-8 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct-1_r-8_lr-0.0005_ms-25_gas-1_batch-size-8 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_dtype-bfloat16_r-8_lr-0.0005_ms-50_gas-1_max-steps-50 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_dtype-bfloat16_r-8_lr-0.0005_ms-50_gas-4_max-steps-50 Updated Nov 30, 2024
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_dtype-bfloat16_r-8_lr-0.0005_ms-50_gas-3_max-steps-50 Updated Nov 30, 2024
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_dtype-bfloat16_r-8_lr-0.0005_ms-50_gas-2_max-steps-50 Updated Nov 30, 2024
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_dtype-bfloat16_r-8_lr-0.0005_ms-25_gas-4_max-steps-25 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_dtype-bfloat16_r-8_lr-0.0005_ms-25_gas-3_max-steps-25 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_dtype-bfloat16_r-8_lr-0.0005_ms-25_gas-2_max-steps-25 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_dtype-bfloat16_r-8_lr-0.0005_ms-25_gas-1_max-steps-25 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_gas-1_dtype-bfloat16_r-8_lr-0.0005_ms-100_gas-2_max-steps-100 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_gas-1_dtype-bfloat16_r-8_lr-0.0005_ms-100_gas-1_max-steps-100 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_gas-1_dtype-bfloat16_r-8_lr-0.0005_ms-75_gas-4_max-steps-75 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_gas-1_dtype-bfloat16_r-8_lr-0.0005_ms-75_gas-3_max-steps-75 Text Generation • 3B • Updated Nov 30, 2024 • 3
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_gas-1_dtype-bfloat16_r-8_lr-0.0005_ms-75_gas-2_max-steps-75 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_gas-1_dtype-bfloat16_r-8_lr-0.0005_ms-75_gas-1_max-steps-75 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_gas-1_dtype-bfloat16_r-8_lr-0.0005_ms-50_gas-4_max-steps-50 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_gas-1_dtype-bfloat16_r-8_lr-0.0005_ms-50_gas-3_max-steps-50 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_gas-1_dtype-bfloat16_r-8_lr-0.0005_ms-50_gas-2_max-steps-50 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_gas-1_dtype-bfloat16_r-8_lr-0.0005_ms-50_gas-1_max-steps-50 Text Generation • 3B • Updated Nov 30, 2024
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_gas-1_dtype-bfloat16_r-8_lr-0.0005_ms-25_gas-4_max-steps-25 Text Generation • 3B • Updated Nov 30, 2024 • 1
pantelis-ninja/unsloth-Qwen2.5-3B-Instruct_gas-1_dtype-bfloat16_r-8_lr-0.0005_ms-25_gas-3_max-steps-25 Text Generation • 3B • Updated Nov 30, 2024 • 1