u-10bei/sft_alfworld_trajectory_dataset_v5
Viewer • Updated • 2.5k • 884
How to use Chattso-GPT/adv-sft-v10 with PEFT:
Task type is invalid.
LoRA adapter fine-tuned from Qwen/Qwen3-4B-Instruct-2507 using LoRA + Unsloth. This repository contains LoRA adapter weights only.
Dataset License: MIT License.
Base model
Qwen/Qwen3-4B-Instruct-2507