# Qwen2.5-7B-Agent-Mixed-Trajectory-LoRA
This repository provides a merged model fine-tuned from unsloth/Qwen2.5-7B-Instruct using LoRA + Unsloth.
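Because the LoRA adapters are already merged into the base weights, the model can be loaded directly with `transformers`. A minimal sketch; the repository id below is an assumption (substitute this model's actual Hub id), and no expected output is shown since generation depends on the downloaded weights:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed repo id -- replace with this model's actual Hugging Face Hub id.
repo_id = "u-10bei/Qwen2.5-7B-Agent-Mixed-Trajectory-LoRA"

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id, torch_dtype="auto", device_map="auto"
)

# ALFWorld-style prompt, formatted with the Qwen chat template.
messages = [{"role": "user", "content": "You are in a kitchen. Find a mug and cool it."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

out = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(out[0][inputs.shape[-1]:], skip_special_tokens=True))
```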
## Dataset Construction
Training data was built by mixing and preprocessing two trajectory datasets:
- ALFWorld (`u-10bei/sft_alfworld_trajectory_dataset_v5`): 2,327 samples after cleaning
- DBBench (`u-10bei/dbbench_sft_dataset_react_v4`): 1,200 samples after cleaning
Category-level upsampling was applied to reinforce weak task types:
| Category | Multiplier |
|---|---|
| ALFWorld multi-object | ×3 |
| ALFWorld cool | ×2 |
| ALFWorld examine | ×1.5 |
| DBBench aggregation-MAX | ×3 |
| DBBench INSERT | ×2 |
| DBBench counting | ×2 |
Final dataset size: 5,169 samples
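The upsampling above can be sketched as a simple replication step. The category keys and the handling of fractional multipliers (e.g. ×1.5 draws a random half of the pool) are assumptions for illustration, not the card's exact procedure:

```python
import random

# Hypothetical category keys mapped to the multipliers from the table above.
MULTIPLIERS = {
    "alfworld_multi_object": 3.0,
    "alfworld_cool": 2.0,
    "alfworld_examine": 1.5,
    "dbbench_agg_max": 3.0,
    "dbbench_insert": 2.0,
    "dbbench_counting": 2.0,
}

def upsample(samples, multiplier, seed=0):
    """Repeat each sample floor(multiplier) times, then add a random
    subset covering the fractional part (e.g. x1.5 adds half the pool)."""
    whole = int(multiplier)
    frac = multiplier - whole
    out = samples * whole
    if frac > 0:
        rng = random.Random(seed)
        out.extend(rng.sample(samples, round(len(samples) * frac)))
    return out

# Example: x1.5 on a 10-sample category yields 15 samples.
demo = [{"id": i} for i in range(10)]
print(len(upsample(demo, 1.5)))  # 15
```

Categories not listed in the table would be passed through with a ×1 multiplier, which leaves them unchanged.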
## Training Configuration
| Parameter | Value |
|---|---|
| Base model | unsloth/Qwen2.5-7B-Instruct |
| Method | LoRA + Unsloth (Colab Pro A100) |
| Max sequence length | 4096 |
| Epochs | 3 |
| Learning rate | 8e-6 |
| LoRA r / alpha | 64 / 128 |
| Effective batch size | 16 (bs=4 × grad_accum=4) |
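The table above maps onto an Unsloth + TRL training setup roughly as follows. This is a configuration sketch, not the authors' exact script: `load_in_4bit` and the `target_modules` list are assumptions not stated in the card, and `mixed_dataset` stands in for the 5,169-sample dataset built above:

```python
from unsloth import FastLanguageModel
from trl import SFTTrainer
from transformers import TrainingArguments

# Base model and sequence length from the table.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/Qwen2.5-7B-Instruct",
    max_seq_length=4096,
    load_in_4bit=True,  # assumption: not stated in the card
)

# LoRA r / alpha from the table; target_modules is a common choice, assumed here.
model = FastLanguageModel.get_peft_model(
    model,
    r=64,
    lora_alpha=128,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    train_dataset=mixed_dataset,  # the 5,169-sample mixed dataset
    args=TrainingArguments(
        per_device_train_batch_size=4,
        gradient_accumulation_steps=4,  # effective batch size = 4 x 4 = 16
        num_train_epochs=3,
        learning_rate=8e-6,
        output_dir="outputs",
    ),
)
trainer.train()
```

After training, the adapters would be merged into the base weights before upload, since this repository ships a merged model.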
## Sources & Terms
Dataset license: MIT. Users must comply with the MIT license terms and with the base model's original terms of use.