Shin-YAM
/

Agent_try09

@@ -33,6 +33,12 @@ Loss is applied to **all assistant turns** in the multi-turn trajectory,
 enabling the model to learn environment observation, action selection,
 tool use, and recovery from errors.
+Training consists of two stages.
+First, a LoRA adapter is trained to adapt the model to database (SQL) tasks.
+Second, a separate LoRA adapter is trained for ALF.
+Finally, the adapters are merged into the base model sequentially:
+the DB adapter first, then the ALF adapter.
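
The sequential merge described above can be illustrated with a minimal sketch. It assumes the standard LoRA update `W + (alpha / rank) * B @ A` and uses tiny hand-written matrices in place of real model weights; the adapter values, `alpha`, `rank`, and the helper names are hypothetical and are not taken from this repository. In practice a library such as PEFT performs this step (for example via `merge_and_unload()`), so this only shows the arithmetic.

```python
# Illustrative only: a 2x2 matrix stands in for one weight of the base model.

def matmul(a, b):
    """Plain-Python matrix product of two nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def merge_lora(weight, lora_b, lora_a, alpha, rank):
    """Return weight + (alpha / rank) * (lora_b @ lora_a)."""
    scale = alpha / rank
    delta = matmul(lora_b, lora_a)
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


base = [[1.0, 0.0], [0.0, 1.0]]

# Hypothetical rank-1 adapters (B is 2x1, A is 1x2).
db_b, db_a = [[1.0], [0.0]], [[0.0, 2.0]]
alf_b, alf_a = [[0.0], [1.0]], [[3.0, 0.0]]

# Merge the DB adapter first, then the ALF adapter.
merged = merge_lora(base, db_b, db_a, alpha=1, rank=1)
merged = merge_lora(merged, alf_b, alf_a, alpha=1, rank=1)
print(merged)
```

Because each merge only adds a fixed delta to the weights, the merged result is the base weight plus both deltas. The order matters for the workflow (which checkpoint exists at each step) rather than for the final sum in this simplified setting.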
 ## Training Configuration
 - Base model: Qwen/Qwen3-4B-Instruct-2507