walidsobhie-code committed
Commit 33e3770 · 1 Parent(s): 058de92

fix: use correct input_path key for train_lora and ensure data path is correct

- Changed config 'train_file' to 'input_path' (matches train_lora.py)
- DATA_FILE now correctly points to actual data (data_mini/train_mini.jsonl)
- Removed unused train_dir/eval_dir from config
- Should finally resolve FileNotFoundError
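
A pre-flight check along these lines could catch both failure modes named above (a stale config key and a wrong data path) before training starts. This is only a sketch: `check_data_config` is a hypothetical helper, not part of train_lora.py, and it assumes train_lora.py reads the data path from config['data']['input_path'] as this commit states.

```python
from pathlib import Path


def check_data_config(config: dict, required_key: str = "input_path") -> Path:
    """Return the data path if the config is usable, otherwise raise early.

    Hypothetical helper for illustration; not part of train_lora.py.
    """
    data_cfg = config.get("data", {})
    if required_key not in data_cfg:
        # A stale key such as 'train_file' would otherwise go unnoticed
        # until the training script fails to find its input.
        raise KeyError(
            f"config['data'] is missing '{required_key}'; "
            f"found keys: {sorted(data_cfg)}"
        )
    data_path = Path(data_cfg[required_key])
    if not data_path.is_file():
        raise FileNotFoundError(f"Training data not found: {data_path}")
    return data_path
```

In the notebook this could be called as `check_data_config(config)` right after the config dict is built, so a wrong key or path fails in the config cell instead of inside the training run.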

Files changed (1)
  1. kaggle_train_stack29.ipynb +2 -2
kaggle_train_stack29.ipynb CHANGED
@@ -136,7 +136,7 @@
  " 'torch_dtype': 'float16'\n",
  " },\n",
  " 'data': {\n",
- " 'train_file': DATA_FILE, # USE THE ACTUAL DATA FILE PATH\n",
+ " 'input_path': DATA_FILE, # Correct key for train_lora.py\n",
  " 'max_length': 2048,\n",
  " 'train_split': 1.0 # Use all data for training\n",
  " },\n",
@@ -185,7 +185,7 @@
  "print(f\"✅ Config saved to: {config_path}\")\n",
  "print(\"\\nConfig summary:\")\n",
  "print(f\" Model: {config['model']['name']}\")\n",
- "print(f\" Data: {config['data']['train_file']}\")\n",
+ "print(f\" Data: {config['data']['input_path']}\")\n",
  "print(f\" LoRA rank: {config['lora']['r']}\")\n",
  "print(f\" Batch size: {config['training']['batch_size']}\")\n",
  "print(f\" Epochs: {config['training']['num_epochs']}\")"