HuggingFaceTB
/

SmolVLM-Instruct-DPO

Image-Text-to-Text

Model card Files Files and versions

kashif HF Staff commited on Nov 26, 2024

Commit

250d5be

·

verified ·

1 Parent(s): af975d2

Update README.md

Files changed (1) hide show

README.md +12 -1

README.md CHANGED Viewed

@@ -79,7 +79,18 @@ Use the code below to get started with the model.
 ### Training Procedure
 ```bash
- accelerate launch  --config_file examples/accelerate_configs/multi_gpu.yaml  examples/scripts/dpo_vlm.py    --dataset_name HuggingFaceH4/rlaif-v_formatted     --model_name_or_path HuggingFaceTB/SmolVLM-Instruct     --per_device_train_batch_size 8   --gradient_accumulation_steps 32     --dataset_num_proc 32     --output_dir dpo_smolvlm_rlaif-v     --bf16     --torch_dtype bfloat16      --use_peft    --lora_target_modules=all-linear exit
 ```
 ### Framework versions

 ### Training Procedure
 ```bash
+accelerate launch  --config_file examples/accelerate_configs/multi_gpu.yaml \
+  examples/scripts/dpo_vlm.py \
+  --dataset_name HuggingFaceH4/rlaif-v_formatted \
+  --model_name_or_path HuggingFaceTB/SmolVLM-Instruct \
+  --per_device_train_batch_size 8 \
+  --gradient_accumulation_steps 32 \
+  --dataset_num_proc 32 \
+  --output_dir dpo_smolvlm_rlaif-v \
+  --bf16 \
+  --torch_dtype bfloat16 \
+  --use_peft \
+  --lora_target_modules=all-linear exit
 ```
 ### Framework versions