Update README.md
README.md
CHANGED
@@ -98,6 +98,8 @@ print(generated_texts[0])
 
 ### Training Procedure
 
+See detailed blog on preference tuning VLLMs [here](https://huggingface.co/blog/dpo_vlm).
+
 ```bash
 accelerate launch --config_file examples/accelerate_configs/multi_gpu.yaml \
 examples/scripts/dpo_vlm.py \
@@ -110,7 +112,7 @@ accelerate launch --config_file examples/accelerate_configs/multi_gpu.yaml \
 --bf16 \
 --torch_dtype bfloat16 \
 --use_peft \
---lora_target_modules=all-linear
+--lora_target_modules=all-linear
 ```
 
 ### Framework versions