Update README.md
README.md
CHANGED
@@ -134,7 +134,7 @@ The following training hyperparameters are used in feature alignment and task sp
 
 | Data size | Global Batch Size | Learning Rate | Epochs | Max Length | Weight Decay |
 |--------------|-------------------|---------------|--------|------------|--------------|
-
+| 1M | 16 | 5e-5 | 1 | 2048 | 1e-5 |
 
 We use full-parameter finetuning for the projector and apply LoRA to the language model.
 
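For reviewers checking the added row, the values can be captured in a small config object. This is only a sketch: `TrainConfig` and `total_steps` are hypothetical names that do not come from the repository, and "1M" is assumed to mean 1,000,000 training samples.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainConfig:
    """Hyperparameters from the added table row (names are illustrative)."""
    data_size: int = 1_000_000      # "1M" in the table; assumed to be a sample count
    global_batch_size: int = 16     # summed over all devices
    learning_rate: float = 5e-5
    epochs: int = 1
    max_length: int = 2048          # maximum sequence length in tokens
    weight_decay: float = 1e-5

    def total_steps(self) -> int:
        # One optimizer step consumes one global batch; round up for a partial last batch.
        return math.ceil(self.data_size / self.global_batch_size) * self.epochs


config = TrainConfig()
print(config.total_steps())  # 62500
```

Under these assumptions, one epoch over 1M samples at a global batch size of 16 comes to 62,500 optimizer steps, which is a quick sanity check when setting up a learning-rate schedule.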