Update README.md
Browse files
README.md
CHANGED
|
@@ -107,7 +107,8 @@ to speed up the training: max_seq_length = 200
|
|
| 107 |
#### Training Hyperparameters
|
| 108 |
|
| 109 |
- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
|
| 110 |
-
|
|
|
|
| 111 |
trainer = SFTTrainer(
|
| 112 |
model = model,
|
| 113 |
train_dataset=train_data,
|
|
@@ -148,7 +149,7 @@ trainer = SFTTrainer(
|
|
| 148 |
}
|
| 149 |
),
|
| 150 |
)
|
| 151 |
-
|
| 152 |
|
| 153 |
#### Speeds, Sizes, Times [optional]
|
| 154 |
|
|
|
|
| 107 |
#### Training Hyperparameters
|
| 108 |
|
| 109 |
- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
|
| 110 |
+
|
| 111 |
+
```python
|
| 112 |
trainer = SFTTrainer(
|
| 113 |
model = model,
|
| 114 |
train_dataset=train_data,
|
|
|
|
| 149 |
}
|
| 150 |
),
|
| 151 |
)
|
| 152 |
+
```
|
| 153 |
|
| 154 |
#### Speeds, Sizes, Times [optional]
|
| 155 |
|