Update README.md
README.md
CHANGED
@@ -202,10 +202,6 @@ Each argument was constructed by selecting two random propositions (from a list
 
 <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
 
-#### Preprocessing
-
-[More Information Needed]
-We converted the data to the Alpaca chat format before feeding it to the model.
 
 #### Training
 
@@ -213,7 +209,7 @@ We used unsloth for memory reduced sped up training.
 
 We trained for one epoch.
 
-Less than
+Less than 3.5 GB of VRAM were used for training, and it took 3 hours.
 
 ## Evaluation
 