Update README.md
README.md
CHANGED
@@ -77,7 +77,7 @@ r = pipe(
 
 ### Training Data
 
-The model has been on a proprietary dataset of ~1.35M examples consisting of
+The model has been trained on a proprietary dataset of ~1.35M examples consisting of
 * High quality swedish instruct data
 * Single turn
 * Multi-turn
@@ -88,14 +88,14 @@ The model has been on a proprietary dataset of ~1.35M examples consisting of
 For training we used hugginface Accelerate and TRL.
 
 
-#### Preprocessing
+#### Preprocessing
 
 For efficiency, we packed all the examples into 8K context windows, reducing the number examples to ~12% of their original count.
 
 
 #### Training Hyperparameters
 
-- **Training regime:**
+- **Training regime:**
 
 
 [More Information Needed]
@@ -104,8 +104,10 @@ For efficiency, we packed all the examples into 8K context windows, reducing the
 
 The model has been evaluated on [Scandeval](https://scandeval.com/swedish-nlg/) swedish subset.
 
+The result of the individual metrics compared to other top scoring models
 
 
+The mean score of all metrics compared to other models in the Swedish NLG category.
 
 
 
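The Preprocessing section of the README describes packing examples into 8K context windows. As a rough illustration of that idea, the sketch below greedily packs already-tokenized examples into fixed-size windows. It is not the authors' pipeline or TRL's implementation: the function name `pack_examples`, the separator id `sep_id`, and the greedy strategy are all assumptions made for this example.

```python
from typing import List


def pack_examples(
    examples: List[List[int]],
    max_len: int = 8192,   # "8K" window size, taken from the README text
    sep_id: int = 0,       # hypothetical separator/EOS token id
) -> List[List[int]]:
    """Greedily pack tokenized examples into windows of at most max_len tokens."""
    windows: List[List[int]] = []
    current: List[int] = []
    for tokens in examples:
        # Truncate oversized examples so one example plus its separator always fits.
        piece = tokens[: max_len - 1] + [sep_id]
        # Start a new window when the next example would overflow the current one.
        if len(current) + len(piece) > max_len:
            windows.append(current)
            current = []
        current.extend(piece)
    if current:
        windows.append(current)
    return windows


if __name__ == "__main__":
    toy = [[1] * 3000, [2] * 3000, [3] * 3000, [4] * 1000]
    packed = pack_examples(toy)
    print(len(toy), "examples ->", len(packed), "windows")
```

Because many short examples share one window, the number of training sequences drops sharply, which is the effect the README reports. A real setup would also need to handle attention masking across example boundaries, which this sketch ignores.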