Commit 4567479
Parent(s): 585fa7a

Update README.md

README.md CHANGED
@@ -142,6 +142,24 @@ Outputs:
</details>

If you prefer your output not to be broken into separate sentences, you can disable sentence boundary detection
in the API call:

```python
# `m` is the model instance created in the usage example earlier in this README
input_texts: List[str] = [
    "hola amigo cómo estás es un día lluvioso hoy",
]
results: List[str] = m.infer(input_texts, apply_sbd=False)
print(results[0])
```

Instead of a `List[List[str]]` (a list of output sentences for each input), we get a `List[str]` (one output
string per input, with all of its sentences left unsplit):

```text
Hola, amigo. ¿Cómo estás? Es un día lluvioso hoy.
```
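
Code that may receive either return shape can normalize it before further processing. The following is a minimal sketch and not part of the model's API: the helper name `flatten_outputs` is hypothetical, it assumes only the two shapes described above, and the sample data is hand-written rather than real model output.

```python
from typing import List, Sequence, Union


def flatten_outputs(results: Sequence[Union[str, List[str]]]) -> List[str]:
    """Flatten model outputs into a single list of strings.

    Hypothetical helper (not part of the model's API). Accepts either
    shape described above: one list of sentences per input (the default)
    or one string per input (apply_sbd=False).
    """
    flat: List[str] = []
    for item in results:
        if isinstance(item, str):
            # apply_sbd=False: each input yields one unsplit string
            flat.append(item)
        else:
            # default: each input yields a list of sentences
            flat.extend(item)
    return flat


# Hand-written sample data standing in for real model output
with_sbd = [["Hola, amigo.", "¿Cómo estás?", "Es un día lluvioso hoy."]]
without_sbd = ["Hola, amigo. ¿Cómo estás? Es un día lluvioso hoy."]

print(flatten_outputs(with_sbd))
print(flatten_outputs(without_sbd))
```

The helper checks for `str` first because a string is itself iterable; extending the list with it directly would split the text into single characters.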

# Training Data
For all languages except Catalan, this model was trained with ~10M lines of text per language from StatMT's [News Crawl](https://data.statmt.org/news-crawl/).