Update README.md
Browse files
README.md
CHANGED
|
@@ -50,7 +50,7 @@ model = AutoModel.from_pretrained("nlpaueb/sec-bert-base")
|
|
| 50 |
## Pre-process Text
|
| 51 |
|
| 52 |
In order to use SEC-BERT-NUM, you have to pre-process texts replacing every numerical token with a corresponding shape pseudo token from a list of 214 predefined shape pseudo tokens. If the numerical token does not correspond to any shape pseudo token we replace it with the [NUM] pseudo-token.
|
| 53 |
-
Below there is an example how you can pre-process a simple sentence. This approach is quite simple, feel free to modify it as you see fit.
|
| 54 |
|
| 55 |
```python
|
| 56 |
import re
|
|
|
|
| 50 |
## Pre-process Text
|
| 51 |
|
| 52 |
In order to use SEC-BERT-NUM, you have to pre-process texts replacing every numerical token with a corresponding shape pseudo token from a list of 214 predefined shape pseudo tokens. If the numerical token does not correspond to any shape pseudo token we replace it with the [NUM] pseudo-token.
|
| 53 |
+
Below there is an example of how you can pre-process a simple sentence. This approach is quite simple, feel free to modify it as you see fit.
|
| 54 |
|
| 55 |
```python
|
| 56 |
import re
|