Update README.md
Browse files
README.md
CHANGED
|
@@ -16,6 +16,7 @@ tags:
|
|
| 16 |
OpenLID-v3 is a high-coverage, high-performance language identification model. It is an improved version of [OpenLID-v2](https://huggingface.co/laurievb/OpenLID-v2).
|
| 17 |
|
| 18 |
The original model and training data are described in [Burchell et al. (2023)](https://aclanthology.org/2023.acl-short.75/), [the OpenLID-v2 dataset repo](https://huggingface.co/datasets/laurievb/OpenLID-v2).
|
|
|
|
| 19 |
|
| 20 |
### How to use
|
| 21 |
|
|
@@ -93,7 +94,17 @@ Evaluation details are available in [the paper repository](https://github.com/hp
|
|
| 93 |
|
| 94 |
### BibTeX entry and citation info
|
| 95 |
|
| 96 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 97 |
|
| 98 |
```
|
| 99 |
@inproceedings{burchell-etal-2023-open,
|
|
@@ -117,6 +128,7 @@ Evaluation details are available in [the paper repository](https://github.com/hp
|
|
| 117 |
}
|
| 118 |
```
|
| 119 |
|
|
|
|
| 120 |
[](https://arxiv.org/abs/2305.13820)
|
| 121 |
|
| 122 |
This project has received funding from the European Union’s Horizon Europe research and innovation programme under grant agreement No 101070350 and from UK Research and Innovation (UKRI) under the UK government’s Horizon Europe funding guarantee [grant number 10052546].
|
|
|
|
| 16 |
OpenLID-v3 is a high-coverage, high-performance language identification model. It is an improved version of [OpenLID-v2](https://huggingface.co/laurievb/OpenLID-v2).
|
| 17 |
|
| 18 |
The original model and training data are described in [Burchell et al. (2023)](https://aclanthology.org/2023.acl-short.75/), [the OpenLID-v2 dataset repo](https://huggingface.co/datasets/laurievb/OpenLID-v2).
|
| 19 |
+
This model is described in the [OpenLID-v3 paper](https://arxiv.org/abs/2602.13139).
|
| 20 |
|
| 21 |
### How to use
|
| 22 |
|
|
|
|
| 94 |
|
| 95 |
### BibTeX entry and citation info
|
| 96 |
|
| 97 |
+
```
|
| 98 |
+
@misc{fedorova2026openlidv3improvingprecisionclosely,
|
| 99 |
+
title={OpenLID-v3: Improving the Precision of Closely Related Language Identification -- An Experience Report},
|
| 100 |
+
author={Mariia Fedorova and Nikolay Arefyev and Maja Buljan and Jindřich Helcl and Stephan Oepen and Egil Rønningstad and Yves Scherrer},
|
| 101 |
+
year={2026},
|
| 102 |
+
eprint={2602.13139},
|
| 103 |
+
archivePrefix={arXiv},
|
| 104 |
+
primaryClass={cs.CL},
|
| 105 |
+
url={https://arxiv.org/abs/2602.13139},
|
| 106 |
+
}
|
| 107 |
+
```
|
| 108 |
|
| 109 |
```
|
| 110 |
@inproceedings{burchell-etal-2023-open,
|
|
|
|
| 128 |
}
|
| 129 |
```
|
| 130 |
|
| 131 |
+
[](https://arxiv.org/abs/2602.13139)
|
| 132 |
[](https://arxiv.org/abs/2305.13820)
|
| 133 |
|
| 134 |
This project has received funding from the European Union’s Horizon Europe research and innovation programme under grant agreement No 101070350 and from UK Research and Innovation (UKRI) under the UK government’s Horizon Europe funding guarantee [grant number 10052546].
|