Update README.md
--- a/README.md
+++ b/README.md
@@ -13,9 +13,9 @@ language:
 - ko
 ---
 
-# colbert-ko-v1
+# colbert-ko-v1
 
-**colbert-ko-v1
+**colbert-ko-v1** is a Korean colbert model finetuned with [PyLate](https://github.com/lightonai/pylate). This model is trained exclusively on Korean dataset. It maps sentences & paragraphs to sequences of 128-dimensional dense vectors and can be used for semantic textual similarity using the MaxSim operator.
 
 ## Model Details
 
@@ -57,7 +57,7 @@ We omit MIRACLRetrieval and MrTidyRetrieval in evalution due to our device condi
 
 | Model | Parameters | Average Recall@10 | Average Precision@10 | Average NDCG@10 | Average F1@10 |
 |-----------------------------------------------|------------|----------------|-------------------|--------------|------------|
-| **colbert-ko-v1
+| **colbert-ko-v1** | **0.1B** | **0.7999** | **0.0930** | **0.7172** | **0.1655**|
 | [jina-colbert-v2](https://huggingface.co/jinaai/jina-colbert-v2) | 0.5B | 0.7518 | 0.0888 | 0.6671 | 0.1577 |
 
 ## Usage
@@ -84,7 +84,7 @@ documents_ids = [
 ]
 
 model = models.ColBERT(
-model_name_or_path="yjoonjang/colbert-ko-v1
+model_name_or_path="yjoonjang/colbert-ko-v1",
 )
 
 queries_embeddings = model.encode(
@@ -141,7 +141,7 @@ from pylate import indexes, models, retrieve
 
 # Step 1: Load the ColBERT model
 model = models.ColBERT(
-model_name_or_path="yjoonjang/colbert-ko-v1
+model_name_or_path="yjoonjang/colbert-ko-v1",
 )
 
 # Step 2: Initialize the PLAID index
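The description added in this commit says the model maps text to sequences of 128-dimensional vectors that are compared with the MaxSim operator. As a rough illustration of that operator only, here is a minimal pure-Python sketch: for each query token vector it takes the highest dot product against any document token vector, then sums those maxima. The helper names (`dot`, `maxsim_score`) and the toy 2-dimensional vectors are assumptions made for this example; they are not part of PyLate or of this model's code, and real embeddings would come from `model.encode`.

```python
from typing import Sequence

Vector = Sequence[float]


def dot(u: Vector, v: Vector) -> float:
    """Inner product of two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same dimension")
    return sum(a * b for a, b in zip(u, v))


def maxsim_score(query_tokens: Sequence[Vector], doc_tokens: Sequence[Vector]) -> float:
    """Late-interaction score: for each query token embedding, take the
    highest dot product against any document token embedding, then sum."""
    if not query_tokens or not doc_tokens:
        raise ValueError("need at least one query token and one document token")
    return sum(max(dot(q, d) for d in doc_tokens) for q in query_tokens)


# Toy 2-dimensional token embeddings (hypothetical values; the model card
# states that the real embeddings are 128-dimensional).
query = [[1.0, 0.0], [0.0, 1.0]]
doc_a = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
doc_b = [[0.5, 0.5]]

score_a = maxsim_score(query, doc_a)
score_b = maxsim_score(query, doc_b)
print(score_a, score_b)
```

In this toy case `doc_a` scores higher than `doc_b` because each query token finds an exact match among its tokens, which is the behaviour a retrieval ranking would rely on.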