Update README.md
README.md
CHANGED
|
@@ -89,15 +89,9 @@ The model was trained on 80,000+ queries from 11 benchmarks:
|
|
| 89 |
|
| 90 |
| Domain | Datasets |
|
| 91 |
|--------|----------|
|
| 92 |
-
| Financial | FinReport, FinSlides, FinQA, ConvFinQA |
|
| 93 |
| Scientific | ArxivQA, SciQAG |
|
| 94 |
-
| General | Wiki-SS, MP-DocVQA, DUDE, VQAnBD,
|
| 95 |
-
|
| 96 |
-
### Training Procedure
|
| 97 |
-
|
| 98 |
-
1. **Oracle Label Generation**: Run all retrieval pipelines on training queries to collect nDCG@5 and latency metrics
|
| 99 |
-
2. **Reward Computation**: `r(q, i) = (1 - λ) · nDCG(q, i) + λ · (1 - NormalizedLatency(q, i))`
|
| 100 |
-
3. **Soft Label Training**: Train with weighted KL divergence loss using reward scores as soft labels
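The three steps above can be sketched in plain Python. This is only an illustration under stated assumptions, not the project's training code: the min-max latency normalization, the softmax with a `temperature` parameter used to turn rewards into soft labels, the unweighted KL term, and every function name and toy number below are hypothetical and do not come from the README.

```python
import math

def reward(ndcg, norm_latency, lam=0.0):
    """r(q, i) = (1 - lam) * nDCG(q, i) + lam * (1 - NormalizedLatency(q, i))."""
    return (1.0 - lam) * ndcg + lam * (1.0 - norm_latency)

def min_max_normalize(latencies):
    # Assumption: latency is min-max normalized across the pipelines of one query.
    lo, hi = min(latencies), max(latencies)
    if hi == lo:
        return [0.0 for _ in latencies]
    return [(x - lo) / (hi - lo) for x in latencies]

def soft_labels(rewards, temperature=1.0):
    # Assumption: rewards are mapped to a target distribution with a softmax.
    m = max(rewards)  # subtract the max for numerical stability
    exps = [math.exp((r - m) / temperature) for r in rewards]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(target, predicted, eps=1e-12):
    # KL(target || predicted); the README's per-example weighting is not specified,
    # so this sketch leaves it out.
    return sum(
        t * math.log((t + eps) / (p + eps))
        for t, p in zip(target, predicted)
        if t > 0
    )

# Toy values for one query and three hypothetical pipelines (not measured data).
ndcg = [0.62, 0.80, 0.75]
latency_ms = [40.0, 900.0, 300.0]

norm = min_max_normalize(latency_ms)
rewards = [reward(n, l, lam=0.0) for n, l in zip(ndcg, norm)]
targets = soft_labels(rewards, temperature=0.1)
```

With `lam=0.0`, the setting listed under Hyperparameters, the latency term drops out and the reward reduces to nDCG alone, so the soft labels favor the most accurate pipeline regardless of its cost.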
|
| 101 |
|
| 102 |
### Hyperparameters
|
| 103 |
|
|
@@ -112,14 +106,6 @@ The model was trained on 80,000+ queries from 11 benchmarks:
|
|
| 112 |
| Precision | bfloat16 |
|
| 113 |
| λ (trade-off) | 0.0 (accuracy-focused) |
|
| 114 |
|
| 115 |
-
## Performance
|
| 116 |
-
|
| 117 |
-
### Latency
|
| 118 |
-
|
| 119 |
-
| Component | Time |
|
| 120 |
-
|-----------|------|
|
| 121 |
-
| Router Inference | ~15ms |
|
| 122 |
-
|
| 123 |
## Intended Use
|
| 124 |
|
| 125 |
IRouterLM is designed for:
|
|
@@ -132,7 +118,6 @@ IRouterLM is designed for:
|
|
| 132 |
|
| 133 |
- Trained on English queries only
|
| 134 |
- Optimized for document retrieval tasks (financial, scientific, general domains)
|
| 135 |
-
- Requires the corresponding retrieval pipelines to be available
|
| 136 |
|
| 137 |
## License
|
| 138 |
|
|
|
|
| 89 |
|
| 90 |
| Domain | Datasets |
|
| 91 |
|--------|----------|
|
| 92 |
+
| Financial | FinReport, FinSlides, FinQA, ConvFinQA, TAT-DQA |
|
| 93 |
| Scientific | ArxivQA, SciQAG |
|
| 94 |
+
| General | Wiki-SS, MP-DocVQA, DUDE, VQAnBD, |
|
| 95 |
|
| 96 |
### Hyperparameters
|
| 97 |
|
|
|
|
| 106 |
| Precision | bfloat16 |
|
| 107 |
| λ (trade-off) | 0.0 (accuracy-focused) |
|
| 108 |
|
| 109 |
## Intended Use
|
| 110 |
|
| 111 |
IRouterLM is designed for:
|
|
|
|
| 118 |
|
| 119 |
- Trained on English queries only
|
| 120 |
- Optimized for document retrieval tasks (financial, scientific, general domains)
|
| 121 |
|
| 122 |
## License
|
| 123 |
|