Update README.md
README.md
CHANGED
@@ -20,7 +20,7 @@ A Colab notebook is available in our [GitHub repository](https://github.com/MLI-
 ## Training Details
 
 - Models are pretrained on synthetic data generated by sampling ground-truth sequences of length L uniformly at random over the quaternary alphabet, and independently introducing insertions, deletions, and substitutions at each position.
-- Error probabilities for insertions, deletions, and substitutions are drawn uniformly from the interval [0.01, 0.1], and cluster sizes are sampled uniformly from
+- Error probabilities for insertions, deletions, and substitutions are drawn uniformly from the interval [0.01, 0.1], and cluster sizes are sampled uniformly from [2, 10].
 - Models are fine-tuned on real-world sequencing data (Noisy-DNA and Microsoft datasets).
 
 For full experimental details, see [our paper](https://arxiv.org/abs/XXXX.XXXXX).
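For readers reviewing this change, the synthetic pretraining procedure described in the updated bullets can be sketched roughly as follows. This is an illustrative sketch only, not the authors' generator: the function names (`sample_sequence`, `corrupt`, `sample_cluster`), the `ACGT` alphabet, drawing one set of error rates per cluster, treating [2, 10] as an inclusive integer range, and the order in which the three error types are applied at each position are all assumptions that the README does not specify.

```python
import random

ALPHABET = "ACGT"  # assumed concrete form of the "quaternary alphabet"


def sample_sequence(length, rng):
    """Draw a ground-truth sequence uniformly at random over ALPHABET."""
    return "".join(rng.choice(ALPHABET) for _ in range(length))


def corrupt(seq, p_ins, p_del, p_sub, rng):
    """Apply independent insertions, deletions, and substitutions per position.

    Assumed ordering: a possible insertion before the base, then a possible
    deletion of the base, otherwise a possible substitution.
    """
    out = []
    for base in seq:
        if rng.random() < p_ins:
            out.append(rng.choice(ALPHABET))  # insert a random symbol
        if rng.random() < p_del:
            continue  # drop the original base
        if rng.random() < p_sub:
            # substitute with a different symbol
            out.append(rng.choice([b for b in ALPHABET if b != base]))
        else:
            out.append(base)
    return "".join(out)


def sample_cluster(length, rng):
    """Return (ground truth, noisy reads) for one synthetic cluster."""
    # Error probabilities drawn uniformly from [0.01, 0.1]
    # (assumed: once per cluster rather than once per read).
    p_ins, p_del, p_sub = (rng.uniform(0.01, 0.1) for _ in range(3))
    # Cluster size drawn uniformly from [2, 10] (assumed inclusive integers).
    size = rng.randint(2, 10)
    truth = sample_sequence(length, rng)
    reads = [corrupt(truth, p_ins, p_del, p_sub, rng) for _ in range(size)]
    return truth, reads
```

Calling `sample_cluster(110, random.Random(0))` yields one ground-truth sequence together with between 2 and 10 noisy reads of it; the length 110 is an arbitrary illustrative value, since the README leaves L unspecified.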