ZhiyuanChen commited on
Commit
46863bd
·
verified ·
1 Parent(s): 4746fa4

Upload folder using huggingface_hub

Browse files
Files changed (7) hide show
  1. README.md +394 -0
  2. config.json +39 -0
  3. license-faq.md +299 -0
  4. license.md +661 -0
  5. model.safetensors +3 -0
  6. tokenizer_config.json +69 -0
  7. vocab.txt +4102 -0
README.md ADDED
@@ -0,0 +1,394 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ datasets:
3
+ - multimolecule/gencode-human
4
+ library_name: multimolecule
5
+ license: agpl-3.0
6
+ mask_token: <mask>
7
+ pipeline_tag: fill-mask
8
+ tags:
9
+ - Biology
10
+ - DNA
11
+ widget:
12
+ - example_title: prion protein (Kanno blood group)
13
+ mask_index: 21
14
+ mask_index_1based: 22
15
+ masked_char: A
16
+ output:
17
+ - label: GCTTCT
18
+ score: 0.999084
19
+ - label: GCCTCT
20
+ score: 0.00032
21
+ - label: GTTTCT
22
+ score: 0.000146
23
+ - label: GGTTCT
24
+ score: 0.000136
25
+ - label: ACTTCT
26
+ score: 5.8e-05
27
+ pipeline_tag: fill-mask
28
+ sequence_type: cDNA
29
+ task: fill-mask
30
+ text: ATGGCGAACCTTGGCTGCTGG<mask>GTTCTCTTTGTGGCCACATGGAGTGACCTGGGCCTCTGC
31
+ - example_title: interleukin 10
32
+ mask_index: 11
33
+ mask_index_1based: 12
34
+ masked_char: A
35
+ output:
36
+ - label: AGCTCT
37
+ score: 0.999974
38
+ - label: GGCTCT
39
+ score: 6.0e-06
40
+ - label: TGCTCT
41
+ score: 5.0e-06
42
+ - label: ACCTCT
43
+ score: 3.0e-06
44
+ - label: ATCTCT
45
+ score: 2.0e-06
46
+ pipeline_tag: fill-mask
47
+ sequence_type: cDNA
48
+ task: fill-mask
49
+ text: ATGCACAGCTC<mask>GCTCTGTTGCCTGGTCCTCCTGACTGGGGTGAGGGCC
50
+ - example_title: Zaire ebolavirus
51
+ mask_index: 11
52
+ mask_index_1based: 12
53
+ masked_char: A
54
+ output:
55
+ - label: CAAAAG
56
+ score: 0.959986
57
+ - label: CAGAAG
58
+ score: 0.033968
59
+ - label: CTGAAG
60
+ score: 0.002412
61
+ - label: CAAGAG
62
+ score: 0.000615
63
+ - label: CACAAG
64
+ score: 0.000562
65
+ pipeline_tag: fill-mask
66
+ sequence_type: cDNA
67
+ task: fill-mask
68
+ text: AATGTTCAAAC<mask>TGAAGCTCTGTTAGCTGATGGTCTTGCTAAAGCATTTCCTAGCAATATGATGGTAGTCACAGAGCGTGAGCAAAAAGAAAGCTTATTGCATCAAGCATCATGGCACCACACAAGTGATGATTTTGGTGAGCATGCCACAGTTAGAGGGAGTAGCTTTGTAACTGATTTAGAGAAATACAATCTTGCATTTAGATATGAGTTTACAGCACCTTTTATAGAATATTGTAACCGTTGCTATGGTGTTAAGAATGTTTTTAATTGGATGCATTATACAATCCCACAGTGTTAT
69
+ - example_title: SARS coronavirus
70
+ mask_index: 14
71
+ mask_index_1based: 15
72
+ masked_char: A
73
+ output:
74
+ - label: TTCTTA
75
+ score: 0.999995
76
+ - label: TTCTTT
77
+ score: 1.0e-06
78
+ - label: TTCTTG
79
+ score: 1.0e-06
80
+ - label: TTCTTC
81
+ score: 1.0e-06
82
+ - label: CTCTTA
83
+ score: 1.0e-06
84
+ pipeline_tag: fill-mask
85
+ sequence_type: cDNA
86
+ task: fill-mask
87
+ text: ATGTTTATTTTCTT<mask>TCTTACTCTCACTAGTGGTAGTGACCTTGACCGGTGCACCACTTTTGATGATGTTCAAGCTCCTAATTACACTCAACATACTTCATCTATGAGGGGGGTTTACTATCCTGATGAAATTTTTAGATCAGACACTCTTTATTTAACTCAGGATTTATTTCTTCCATTTTATTCTAATGTTACAGGGTTTCATACTATTAATCATACGTTTGACAACCCTGTCATACCTTTTAAGGATGGTATTTATTTTGCTGCCACAGAGAAATCAAATGTTGTCCGTGGTTGGGTTTTTGGTTCTACCATGAACAACAAGTCACAGTCGGTGATTATTATTAACAATTCTACTAATGTTGTTATACGAGCATGTAACTTTGAATTGTGTGACAACCCTTTCTTTGCTGTTTCTAAACCCATGGGTACACAGACACATACTATGATATTCGATAATGCATTTAAATGCACTTTCGAGTACATATCT
88
+ ---
89
+
90
+ # DNABERT
91
+
92
+ Pre-trained model on human genome using a masked language modeling (MLM) objective with k-mer tokenization.
93
+
94
+ ## Disclaimer
95
+
96
+ This is an UNOFFICIAL implementation of the [DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome](https://doi.org/10.1093/bioinformatics/btab083) by Yanrong Ji, Zhihan Zhou, et al.
97
+
98
+ The OFFICIAL repository of DNABERT is at [jerryji1993/DNABERT](https://github.com/jerryji1993/DNABERT).
99
+
100
+ > [!TIP]
101
+ > The MultiMolecule team has confirmed that the provided model and checkpoints are producing the same intermediate representations as the original implementation.
102
+
103
+ **The team releasing DNABERT did not write this model card for this model so this model card has been written by the MultiMolecule team.**
104
+
105
+ ## Model Details
106
+
107
+ DNABERT is a [bert](https://huggingface.co/google-bert/bert-base-uncased)-style model pre-trained on the human genome with k-mer tokenization in a self-supervised fashion. This means that the model was trained on the raw nucleotides of DNA sequences only, with an automatic process to generate inputs and labels from those texts. Please refer to the [Training Details](#training-details) section for more information on the training process.
108
+
109
+ ### Variants
110
+
111
+ - **[multimolecule/dnabert-6mer-3mer](https://huggingface.co/multimolecule/dnabert-6mer-3mer)**: The DNABERT model pre-trained on 3-mer data.
112
+ - **[multimolecule/dnabert-6mer-4mer](https://huggingface.co/multimolecule/dnabert-6mer-4mer)**: The DNABERT model pre-trained on 4-mer data.
113
+ - **[multimolecule/dnabert-6mer-5mer](https://huggingface.co/multimolecule/dnabert-6mer-5mer)**: The DNABERT model pre-trained on 5-mer data.
114
+ - **[multimolecule/dnabert-6mer-6mer](https://huggingface.co/multimolecule/dnabert-6mer-6mer)**: The DNABERT model pre-trained on 6-mer data.
115
+
116
+ ### Model Specification
117
+
118
+ <table>
119
+ <thead>
120
+ <tr>
121
+ <th>Variants</th>
122
+ <th>Num Layers</th>
123
+ <th>Hidden Size</th>
124
+ <th>Num Heads</th>
125
+ <th>Intermediate Size</th>
126
+ <th>Num Parameters (M)</th>
127
+ <th>FLOPs (G)</th>
128
+ <th>MACs (G)</th>
129
+ <th>Max Num Tokens</th>
130
+ </tr>
131
+ </thead>
132
+ <tbody>
133
+ <tr>
134
+ <td><b>dnabert-6mer</b></td>
135
+ <td rowspan="4">12</td>
136
+ <td rowspan="4">768</td>
137
+ <td rowspan="4">12</td>
138
+ <td rowspan="4">3072</td>
139
+ <td>89.19</td>
140
+ <td rowspan="4">96.86</td>
141
+ <td rowspan="4">48.43</td>
142
+ <td rowspan="4">512</td>
143
+ </tr>
144
+ <tr>
145
+ <td>dnabert-5mer</td>
146
+ <td>86.83</td>
147
+ </tr>
148
+ <tr>
149
+ <td>dnabert-4mer</td>
150
+ <td>86.24</td>
151
+ </tr>
152
+ <tr>
153
+ <td>dnabert-3mer</td>
154
+ <td>86.10</td>
155
+ </tr>
156
+ </tbody>
157
+ </table>
158
+
159
+ ### Links
160
+
161
+ - **Code**: [multimolecule.dnabert](https://github.com/DLS5-Omics/multimolecule/tree/master/multimolecule/models/dnabert)
162
+ - **Data**: [multimolecule/gencode-human](https://huggingface.co/datasets/multimolecule/gencode-human)
163
+ - **Paper**: [DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome](https://doi.org/10.1093/bioinformatics/btab083)
164
+ - **Developed by**: Yanrong Ji, Zhihan Zhou, Han Liu, Ramana V Davuluri
165
+ - **Model type**: [BERT](https://huggingface.co/google-bert/bert-base-uncased)
166
+ - **Original Repositories**: [jerryji1993/DNABERT](https://github.com/jerryji1993/DNABERT)
167
+
168
+ ## Usage
169
+
170
+ The model file depends on the [`multimolecule`](https://multimolecule.danling.org) library. You can install it using pip:
171
+
172
+ ```bash
173
+ pip install multimolecule
174
+ ```
175
+
176
+ ### Direct Use
177
+
178
+ #### Masked Language Modeling
179
+
180
+ > [!WARNING]
181
+ > Default transformers pipeline does not support K-mer tokenization.
182
+
183
+ You can use this model directly with a pipeline for masked language modeling:
184
+
185
+ ```python
186
+ import multimolecule # you must import multimolecule to register models
187
+ from transformers import pipeline
188
+
189
+ predictor = pipeline("fill-mask", model="multimolecule/dnabert-6mer")
190
+ output = predictor("ATCG<mask>TGCA")
191
+ ```
192
+
193
+ ### Downstream Use
194
+
195
+ #### Extract Features
196
+
197
+ Here is how to use this model to get the features of a given sequence in PyTorch:
198
+
199
+ ```python
200
+ from multimolecule import DnaBertModel
201
+ from transformers import AutoTokenizer
202
+
203
+
204
+ tokenizer = AutoTokenizer.from_pretrained("multimolecule/dnabert-6mer")
205
+ model = DnaBertModel.from_pretrained("multimolecule/dnabert-6mer")
206
+
207
+ text = "ATCGATCGATCGATCG"
208
+ input = tokenizer(text, return_tensors="pt")
209
+
210
+ output = model(**input)
211
+ ```
212
+
213
+ #### Sequence Classification / Regression
214
+
215
+ > [!NOTE]
216
+ > This model is not fine-tuned for any specific task. You will need to fine-tune the model on a downstream task to use it for sequence classification or regression.
217
+
218
+ Here is how to use this model as backbone to fine-tune for a sequence-level task in PyTorch:
219
+
220
+ ```python
221
+ import torch
222
+ from multimolecule import DnaBertForSequencePrediction
223
+ from transformers import AutoTokenizer
224
+
225
+
226
+ tokenizer = AutoTokenizer.from_pretrained("multimolecule/dnabert-6mer")
227
+ model = DnaBertForSequencePrediction.from_pretrained("multimolecule/dnabert-6mer")
228
+
229
+ text = "ATCGATCGATCGATCG"
230
+ input = tokenizer(text, return_tensors="pt")
231
+ label = torch.tensor([1])
232
+
233
+ output = model(**input, labels=label)
234
+ ```
235
+
236
+ #### Token Classification / Regression
237
+
238
+ > [!NOTE]
239
+ > This model is not fine-tuned for any specific task. You will need to fine-tune the model on a downstream task to use it for token classification or regression.
240
+
241
+ Here is how to use this model as backbone to fine-tune for a nucleotide-level task in PyTorch:
242
+
243
+ ```python
244
+ import torch
245
+ from multimolecule import DnaBertForTokenPrediction
246
+ from transformers import AutoTokenizer
247
+
248
+
249
+ tokenizer = AutoTokenizer.from_pretrained("multimolecule/dnabert-6mer")
250
+ model = DnaBertForTokenPrediction.from_pretrained("multimolecule/dnabert-6mer")
251
+
252
+ text = "ATCGATCGATCGATCG"
253
+ input = tokenizer(text, return_tensors="pt")
254
+ label = torch.randint(2, (len(text), ))
255
+
256
+ output = model(**input, labels=label)
257
+ ```
258
+
259
+ #### Contact Classification / Regression
260
+
261
+ > [!NOTE]
262
+ > This model is not fine-tuned for any specific task. You will need to fine-tune the model on a downstream task to use it for contact classification or regression.
263
+
264
+ Here is how to use this model as backbone to fine-tune for a contact-level task in PyTorch:
265
+
266
+ ```python
267
+ import torch
268
+ from multimolecule import DnaBertForContactPrediction
269
+ from transformers import AutoTokenizer
270
+
271
+
272
+ tokenizer = AutoTokenizer.from_pretrained("multimolecule/dnabert-6mer")
273
+ model = DnaBertForContactPrediction.from_pretrained("multimolecule/dnabert-6mer")
274
+
275
+ text = "ATCGATCGATCGATCG"
276
+ input = tokenizer(text, return_tensors="pt")
277
+ label = torch.randint(2, (len(text), len(text)))
278
+
279
+ output = model(**input, labels=label)
280
+ ```
281
+
282
+ ## Training Details
283
+
284
+ DNABERT used Masked Language Modeling (MLM) as the pre-training objective: taking a sequence, the model randomly masks 15% of the tokens in the input then runs the entire masked sentence through the model and has to predict the masked tokens. This is comparable to the Cloze task in language modeling.
285
+
286
+ ### Training Data
287
+
288
+ The DNABERT model was pre-trained on the human genome. The training data consists of DNA sequences from the human reference genome (GRCh38.p13), with all sequences containing only the four canonical nucleotides (A, T, C, G).
289
+
290
+ ### Training Procedure
291
+
292
+ #### Preprocessing
293
+
294
+ DNABERT used masked language modeling (MLM) as the pre-training objective. The masking procedure is similar to the one used in BERT:
295
+
296
+ - 15% of the tokens are masked. In the last 20,000 steps, the masking rate is increased to 20%.
297
+ - In 80% of the cases, the masked tokens are replaced by `<mask>`.
298
+ - In 10% of the cases, the masked tokens are replaced by a random token (different) from the one they replace.
299
+ - In the 10% remaining cases, the masked tokens are left as is.
300
+
301
+ Since DNABERT used k-mer tokenizer, it masks the entire k-mer instead of individual nucleotides to avoid information leakage.
302
+
303
+ For example, if the k-mer is 3, the sequence `"TAGCGTAT"` will be tokenized as `["TAG", "AGC", "GCG", "CGT", "GTA", "TAT"]`. If the nucleotide `"C"` is masked, the adjacent tokens will also be masked, resulting `["TAG", "<mask>", "<mask>", "<mask>", "GTA", "TAT"]`.
304
+
305
+ #### Pre-training
306
+
307
+ The model was trained on 8 NVIDIA RTX 2080Ti GPUs.
308
+
309
+ - Batch size: 2,000
310
+ - Steps: 120,000
311
+ - Learning rate: 4e-4
312
+ - Learning rate scheduler: Linear
313
+ - Learning rate warm-up: 10,000 steps
314
+
315
+ ## Citation
316
+
317
+ ```bibtex
318
+ @ARTICLE{Ji2021-cj,
319
+ title = "{DNABERT}: pre-trained Bidirectional Encoder Representations
320
+ from Transformers model for {DNA-language} in genome",
321
+ author = "Ji, Yanrong and Zhou, Zhihan and Liu, Han and Davuluri, Ramana V",
322
+ abstract = "MOTIVATION: Deciphering the language of non-coding DNA is one of
323
+ the fundamental problems in genome research. Gene regulatory
324
+ code is highly complex due to the existence of polysemy and
325
+ distant semantic relationship, which previous informatics
326
+ methods often fail to capture especially in data-scarce
327
+ scenarios. RESULTS: To address this challenge, we developed a
328
+ novel pre-trained bidirectional encoder representation, named
329
+ DNABERT, to capture global and transferrable understanding of
330
+ genomic DNA sequences based on up and downstream nucleotide
331
+ contexts. We compared DNABERT to the most widely used programs
332
+ for genome-wide regulatory elements prediction and demonstrate
333
+ its ease of use, accuracy and efficiency. We show that the
334
+ single pre-trained transformers model can simultaneously achieve
335
+ state-of-the-art performance on prediction of promoters, splice
336
+ sites and transcription factor binding sites, after easy
337
+ fine-tuning using small task-specific labeled data. Further,
338
+ DNABERT enables direct visualization of nucleotide-level
339
+ importance and semantic relationship within input sequences for
340
+ better interpretability and accurate identification of conserved
341
+ sequence motifs and functional genetic variant candidates.
342
+ Finally, we demonstrate that pre-trained DNABERT with human
343
+ genome can even be readily applied to other organisms with
344
+ exceptional performance. We anticipate that the pre-trained
345
+ DNABERT model can be fined tuned to many other sequence analyses
346
+ tasks. AVAILABILITY AND IMPLEMENTATION: The source code,
347
+ pretrained and finetuned model for DNABERT are available at
348
+ GitHub (https://github.com/jerryji1993/DNABERT). SUPPLEMENTARY
349
+ INFORMATION: Supplementary data are available at Bioinformatics
350
+ online.",
351
+ journal = "Bioinformatics",
352
+ publisher = "Oxford University Press (OUP)",
353
+ volume = 37,
354
+ number = 15,
355
+ pages = "2112--2120",
356
+ month = aug,
357
+ year = 2021,
358
+ copyright = "https://academic.oup.com/journals/pages/open\_access/funder\_policies/chorus/standard\_publication\_model",
359
+ language = "en"
360
+ }
361
+ ```
362
+
363
+ > [!NOTE]
364
+ > The artifacts distributed in this repository are part of the MultiMolecule project.
365
+ > If you use MultiMolecule in your research, you must cite the MultiMolecule project as follows:
366
+
367
+ ```bibtex
368
+ @software{chen_2024_12638419,
369
+ author = {Chen, Zhiyuan and Zhu, Sophia Y.},
370
+ title = {MultiMolecule},
371
+ doi = {10.5281/zenodo.12638419},
372
+ publisher = {Zenodo},
373
+ url = {https://doi.org/10.5281/zenodo.12638419},
374
+ year = 2024,
375
+ month = may,
376
+ day = 4
377
+ }
378
+ ```
379
+
380
+ ## Contact
381
+
382
+ Please use GitHub issues of [MultiMolecule](https://github.com/DLS5-Omics/multimolecule/issues) for any questions or comments on the model card.
383
+
384
+ Please contact the authors of the [DNABERT paper](https://doi.org/10.1093/bioinformatics/btab083) for questions or comments on the paper/model.
385
+
386
+ ## License
387
+
388
+ This model is licensed under the [GNU Affero General Public License](license.md).
389
+
390
+ For additional terms and clarifications, please refer to our [License FAQ](license-faq.md).
391
+
392
+ ```spdx
393
+ SPDX-License-Identifier: AGPL-3.0-or-later
394
+ ```
config.json ADDED
@@ -0,0 +1,39 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "add_cross_attention": false,
3
+ "architectures": [
4
+ "DnaBertForMaskedLM"
5
+ ],
6
+ "attention_dropout": 0.1,
7
+ "bos_token_id": 0,
8
+ "dtype": "float32",
9
+ "eos_token_id": 2,
10
+ "head": null,
11
+ "hidden_act": "gelu",
12
+ "hidden_dropout": 0.1,
13
+ "hidden_size": 768,
14
+ "id2label": {
15
+ "0": "LABEL_0"
16
+ },
17
+ "initializer_range": 0.02,
18
+ "intermediate_size": 3072,
19
+ "is_decoder": false,
20
+ "label2id": {
21
+ "LABEL_0": 0
22
+ },
23
+ "layer_norm_eps": 1e-12,
24
+ "lm_head": null,
25
+ "mask_token_id": 4,
26
+ "max_position_embeddings": 512,
27
+ "model_type": "dnabert",
28
+ "null_token_id": 5,
29
+ "num_attention_heads": 12,
30
+ "num_hidden_layers": 12,
31
+ "pad_token_id": 0,
32
+ "position_embedding_type": "absolute",
33
+ "tie_word_embeddings": true,
34
+ "transformers_version": "5.2.0",
35
+ "type_vocab_size": 2,
36
+ "unk_token_id": 3,
37
+ "use_cache": true,
38
+ "vocab_size": 4102
39
+ }
license-faq.md ADDED
@@ -0,0 +1,299 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # License FAQ
2
+
3
+ This License FAQ (Frequently Asked Questions) clarifies the terms and conditions governing the use of the materials in the MultiMolecule project (the "MultiMolecule") provided by the DanLing Team (also known as DanLing) ("we," "us," or "our").
4
+ This FAQ serves as an addendum to, and is incorporated by reference into, the [GNU Affero General Public License (AGPL)](license.md) (the "License").
5
+ This FAQ and the License together constitute the entire agreement (the "Agreement") between you and us regarding your use of MultiMolecule.
6
+ Capitalized terms used but not defined in this FAQ have the meanings given to them in the AGPL.
7
+
8
+ ## 0. Summary of Key Points
9
+
10
+ This summary highlights the key aspects of our license.
11
+ For more detailed information, please refer to the corresponding sections below and read the [License](license.md).
12
+
13
+ <div class="grid cards" markdown>
14
+
15
+ !!! question "What are source code and object code in MultiMolecule?"
16
+
17
+ The Source Code includes all materials necessary to develop, train, evaluate, and run a model, including data, code, configuration, and documentation.
18
+ The Object Code includes model weight files and compiled code.
19
+
20
+ [:octicons-arrow-right-24: What are source code and object code in MultiMolecule?](#1-what-are-source-code-and-object-code-in-multimolecule)
21
+
22
+ !!! question "Am I required to share my trained model?"
23
+
24
+ If you Convey model weight files hosted and distributed by MultiMolecule, you must convey those weight files under the Agreement, along with the Corresponding Source.
25
+ If you Convey modified versions of such weight files (for example, fine-tuned weights), you must convey those modified weight files under the Agreement, along with the Corresponding Source.
26
+ These obligations apply regardless of whether you used MultiMolecule, a third-party library, or a customized training pipeline to produce the conveyed weights.
27
+
28
+ [:octicons-arrow-right-24: Am I required to share my trained model?](#2-am-i-required-to-share-my-trained-model)
29
+
30
+ !!! question "Am I required to share the data used for training?"
31
+
32
+ If you Convey any model weights covered by Section 2, you must also provide to recipients under the Agreement any training datasets used to train, update, or modify those weights, excluding data used solely for evaluation as clarified in Section 3.
33
+
34
+ [:octicons-arrow-right-24: Am I required to share the data used for training?](#3-am-i-required-to-share-the-data-used-for-training)
35
+
36
+ !!! question "Do I need to acknowledge MultiMolecule?"
37
+
38
+ We strongly encourage acknowledgement whenever MultiMolecule contributes to your work.
39
+ Citation is strongly requested for all research papers and becomes mandatory only as a condition of additional permissions granted in Section 5 or Section 8.
40
+ Reasonable author attribution in Appropriate Legal Notices is required in certain distributed interactive interfaces.
41
+
42
+ [:octicons-arrow-right-24: Do I need to acknowledge MultiMolecule?](#4-do-i-need-to-acknowledge-multimolecule)
43
+
44
+ !!! question "Can I publish research papers using MultiMolecule?"
45
+
46
+ If your manuscript or supplements include MultiMolecule materials (as described in Section 5), then the manuscript and supplements must be distributed under the License unless an additional permission applies.
47
+ Section 5 and Section 8 grant additional permissions to enable publication in certain venues under manuscript-sharing licenses, subject to Section 6.
48
+ If you cannot comply with the License and do not qualify for an additional permission, you may not distribute the manuscript or supplements with MultiMolecule materials.
49
+
50
+ [:octicons-arrow-right-24: Can I publish research papers using MultiMolecule?](#5-can-i-publish-research-papers-using-multimolecule)
51
+
52
+ !!! question "Is there any restriction on publishing research papers in certain venues?"
53
+
54
+ Yes, there are restrictions on publishing research papers in certain venues under the additional permissions granted by this FAQ.
55
+ Section 6 limits only additional permissions granted by this FAQ.
56
+ Section 6 does not restrict publication under the [License](license.md) itself.
57
+
58
+ [:octicons-arrow-right-24: Restrictions on Publishing Research Papers in Certain Venues](#6-restrictions-on-publishing-research-papers-in-certain-venues)
59
+
60
+ !!! question "Can I use MultiMolecule for commercial purposes?"
61
+
62
+ Yes, you can use MultiMolecule for commercial purposes under the terms of the Agreement.
63
+ If you prefer commercial use without the obligations that apply when you Convey covered materials, you must obtain a separate license.
64
+
65
+ [:octicons-arrow-right-24: Can I use MultiMolecule for commercial purposes?](#7-can-i-use-multimolecule-for-commercial-purposes)
66
+
67
+ !!! question "Are there special permissions for MultiMolecule Collaborators?"
68
+
69
+ Yes, recognized Collaborators are granted specific additional permissions pursuant to Section 7 of the License.
70
+ These permissions are subject to the stated conditions and to Section 6.
71
+
72
+ [:octicons-arrow-right-24: Are there special permissions for MultiMolecule Collaborators?](#8-are-there-special-permissions-for-multimolecule-collaborators)
73
+
74
+ </div>
75
+
76
+ ## 1. What are source code and object code in MultiMolecule?
77
+
78
+ For all materials in the MultiMolecule project, the following definitions clarify and supplement those found in the [License](license.md).
79
+
80
+ > [!TIP] Scope of materials hosted by MultiMolecule
81
+ > Unless explicitly stated otherwise in the relevant model card, dataset card, file header, directory notice, or accompanying LICENSE/NOTICE, all model weights, datasets, code, configuration, and documentation hosted and distributed by MultiMolecule are provided under the Agreement.
82
+ > If we host any specific item under different terms in the future, we will explicitly label that item, and the stated terms will control for that item.
83
+
84
+ > [!IMPORTANT] Source Code
85
+ > **Source Code** refers to the preferred form of the licensed materials for making modifications thereto, consistent with Section 1 of the License.
86
+ > It encompasses all materials necessary for developing, training, evaluating and running the models.
87
+
88
+ Source Code includes, but is not limited to:
89
+
90
+ - **Data**: The datasets, in the form needed for processing, that are required for training, evaluating, or running the models provided or generated as part of the licensed materials.
91
+ - **Code**: All source code for scripts, programs, libraries (including model architecture and pipeline definitions), and utilities required to process data, train models, perform evaluations, deploy the models, or otherwise operate and modify the licensed materials.
92
+ - **Configuration**: Configuration files, settings parameters, environmental specifications, and any scripts used to control the installation, compilation, training, evaluation, running, or execution processes related to the licensed materials.
93
+ - **Documentation**: Interface definition files, build instructions, manuals, guides, research papers and technical reports distributed by MultiMolecule as part of the licensed materials describing the specific methodologies, architectures and parameters used, and any other technical documentation necessary to understand, install, operate, and modify the licensed materials.
94
+
95
+ *Providing the Source Code as defined here is necessary to satisfy the requirement to provide the "Corresponding Source" under the Agreement (and where applicable, under the License) when conveying Object Code.*
96
+
97
+ > [!IMPORTANT] Object Code
98
+ > **Object Code** refers to any form of the licensed materials that is not Source Code.
99
+
100
+ Object Code primarily includes, but is not limited to:
101
+
102
+ - **Model Weights**: The numerical parameters representing the learned state of a model after training (e.g., files in SafeTensors, HDF5, or similar formats).
103
+ This includes all model weights provided or hosted by MultiMolecule except for those stated otherwise.
104
+ This also includes any fine-tuned model weights derived from model weights provided or hosted by MultiMolecule.
105
+ - **Compiled Code**: Any executable software code not in human-readable source form, like compiled C++ extensions sometimes found in Python packages.
106
+
107
+ *For model weights treated as Object Code in MultiMolecule, the Corresponding Source includes, at a minimum, the training data and the scripts needed to reproduce the conveyed weights.*
108
+
109
+ Understanding this distinction helps clarify your obligations under the Agreement.
110
+ For instance, if you Convey Object Code (like model weights), you must also ensure the corresponding Source Code (including the necessary data, code, configuration, and documentation) is available under the terms of the Agreement.
111
+
112
+ ## 2. Am I required to share my trained model?
113
+
114
+ If you Convey model weights covered by the Agreement, you must convey those weights under the Agreement, along with the Corresponding Source.
115
+
116
+ As explained in Section 1 of this FAQ, model weights are treated as Object Code in MultiMolecule.
117
+ Whenever you Convey Object Code under the Agreement, you must also provide the Corresponding Source.
118
+ For model weights covered by the Agreement, Corresponding Source includes, at a minimum, the code, training data, configuration, and scripts needed to reproduce, install, run, and modify the conveyed weights, as clarified in Section 1 of this FAQ.
119
+
120
+ If you modify MultiMolecule and you provide users remote interaction with your modified version through a computer network, you must comply with Section 13 of the License by offering the Corresponding Source of your modified version to those users.
121
+ This Section 13 obligation concerns remote interaction with the modified MultiMolecule Program itself.
122
+
123
+ Section 3 specifies the training-data requirement.
124
+
125
+ ## 3. Am I required to share the data used for training?
126
+
127
+ If you Convey model weights covered by the Agreement, you must also provide to recipients under the Agreement any training datasets used to train, update, or modify those weights, excluding data used solely for evaluation as clarified below.
128
+
129
+ As explained in Section 1 of this FAQ, training datasets are treated as Source Code in MultiMolecule.
130
+ Accordingly, whenever you are required to provide Corresponding Source under the Agreement, the required Corresponding Source includes the training data required to reproduce the conveyed weights, as clarified in Section 1 of this FAQ.
131
+
132
+ This requirement applies only to data used to train, update, or modify the conveyed model weights.
133
+ Data used solely for evaluation is not required under this provision, provided it was not also used for training.
134
+
135
+ If the required training data cannot be provided to recipients under the Agreement, you may not Convey the resulting weights under the Agreement.
136
+
137
+ ## 4. Do I need to acknowledge MultiMolecule?
138
+
139
+ We strongly encourage acknowledgement whenever MultiMolecule contributes to your work.
140
+ This section distinguishes
141
+
142
+ - (a) what we strongly request as a community norm, and
143
+ - (b) what becomes mandatory as a condition of additional permissions granted by this FAQ.
144
+
145
+ > [!NOTE]
146
+ > We strongly encourage formal citation in any research paper that uses MultiMolecule.
147
+ > If you publish a paper solely under the [License](license.md), citation is not a condition of license compliance, but it is strongly requested.
148
+ > If you rely on an additional permission in Section 5 or Section 8, formal citation is a condition of that additional permission.
149
+
150
+ When citation is required under this FAQ (e.g., as a condition of an additional permission in Section 5 or Section 8), it must include, at a minimum, the project name (“MultiMolecule”) and the DOI ([10.5281/zenodo.12638419](https://doi.org/10.5281/zenodo.12638419)).
151
+ If the venue does not support formal citations, the project name and DOI must instead appear in the acknowledgments section.
152
+
153
+ > [!IMPORTANT]
154
+ > If you Convey a program that incorporates MultiMolecule, you must preserve a reasonable author attribution for MultiMolecule in the Appropriate Legal Notices.
155
+
156
+ If the Program has an interactive user interface, it must display Appropriate Legal Notices.
157
+ Those notices must include a reasonable author attribution for MultiMolecule, including the project name and a link to the official repository or website.
158
+
159
+ For command-line programs, this attribution must be shown prominently at startup and be available via `--help`, `--version`, or `--about` where applicable.
160
+ For web services, this attribution must be shown prominently on the main page or another readily accessible location.
161
+
162
+ For libraries or non-interactive components, this attribution must be shown prominently in the documentation and, if the component provides any interactive interface, in that interface.
163
+
164
+ ## 5. Can I publish research papers using MultiMolecule?
165
+
166
+ > [!IMPORTANT]
167
+ > As clarified in Section 1, Documentation is part of Source Code in MultiMolecule.
168
+ > Accordingly, if your manuscript or supplements include MultiMolecule materials (i.e., contain or reproduce MultiMolecule code, weights, datasets, documentation text, figures, or other MultiMolecule materials), then the manuscript and supplements are treated as Documentation distributed with those MultiMolecule materials.
169
+ > This requirement concerns manuscripts and supplements that include MultiMolecule materials, not separate and independent works that are merely distributed alongside MultiMolecule.
170
+ > Therefore, absent an additional permission, the manuscript and supplements that include MultiMolecule materials must be distributed under the License to the extent the manuscript or supplements contain or reproduce MultiMolecule materials, as part of the same distribution of those MultiMolecule materials.
171
+ > If you cannot comply with the License, you may not distribute the manuscript or supplements with MultiMolecule materials under the Agreement.
172
+
173
+ This section grants additional permissions under Section 7 of the License for specific publication scenarios in which authors prefer to release manuscripts under manuscript-sharing licenses.
174
+ For avoidance of doubt, the additional permissions in this Section 5 apply only to the manuscript text and other documentary materials.
175
+ They do not alter the License that governs any MultiMolecule code, model weights, or datasets that you Convey, which remain under the Agreement unless explicitly stated otherwise.
176
+
177
+
178
+ > [!IMPORTANT]
179
+ > If you rely on any additional permission in this Section 5, formal citation as described in Section 4 is a condition of that additional permission.
180
+ > Any additional permission granted in this Section 5 remains subject to Section 6.
181
+
182
+ > [!TIP] Diamond Open Access
183
+ > Diamond open access venues are permitted under the additional permissions below.
184
+
185
+ You may publish manuscripts that Convey MultiMolecule materials in fully open access journals, conferences, or platforms that do not charge fees to either authors or readers.
186
+
187
+ The public version of the manuscript must be made available under a license that permits sharing of manuscripts.
188
+ You may use one of the following licenses.
189
+
190
+ - GNU Free Documentation License (GFDL)
191
+ - Creative Commons licenses
192
+ - OSI-approved licenses
193
+
194
+ This permission is granted as an additional permission under Section 7 of the [License](license.md).
195
+
196
+ > [!WARNING] Non-Profit
197
+ > Certain non-profit venues are permitted under the additional permissions below.
198
+
199
+ You may publish manuscripts that Convey MultiMolecule materials in certain non-profit journals, conferences, or platforms.
200
+
201
+ This includes the following venues.
202
+
203
+ - eLife
204
+
205
+ This permission is granted as an additional permission under Section 7 of the [License](license.md).
206
+
207
+ > [!CAUTION] Closed-Access / Author-Fee
208
+ > Closed-access or author-fee venues often make compliance impossible.
209
+
210
+ We do not endorse publishing MultiMolecule materials in closed-access or author-fee venues.
211
+
212
+ If a venue’s terms would prevent you from complying with the License for any MultiMolecule materials you Convey in connection with the publication, you must obtain a separate written license agreement from us prior to submission or publication.
213
+ Such an agreement may involve conditions such as co-authorship or financial contributions to the project.
214
+
215
+ ## 6. Restrictions on Publishing Research Papers in Certain Venues
216
+
217
+ > [!IMPORTANT]
218
+ > This section limits only additional permissions granted by this FAQ.
219
+ > This section does not restrict publication under the [License](license.md) itself.
220
+ > If you Convey MultiMolecule materials as part of a publication and you comply with the License for those materials, Section 6 does not apply.
221
+
222
+ Accordingly, Section 6 constrains only the additional permissions in Section 5 and Section 8.
223
+ Section 6 also constrains any separate written license agreement that incorporates or references this FAQ, unless that separate agreement expressly states otherwise in writing.
224
+
225
+ We believe that free and open access to research is a cornerstone of the machine learning community.
226
+ Inspired by the [Statement on Nature Machine Intelligence](https://openaccess.engineering.oregonstate.edu), and by the ongoing culture of [zero-cost open access](https://diamasproject.eu), we hold that research should be universally accessible without barriers to authors or readers.
227
+
228
+ The following publication venues adopt closed-access or author-fee models that contradict these fundamental values.
229
+ We view such practices as a regressive step in the evolution of machine learning research dissemination, one that undermines community efforts to foster open collaboration and knowledge sharing.
230
+
231
+ - Nature Machine Intelligence
232
+
233
+ Notwithstanding Sections 5 and 8, none of the additional permissions granted by this FAQ authorize submission or publication of a manuscript that Conveys MultiMolecule materials in the venues listed above.
234
+ You may submit or publish such a manuscript in the venues listed above only if you have a separate written license agreement from us that expressly permits publication notwithstanding this Section 6.
235
+
236
+ We strongly discourage publishing work that Conveys MultiMolecule materials in the venues listed above.
237
+
238
+ ## 7. Can I use MultiMolecule for commercial purposes?
239
+
240
+ Yes.
241
+ You may use MultiMolecule for commercial purposes, provided you comply with the Agreement.
242
+
243
+ If you Convey modified MultiMolecule materials, you must provide the Corresponding Source and related artifacts required by the License and this FAQ.
244
+
245
+ Where applicable, this includes training data as clarified in Section 3 and model weights as clarified in Section 2.
246
+
247
+ If you prefer commercial use without making such materials available under the License, you must obtain a separate written license agreement from us.
248
+ Please contact [license@danling.org](mailto:license@danling.org) for details.
249
+
250
+ ## 8. Are there special permissions for MultiMolecule Collaborators?
251
+
252
+ Yes.
253
+ If you are recognized as a Collaborator by the DanLing Team, you are entitled to the following additional permissions granted under Section 7 of the [License](license.md).
254
+
255
+ > [!TIP] Internal network use waiver
256
+ > Notwithstanding Section 13 of the License, Collaborators receive a waiver of the obligation to offer Corresponding Source to users interacting remotely through a computer network with a modified version of MultiMolecule, provided that the interaction is solely for internal research and development within the Collaborator’s team.
257
+ > This waiver does not apply to external users, public deployments, or Conveyance.
258
+
259
+ > [!TIP] Expanded permission for publishing papers
260
+ > Collaborators may publish manuscripts that Convey MultiMolecule materials in any peer-reviewed scientific venue, including journals and conference proceedings, regardless of access model or author fees.
261
+ > This expanded permission is granted as an additional permission under Section 7 of the [License](license.md).
262
+ > This expanded permission remains subject to Section 6 of this FAQ.
263
+ > This expanded permission affects only the licensing of the manuscript and supplementary documentation, and does not alter the License that governs any MultiMolecule materials you Convey.
264
+ > As a condition of this expanded permission, you must comply with the acknowledgement and citation requirements in Section 4.
265
+
266
+ > [!IMPORTANT] Source release timing related to publications
267
+ > This additional permission concerns the timing of *public release* of publication-related modifications.
268
+ > It does not delay any obligation under the License to provide Corresponding Source to recipients upon Conveyance, or to remote users upon network interaction under Section 13.
269
+ > If your modifications are utilized in research described in a manuscript, you must make the Corresponding Source for those publication-related modifications publicly available upon the first of the following events.
270
+ >
271
+ > - The manuscript’s formal acceptance for publication in a peer-reviewed venue.
272
+ > - 366 days have passed since the manuscript was first posted on a public preprint server.
273
+ >
274
+ > You must make the public release immediately upon the first applicable trigger event.
275
+ > If modifications are Conveyed or made available for remote interaction through a computer network in ways not tied to a publication or preprint, the standard timing rules of the License apply.
276
+
277
+ > [!NOTE] General conditions for Collaborator permissions
278
+ > These permissions are granted only to active, invited Collaborators recognized by the DanLing Team.
279
+ > These permissions are non-transferable and non-sublicensable.
280
+ > All other provisions of the License and this FAQ remain in full force and effect unless explicitly modified above.
281
+ > The DanLing Team may grant additional case-specific permissions through written communication.
282
+
283
+ ## 9. How can I use MultiMolecule if my organization forbids the use of code under the AGPL License?
284
+
285
+ Certain organizations, such as [Google](https://opensource.google/documentation/reference/using/agpl-policy), prohibit the use of AGPL-licensed code.
286
+ If you are affiliated with an organization that disallows the use of AGPL-licensed software, you must obtain a separate license from us to use MultiMolecule.
287
+
288
+ To request a separate license, please contact us at [license@danling.org](mailto:license@danling.org).
289
+
290
+ ## 10. Do we make updates to this FAQ?
291
+
292
+ > [!TIP] "In Short"
293
+ > Yes, we will update this FAQ as necessary to stay compliant with relevant laws.
294
+
295
+ We may update this license FAQ from time to time.
296
+ The updated version will be indicated by an updated 'Last Revised Time' at the bottom of this license FAQ.
297
+ If we make any material changes, we will notify you by posting the new license FAQ on this page.
298
+ We are unable to notify you directly as we do not collect any contact information from you.
299
+ We encourage you to review this license FAQ frequently to stay informed of how you can use our data, models, code, configuration, and documentation.
license.md ADDED
@@ -0,0 +1,661 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # GNU AFFERO GENERAL PUBLIC LICENSE
2
+
3
+ Version 3, 19 November 2007
4
+
5
+ Copyright (C) 2007 Free Software Foundation, Inc.
6
+ <https://fsf.org/>
7
+
8
+ Everyone is permitted to copy and distribute verbatim copies of this
9
+ license document, but changing it is not allowed.
10
+
11
+ ## Preamble
12
+
13
+ The GNU Affero General Public License is a free, copyleft license for
14
+ software and other kinds of works, specifically designed to ensure
15
+ cooperation with the community in the case of network server software.
16
+
17
+ The licenses for most software and other practical works are designed
18
+ to take away your freedom to share and change the works. By contrast,
19
+ our General Public Licenses are intended to guarantee your freedom to
20
+ share and change all versions of a program--to make sure it remains
21
+ free software for all its users.
22
+
23
+ When we speak of free software, we are referring to freedom, not
24
+ price. Our General Public Licenses are designed to make sure that you
25
+ have the freedom to distribute copies of free software (and charge for
26
+ them if you wish), that you receive source code or can get it if you
27
+ want it, that you can change the software or use pieces of it in new
28
+ free programs, and that you know you can do these things.
29
+
30
+ Developers that use our General Public Licenses protect your rights
31
+ with two steps: (1) assert copyright on the software, and (2) offer
32
+ you this License which gives you legal permission to copy, distribute
33
+ and/or modify the software.
34
+
35
+ A secondary benefit of defending all users' freedom is that
36
+ improvements made in alternate versions of the program, if they
37
+ receive widespread use, become available for other developers to
38
+ incorporate. Many developers of free software are heartened and
39
+ encouraged by the resulting cooperation. However, in the case of
40
+ software used on network servers, this result may fail to come about.
41
+ The GNU General Public License permits making a modified version and
42
+ letting the public access it on a server without ever releasing its
43
+ source code to the public.
44
+
45
+ The GNU Affero General Public License is designed specifically to
46
+ ensure that, in such cases, the modified source code becomes available
47
+ to the community. It requires the operator of a network server to
48
+ provide the source code of the modified version running there to the
49
+ users of that server. Therefore, public use of a modified version, on
50
+ a publicly accessible server, gives the public access to the source
51
+ code of the modified version.
52
+
53
+ An older license, called the Affero General Public License and
54
+ published by Affero, was designed to accomplish similar goals. This is
55
+ a different license, not a version of the Affero GPL, but Affero has
56
+ released a new version of the Affero GPL which permits relicensing
57
+ under this license.
58
+
59
+ The precise terms and conditions for copying, distribution and
60
+ modification follow.
61
+
62
+ ## TERMS AND CONDITIONS
63
+
64
+ ### 0. Definitions.
65
+
66
+ "This License" refers to version 3 of the GNU Affero General Public
67
+ License.
68
+
69
+ "Copyright" also means copyright-like laws that apply to other kinds
70
+ of works, such as semiconductor masks.
71
+
72
+ "The Program" refers to any copyrightable work licensed under this
73
+ License. Each licensee is addressed as "you". "Licensees" and
74
+ "recipients" may be individuals or organizations.
75
+
76
+ To "modify" a work means to copy from or adapt all or part of the work
77
+ in a fashion requiring copyright permission, other than the making of
78
+ an exact copy. The resulting work is called a "modified version" of
79
+ the earlier work or a work "based on" the earlier work.
80
+
81
+ A "covered work" means either the unmodified Program or a work based
82
+ on the Program.
83
+
84
+ To "propagate" a work means to do anything with it that, without
85
+ permission, would make you directly or secondarily liable for
86
+ infringement under applicable copyright law, except executing it on a
87
+ computer or modifying a private copy. Propagation includes copying,
88
+ distribution (with or without modification), making available to the
89
+ public, and in some countries other activities as well.
90
+
91
+ To "convey" a work means any kind of propagation that enables other
92
+ parties to make or receive copies. Mere interaction with a user
93
+ through a computer network, with no transfer of a copy, is not
94
+ conveying.
95
+
96
+ An interactive user interface displays "Appropriate Legal Notices" to
97
+ the extent that it includes a convenient and prominently visible
98
+ feature that (1) displays an appropriate copyright notice, and (2)
99
+ tells the user that there is no warranty for the work (except to the
100
+ extent that warranties are provided), that licensees may convey the
101
+ work under this License, and how to view a copy of this License. If
102
+ the interface presents a list of user commands or options, such as a
103
+ menu, a prominent item in the list meets this criterion.
104
+
105
+ ### 1. Source Code.
106
+
107
+ The "source code" for a work means the preferred form of the work for
108
+ making modifications to it. "Object code" means any non-source form of
109
+ a work.
110
+
111
+ A "Standard Interface" means an interface that either is an official
112
+ standard defined by a recognized standards body, or, in the case of
113
+ interfaces specified for a particular programming language, one that
114
+ is widely used among developers working in that language.
115
+
116
+ The "System Libraries" of an executable work include anything, other
117
+ than the work as a whole, that (a) is included in the normal form of
118
+ packaging a Major Component, but which is not part of that Major
119
+ Component, and (b) serves only to enable use of the work with that
120
+ Major Component, or to implement a Standard Interface for which an
121
+ implementation is available to the public in source code form. A
122
+ "Major Component", in this context, means a major essential component
123
+ (kernel, window system, and so on) of the specific operating system
124
+ (if any) on which the executable work runs, or a compiler used to
125
+ produce the work, or an object code interpreter used to run it.
126
+
127
+ The "Corresponding Source" for a work in object code form means all
128
+ the source code needed to generate, install, and (for an executable
129
+ work) run the object code and to modify the work, including scripts to
130
+ control those activities. However, it does not include the work's
131
+ System Libraries, or general-purpose tools or generally available free
132
+ programs which are used unmodified in performing those activities but
133
+ which are not part of the work. For example, Corresponding Source
134
+ includes interface definition files associated with source files for
135
+ the work, and the source code for shared libraries and dynamically
136
+ linked subprograms that the work is specifically designed to require,
137
+ such as by intimate data communication or control flow between those
138
+ subprograms and other parts of the work.
139
+
140
+ The Corresponding Source need not include anything that users can
141
+ regenerate automatically from other parts of the Corresponding Source.
142
+
143
+ The Corresponding Source for a work in source code form is that same
144
+ work.
145
+
146
+ ### 2. Basic Permissions.
147
+
148
+ All rights granted under this License are granted for the term of
149
+ copyright on the Program, and are irrevocable provided the stated
150
+ conditions are met. This License explicitly affirms your unlimited
151
+ permission to run the unmodified Program. The output from running a
152
+ covered work is covered by this License only if the output, given its
153
+ content, constitutes a covered work. This License acknowledges your
154
+ rights of fair use or other equivalent, as provided by copyright law.
155
+
156
+ You may make, run and propagate covered works that you do not convey,
157
+ without conditions so long as your license otherwise remains in force.
158
+ You may convey covered works to others for the sole purpose of having
159
+ them make modifications exclusively for you, or provide you with
160
+ facilities for running those works, provided that you comply with the
161
+ terms of this License in conveying all material for which you do not
162
+ control copyright. Those thus making or running the covered works for
163
+ you must do so exclusively on your behalf, under your direction and
164
+ control, on terms that prohibit them from making any copies of your
165
+ copyrighted material outside their relationship with you.
166
+
167
+ Conveying under any other circumstances is permitted solely under the
168
+ conditions stated below. Sublicensing is not allowed; section 10 makes
169
+ it unnecessary.
170
+
171
+ ### 3. Protecting Users' Legal Rights From Anti-Circumvention Law.
172
+
173
+ No covered work shall be deemed part of an effective technological
174
+ measure under any applicable law fulfilling obligations under article
175
+ 11 of the WIPO copyright treaty adopted on 20 December 1996, or
176
+ similar laws prohibiting or restricting circumvention of such
177
+ measures.
178
+
179
+ When you convey a covered work, you waive any legal power to forbid
180
+ circumvention of technological measures to the extent such
181
+ circumvention is effected by exercising rights under this License with
182
+ respect to the covered work, and you disclaim any intention to limit
183
+ operation or modification of the work as a means of enforcing, against
184
+ the work's users, your or third parties' legal rights to forbid
185
+ circumvention of technological measures.
186
+
187
+ ### 4. Conveying Verbatim Copies.
188
+
189
+ You may convey verbatim copies of the Program's source code as you
190
+ receive it, in any medium, provided that you conspicuously and
191
+ appropriately publish on each copy an appropriate copyright notice;
192
+ keep intact all notices stating that this License and any
193
+ non-permissive terms added in accord with section 7 apply to the code;
194
+ keep intact all notices of the absence of any warranty; and give all
195
+ recipients a copy of this License along with the Program.
196
+
197
+ You may charge any price or no price for each copy that you convey,
198
+ and you may offer support or warranty protection for a fee.
199
+
200
+ ### 5. Conveying Modified Source Versions.
201
+
202
+ You may convey a work based on the Program, or the modifications to
203
+ produce it from the Program, in the form of source code under the
204
+ terms of section 4, provided that you also meet all of these
205
+ conditions:
206
+
207
+ - a) The work must carry prominent notices stating that you modified
208
+ it, and giving a relevant date.
209
+ - b) The work must carry prominent notices stating that it is
210
+ released under this License and any conditions added under
211
+ section 7. This requirement modifies the requirement in section 4
212
+ to "keep intact all notices".
213
+ - c) You must license the entire work, as a whole, under this
214
+ License to anyone who comes into possession of a copy. This
215
+ License will therefore apply, along with any applicable section 7
216
+ additional terms, to the whole of the work, and all its parts,
217
+ regardless of how they are packaged. This License gives no
218
+ permission to license the work in any other way, but it does not
219
+ invalidate such permission if you have separately received it.
220
+ - d) If the work has interactive user interfaces, each must display
221
+ Appropriate Legal Notices; however, if the Program has interactive
222
+ interfaces that do not display Appropriate Legal Notices, your
223
+ work need not make them do so.
224
+
225
+ A compilation of a covered work with other separate and independent
226
+ works, which are not by their nature extensions of the covered work,
227
+ and which are not combined with it such as to form a larger program,
228
+ in or on a volume of a storage or distribution medium, is called an
229
+ "aggregate" if the compilation and its resulting copyright are not
230
+ used to limit the access or legal rights of the compilation's users
231
+ beyond what the individual works permit. Inclusion of a covered work
232
+ in an aggregate does not cause this License to apply to the other
233
+ parts of the aggregate.
234
+
235
+ ### 6. Conveying Non-Source Forms.
236
+
237
+ You may convey a covered work in object code form under the terms of
238
+ sections 4 and 5, provided that you also convey the machine-readable
239
+ Corresponding Source under the terms of this License, in one of these
240
+ ways:
241
+
242
+ - a) Convey the object code in, or embodied in, a physical product
243
+ (including a physical distribution medium), accompanied by the
244
+ Corresponding Source fixed on a durable physical medium
245
+ customarily used for software interchange.
246
+ - b) Convey the object code in, or embodied in, a physical product
247
+ (including a physical distribution medium), accompanied by a
248
+ written offer, valid for at least three years and valid for as
249
+ long as you offer spare parts or customer support for that product
250
+ model, to give anyone who possesses the object code either (1) a
251
+ copy of the Corresponding Source for all the software in the
252
+ product that is covered by this License, on a durable physical
253
+ medium customarily used for software interchange, for a price no
254
+ more than your reasonable cost of physically performing this
255
+ conveying of source, or (2) access to copy the Corresponding
256
+ Source from a network server at no charge.
257
+ - c) Convey individual copies of the object code with a copy of the
258
+ written offer to provide the Corresponding Source. This
259
+ alternative is allowed only occasionally and noncommercially, and
260
+ only if you received the object code with such an offer, in accord
261
+ with subsection 6b.
262
+ - d) Convey the object code by offering access from a designated
263
+ place (gratis or for a charge), and offer equivalent access to the
264
+ Corresponding Source in the same way through the same place at no
265
+ further charge. You need not require recipients to copy the
266
+ Corresponding Source along with the object code. If the place to
267
+ copy the object code is a network server, the Corresponding Source
268
+ may be on a different server (operated by you or a third party)
269
+ that supports equivalent copying facilities, provided you maintain
270
+ clear directions next to the object code saying where to find the
271
+ Corresponding Source. Regardless of what server hosts the
272
+ Corresponding Source, you remain obligated to ensure that it is
273
+ available for as long as needed to satisfy these requirements.
274
+ - e) Convey the object code using peer-to-peer transmission,
275
+ provided you inform other peers where the object code and
276
+ Corresponding Source of the work are being offered to the general
277
+ public at no charge under subsection 6d.
278
+
279
+ A separable portion of the object code, whose source code is excluded
280
+ from the Corresponding Source as a System Library, need not be
281
+ included in conveying the object code work.
282
+
283
+ A "User Product" is either (1) a "consumer product", which means any
284
+ tangible personal property which is normally used for personal,
285
+ family, or household purposes, or (2) anything designed or sold for
286
+ incorporation into a dwelling. In determining whether a product is a
287
+ consumer product, doubtful cases shall be resolved in favor of
288
+ coverage. For a particular product received by a particular user,
289
+ "normally used" refers to a typical or common use of that class of
290
+ product, regardless of the status of the particular user or of the way
291
+ in which the particular user actually uses, or expects or is expected
292
+ to use, the product. A product is a consumer product regardless of
293
+ whether the product has substantial commercial, industrial or
294
+ non-consumer uses, unless such uses represent the only significant
295
+ mode of use of the product.
296
+
297
+ "Installation Information" for a User Product means any methods,
298
+ procedures, authorization keys, or other information required to
299
+ install and execute modified versions of a covered work in that User
300
+ Product from a modified version of its Corresponding Source. The
301
+ information must suffice to ensure that the continued functioning of
302
+ the modified object code is in no case prevented or interfered with
303
+ solely because modification has been made.
304
+
305
+ If you convey an object code work under this section in, or with, or
306
+ specifically for use in, a User Product, and the conveying occurs as
307
+ part of a transaction in which the right of possession and use of the
308
+ User Product is transferred to the recipient in perpetuity or for a
309
+ fixed term (regardless of how the transaction is characterized), the
310
+ Corresponding Source conveyed under this section must be accompanied
311
+ by the Installation Information. But this requirement does not apply
312
+ if neither you nor any third party retains the ability to install
313
+ modified object code on the User Product (for example, the work has
314
+ been installed in ROM).
315
+
316
+ The requirement to provide Installation Information does not include a
317
+ requirement to continue to provide support service, warranty, or
318
+ updates for a work that has been modified or installed by the
319
+ recipient, or for the User Product in which it has been modified or
320
+ installed. Access to a network may be denied when the modification
321
+ itself materially and adversely affects the operation of the network
322
+ or violates the rules and protocols for communication across the
323
+ network.
324
+
325
+ Corresponding Source conveyed, and Installation Information provided,
326
+ in accord with this section must be in a format that is publicly
327
+ documented (and with an implementation available to the public in
328
+ source code form), and must require no special password or key for
329
+ unpacking, reading or copying.
330
+
331
+ ### 7. Additional Terms.
332
+
333
+ "Additional permissions" are terms that supplement the terms of this
334
+ License by making exceptions from one or more of its conditions.
335
+ Additional permissions that are applicable to the entire Program shall
336
+ be treated as though they were included in this License, to the extent
337
+ that they are valid under applicable law. If additional permissions
338
+ apply only to part of the Program, that part may be used separately
339
+ under those permissions, but the entire Program remains governed by
340
+ this License without regard to the additional permissions.
341
+
342
+ When you convey a copy of a covered work, you may at your option
343
+ remove any additional permissions from that copy, or from any part of
344
+ it. (Additional permissions may be written to require their own
345
+ removal in certain cases when you modify the work.) You may place
346
+ additional permissions on material, added by you to a covered work,
347
+ for which you have or can give appropriate copyright permission.
348
+
349
+ Notwithstanding any other provision of this License, for material you
350
+ add to a covered work, you may (if authorized by the copyright holders
351
+ of that material) supplement the terms of this License with terms:
352
+
353
+ - a) Disclaiming warranty or limiting liability differently from the
354
+ terms of sections 15 and 16 of this License; or
355
+ - b) Requiring preservation of specified reasonable legal notices or
356
+ author attributions in that material or in the Appropriate Legal
357
+ Notices displayed by works containing it; or
358
+ - c) Prohibiting misrepresentation of the origin of that material,
359
+ or requiring that modified versions of such material be marked in
360
+ reasonable ways as different from the original version; or
361
+ - d) Limiting the use for publicity purposes of names of licensors
362
+ or authors of the material; or
363
+ - e) Declining to grant rights under trademark law for use of some
364
+ trade names, trademarks, or service marks; or
365
+ - f) Requiring indemnification of licensors and authors of that
366
+ material by anyone who conveys the material (or modified versions
367
+ of it) with contractual assumptions of liability to the recipient,
368
+ for any liability that these contractual assumptions directly
369
+ impose on those licensors and authors.
370
+
371
+ All other non-permissive additional terms are considered "further
372
+ restrictions" within the meaning of section 10. If the Program as you
373
+ received it, or any part of it, contains a notice stating that it is
374
+ governed by this License along with a term that is a further
375
+ restriction, you may remove that term. If a license document contains
376
+ a further restriction but permits relicensing or conveying under this
377
+ License, you may add to a covered work material governed by the terms
378
+ of that license document, provided that the further restriction does
379
+ not survive such relicensing or conveying.
380
+
381
+ If you add terms to a covered work in accord with this section, you
382
+ must place, in the relevant source files, a statement of the
383
+ additional terms that apply to those files, or a notice indicating
384
+ where to find the applicable terms.
385
+
386
+ Additional terms, permissive or non-permissive, may be stated in the
387
+ form of a separately written license, or stated as exceptions; the
388
+ above requirements apply either way.
389
+
390
+ ### 8. Termination.
391
+
392
+ You may not propagate or modify a covered work except as expressly
393
+ provided under this License. Any attempt otherwise to propagate or
394
+ modify it is void, and will automatically terminate your rights under
395
+ this License (including any patent licenses granted under the third
396
+ paragraph of section 11).
397
+
398
+ However, if you cease all violation of this License, then your license
399
+ from a particular copyright holder is reinstated (a) provisionally,
400
+ unless and until the copyright holder explicitly and finally
401
+ terminates your license, and (b) permanently, if the copyright holder
402
+ fails to notify you of the violation by some reasonable means prior to
403
+ 60 days after the cessation.
404
+
405
+ Moreover, your license from a particular copyright holder is
406
+ reinstated permanently if the copyright holder notifies you of the
407
+ violation by some reasonable means, this is the first time you have
408
+ received notice of violation of this License (for any work) from that
409
+ copyright holder, and you cure the violation prior to 30 days after
410
+ your receipt of the notice.
411
+
412
+ Termination of your rights under this section does not terminate the
413
+ licenses of parties who have received copies or rights from you under
414
+ this License. If your rights have been terminated and not permanently
415
+ reinstated, you do not qualify to receive new licenses for the same
416
+ material under section 10.
417
+
418
+ ### 9. Acceptance Not Required for Having Copies.
419
+
420
+ You are not required to accept this License in order to receive or run
421
+ a copy of the Program. Ancillary propagation of a covered work
422
+ occurring solely as a consequence of using peer-to-peer transmission
423
+ to receive a copy likewise does not require acceptance. However,
424
+ nothing other than this License grants you permission to propagate or
425
+ modify any covered work. These actions infringe copyright if you do
426
+ not accept this License. Therefore, by modifying or propagating a
427
+ covered work, you indicate your acceptance of this License to do so.
428
+
429
+ ### 10. Automatic Licensing of Downstream Recipients.
430
+
431
+ Each time you convey a covered work, the recipient automatically
432
+ receives a license from the original licensors, to run, modify and
433
+ propagate that work, subject to this License. You are not responsible
434
+ for enforcing compliance by third parties with this License.
435
+
436
+ An "entity transaction" is a transaction transferring control of an
437
+ organization, or substantially all assets of one, or subdividing an
438
+ organization, or merging organizations. If propagation of a covered
439
+ work results from an entity transaction, each party to that
440
+ transaction who receives a copy of the work also receives whatever
441
+ licenses to the work the party's predecessor in interest had or could
442
+ give under the previous paragraph, plus a right to possession of the
443
+ Corresponding Source of the work from the predecessor in interest, if
444
+ the predecessor has it or can get it with reasonable efforts.
445
+
446
+ You may not impose any further restrictions on the exercise of the
447
+ rights granted or affirmed under this License. For example, you may
448
+ not impose a license fee, royalty, or other charge for exercise of
449
+ rights granted under this License, and you may not initiate litigation
450
+ (including a cross-claim or counterclaim in a lawsuit) alleging that
451
+ any patent claim is infringed by making, using, selling, offering for
452
+ sale, or importing the Program or any portion of it.
453
+
454
+ ### 11. Patents.
455
+
456
+ A "contributor" is a copyright holder who authorizes use under this
457
+ License of the Program or a work on which the Program is based. The
458
+ work thus licensed is called the contributor's "contributor version".
459
+
460
+ A contributor's "essential patent claims" are all patent claims owned
461
+ or controlled by the contributor, whether already acquired or
462
+ hereafter acquired, that would be infringed by some manner, permitted
463
+ by this License, of making, using, or selling its contributor version,
464
+ but do not include claims that would be infringed only as a
465
+ consequence of further modification of the contributor version. For
466
+ purposes of this definition, "control" includes the right to grant
467
+ patent sublicenses in a manner consistent with the requirements of
468
+ this License.
469
+
470
+ Each contributor grants you a non-exclusive, worldwide, royalty-free
471
+ patent license under the contributor's essential patent claims, to
472
+ make, use, sell, offer for sale, import and otherwise run, modify and
473
+ propagate the contents of its contributor version.
474
+
475
+ In the following three paragraphs, a "patent license" is any express
476
+ agreement or commitment, however denominated, not to enforce a patent
477
+ (such as an express permission to practice a patent or covenant not to
478
+ sue for patent infringement). To "grant" such a patent license to a
479
+ party means to make such an agreement or commitment not to enforce a
480
+ patent against the party.
481
+
482
+ If you convey a covered work, knowingly relying on a patent license,
483
+ and the Corresponding Source of the work is not available for anyone
484
+ to copy, free of charge and under the terms of this License, through a
485
+ publicly available network server or other readily accessible means,
486
+ then you must either (1) cause the Corresponding Source to be so
487
+ available, or (2) arrange to deprive yourself of the benefit of the
488
+ patent license for this particular work, or (3) arrange, in a manner
489
+ consistent with the requirements of this License, to extend the patent
490
+ license to downstream recipients. "Knowingly relying" means you have
491
+ actual knowledge that, but for the patent license, your conveying the
492
+ covered work in a country, or your recipient's use of the covered work
493
+ in a country, would infringe one or more identifiable patents in that
494
+ country that you have reason to believe are valid.
495
+
496
+ If, pursuant to or in connection with a single transaction or
497
+ arrangement, you convey, or propagate by procuring conveyance of, a
498
+ covered work, and grant a patent license to some of the parties
499
+ receiving the covered work authorizing them to use, propagate, modify
500
+ or convey a specific copy of the covered work, then the patent license
501
+ you grant is automatically extended to all recipients of the covered
502
+ work and works based on it.
503
+
504
+ A patent license is "discriminatory" if it does not include within the
505
+ scope of its coverage, prohibits the exercise of, or is conditioned on
506
+ the non-exercise of one or more of the rights that are specifically
507
+ granted under this License. You may not convey a covered work if you
508
+ are a party to an arrangement with a third party that is in the
509
+ business of distributing software, under which you make payment to the
510
+ third party based on the extent of your activity of conveying the
511
+ work, and under which the third party grants, to any of the parties
512
+ who would receive the covered work from you, a discriminatory patent
513
+ license (a) in connection with copies of the covered work conveyed by
514
+ you (or copies made from those copies), or (b) primarily for and in
515
+ connection with specific products or compilations that contain the
516
+ covered work, unless you entered into that arrangement, or that patent
517
+ license was granted, prior to 28 March 2007.
518
+
519
+ Nothing in this License shall be construed as excluding or limiting
520
+ any implied license or other defenses to infringement that may
521
+ otherwise be available to you under applicable patent law.
522
+
523
+ ### 12. No Surrender of Others' Freedom.
524
+
525
+ If conditions are imposed on you (whether by court order, agreement or
526
+ otherwise) that contradict the conditions of this License, they do not
527
+ excuse you from the conditions of this License. If you cannot convey a
528
+ covered work so as to satisfy simultaneously your obligations under
529
+ this License and any other pertinent obligations, then as a
530
+ consequence you may not convey it at all. For example, if you agree to
531
+ terms that obligate you to collect a royalty for further conveying
532
+ from those to whom you convey the Program, the only way you could
533
+ satisfy both those terms and this License would be to refrain entirely
534
+ from conveying the Program.
535
+
536
+ ### 13. Remote Network Interaction; Use with the GNU General Public License.
537
+
538
+ Notwithstanding any other provision of this License, if you modify the
539
+ Program, your modified version must prominently offer all users
540
+ interacting with it remotely through a computer network (if your
541
+ version supports such interaction) an opportunity to receive the
542
+ Corresponding Source of your version by providing access to the
543
+ Corresponding Source from a network server at no charge, through some
544
+ standard or customary means of facilitating copying of software. This
545
+ Corresponding Source shall include the Corresponding Source for any
546
+ work covered by version 3 of the GNU General Public License that is
547
+ incorporated pursuant to the following paragraph.
548
+
549
+ Notwithstanding any other provision of this License, you have
550
+ permission to link or combine any covered work with a work licensed
551
+ under version 3 of the GNU General Public License into a single
552
+ combined work, and to convey the resulting work. The terms of this
553
+ License will continue to apply to the part which is the covered work,
554
+ but the work with which it is combined will remain governed by version
555
+ 3 of the GNU General Public License.
556
+
557
+ ### 14. Revised Versions of this License.
558
+
559
+ The Free Software Foundation may publish revised and/or new versions
560
+ of the GNU Affero General Public License from time to time. Such new
561
+ versions will be similar in spirit to the present version, but may
562
+ differ in detail to address new problems or concerns.
563
+
564
+ Each version is given a distinguishing version number. If the Program
565
+ specifies that a certain numbered version of the GNU Affero General
566
+ Public License "or any later version" applies to it, you have the
567
+ option of following the terms and conditions either of that numbered
568
+ version or of any later version published by the Free Software
569
+ Foundation. If the Program does not specify a version number of the
570
+ GNU Affero General Public License, you may choose any version ever
571
+ published by the Free Software Foundation.
572
+
573
+ If the Program specifies that a proxy can decide which future versions
574
+ of the GNU Affero General Public License can be used, that proxy's
575
+ public statement of acceptance of a version permanently authorizes you
576
+ to choose that version for the Program.
577
+
578
+ Later license versions may give you additional or different
579
+ permissions. However, no additional obligations are imposed on any
580
+ author or copyright holder as a result of your choosing to follow a
581
+ later version.
582
+
583
+ ### 15. Disclaimer of Warranty.
584
+
585
+ THERE IS NO WARRANTY FOR THE PROGRAM, TO THE EXTENT PERMITTED BY
586
+ APPLICABLE LAW. EXCEPT WHEN OTHERWISE STATED IN WRITING THE COPYRIGHT
587
+ HOLDERS AND/OR OTHER PARTIES PROVIDE THE PROGRAM "AS IS" WITHOUT
588
+ WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT
589
+ LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR
590
+ A PARTICULAR PURPOSE. THE ENTIRE RISK AS TO THE QUALITY AND
591
+ PERFORMANCE OF THE PROGRAM IS WITH YOU. SHOULD THE PROGRAM PROVE
592
+ DEFECTIVE, YOU ASSUME THE COST OF ALL NECESSARY SERVICING, REPAIR OR
593
+ CORRECTION.
594
+
595
+ ### 16. Limitation of Liability.
596
+
597
+ IN NO EVENT UNLESS REQUIRED BY APPLICABLE LAW OR AGREED TO IN WRITING
598
+ WILL ANY COPYRIGHT HOLDER, OR ANY OTHER PARTY WHO MODIFIES AND/OR
599
+ CONVEYS THE PROGRAM AS PERMITTED ABOVE, BE LIABLE TO YOU FOR DAMAGES,
600
+ INCLUDING ANY GENERAL, SPECIAL, INCIDENTAL OR CONSEQUENTIAL DAMAGES
601
+ ARISING OUT OF THE USE OR INABILITY TO USE THE PROGRAM (INCLUDING BUT
602
+ NOT LIMITED TO LOSS OF DATA OR DATA BEING RENDERED INACCURATE OR
603
+ LOSSES SUSTAINED BY YOU OR THIRD PARTIES OR A FAILURE OF THE PROGRAM
604
+ TO OPERATE WITH ANY OTHER PROGRAMS), EVEN IF SUCH HOLDER OR OTHER
605
+ PARTY HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGES.
606
+
607
+ ### 17. Interpretation of Sections 15 and 16.
608
+
609
+ If the disclaimer of warranty and limitation of liability provided
610
+ above cannot be given local legal effect according to their terms,
611
+ reviewing courts shall apply local law that most closely approximates
612
+ an absolute waiver of all civil liability in connection with the
613
+ Program, unless a warranty or assumption of liability accompanies a
614
+ copy of the Program in return for a fee.
615
+
616
+ END OF TERMS AND CONDITIONS
617
+
618
+ ## How to Apply These Terms to Your New Programs
619
+
620
+ If you develop a new program, and you want it to be of the greatest
621
+ possible use to the public, the best way to achieve this is to make it
622
+ free software which everyone can redistribute and change under these
623
+ terms.
624
+
625
+ To do so, attach the following notices to the program. It is safest to
626
+ attach them to the start of each source file to most effectively state
627
+ the exclusion of warranty; and each file should have at least the
628
+ "copyright" line and a pointer to where the full notice is found.
629
+
630
+ <one line to give the program's name and a brief idea of what it does.>
631
+ Copyright (C) <year> <name of author>
632
+
633
+ This program is free software: you can redistribute it and/or modify
634
+ it under the terms of the GNU Affero General Public License as
635
+ published by the Free Software Foundation, either version 3 of the
636
+ License, or (at your option) any later version.
637
+
638
+ This program is distributed in the hope that it will be useful,
639
+ but WITHOUT ANY WARRANTY; without even the implied warranty of
640
+ MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
641
+ GNU Affero General Public License for more details.
642
+
643
+ You should have received a copy of the GNU Affero General Public License
644
+ along with this program. If not, see <https://www.gnu.org/licenses/>.
645
+
646
+ Also add information on how to contact you by electronic and paper
647
+ mail.
648
+
649
+ If your software can interact with users remotely through a computer
650
+ network, you should also make sure that it provides a way for users to
651
+ get its source. For example, if your program is a web application, its
652
+ interface could display a "Source" link that leads users to an archive
653
+ of the code. There are many ways you could offer source, and different
654
+ solutions will be better for different programs; see section 13 for
655
+ the specific requirements.
656
+
657
+ You should also get your employer (if you work as a programmer) or
658
+ school, if any, to sign a "copyright disclaimer" for the program, if
659
+ necessary. For more information on this, and how to apply and follow
660
+ the GNU AGPL, see <https://www.gnu.org/licenses/>.
661
+
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:959891f4fbe47525857764aef4661ec585602ce7773c54b64df9e29206fcaf90
3
+ size 369414552
tokenizer_config.json ADDED
@@ -0,0 +1,69 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "<pad>",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "1": {
12
+ "content": "<cls>",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "2": {
20
+ "content": "<eos>",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "3": {
28
+ "content": "<unk>",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "4": {
36
+ "content": "<mask>",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ },
43
+ "5": {
44
+ "content": "<null>",
45
+ "lstrip": false,
46
+ "normalized": false,
47
+ "rstrip": false,
48
+ "single_word": false,
49
+ "special": true
50
+ }
51
+ },
52
+ "backend": "custom",
53
+ "bos_token": "<cls>",
54
+ "clean_up_tokenization_spaces": true,
55
+ "cls_token": "<cls>",
56
+ "codon": false,
57
+ "eos_token": "<eos>",
58
+ "extra_special_tokens": [
59
+ "<null>"
60
+ ],
61
+ "mask_token": "<mask>",
62
+ "model_max_length": 512,
63
+ "nmers": 6,
64
+ "pad_token": "<pad>",
65
+ "replace_U_with_T": true,
66
+ "sep_token": "<eos>",
67
+ "tokenizer_class": "DnaTokenizer",
68
+ "unk_token": "<unk>"
69
+ }
vocab.txt ADDED
@@ -0,0 +1,4102 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ <pad>
2
+ <cls>
3
+ <eos>
4
+ <unk>
5
+ <mask>
6
+ <null>
7
+ AAAAAA
8
+ AAAAAC
9
+ AAAAAG
10
+ AAAAAT
11
+ AAAACA
12
+ AAAACC
13
+ AAAACG
14
+ AAAACT
15
+ AAAAGA
16
+ AAAAGC
17
+ AAAAGG
18
+ AAAAGT
19
+ AAAATA
20
+ AAAATC
21
+ AAAATG
22
+ AAAATT
23
+ AAACAA
24
+ AAACAC
25
+ AAACAG
26
+ AAACAT
27
+ AAACCA
28
+ AAACCC
29
+ AAACCG
30
+ AAACCT
31
+ AAACGA
32
+ AAACGC
33
+ AAACGG
34
+ AAACGT
35
+ AAACTA
36
+ AAACTC
37
+ AAACTG
38
+ AAACTT
39
+ AAAGAA
40
+ AAAGAC
41
+ AAAGAG
42
+ AAAGAT
43
+ AAAGCA
44
+ AAAGCC
45
+ AAAGCG
46
+ AAAGCT
47
+ AAAGGA
48
+ AAAGGC
49
+ AAAGGG
50
+ AAAGGT
51
+ AAAGTA
52
+ AAAGTC
53
+ AAAGTG
54
+ AAAGTT
55
+ AAATAA
56
+ AAATAC
57
+ AAATAG
58
+ AAATAT
59
+ AAATCA
60
+ AAATCC
61
+ AAATCG
62
+ AAATCT
63
+ AAATGA
64
+ AAATGC
65
+ AAATGG
66
+ AAATGT
67
+ AAATTA
68
+ AAATTC
69
+ AAATTG
70
+ AAATTT
71
+ AACAAA
72
+ AACAAC
73
+ AACAAG
74
+ AACAAT
75
+ AACACA
76
+ AACACC
77
+ AACACG
78
+ AACACT
79
+ AACAGA
80
+ AACAGC
81
+ AACAGG
82
+ AACAGT
83
+ AACATA
84
+ AACATC
85
+ AACATG
86
+ AACATT
87
+ AACCAA
88
+ AACCAC
89
+ AACCAG
90
+ AACCAT
91
+ AACCCA
92
+ AACCCC
93
+ AACCCG
94
+ AACCCT
95
+ AACCGA
96
+ AACCGC
97
+ AACCGG
98
+ AACCGT
99
+ AACCTA
100
+ AACCTC
101
+ AACCTG
102
+ AACCTT
103
+ AACGAA
104
+ AACGAC
105
+ AACGAG
106
+ AACGAT
107
+ AACGCA
108
+ AACGCC
109
+ AACGCG
110
+ AACGCT
111
+ AACGGA
112
+ AACGGC
113
+ AACGGG
114
+ AACGGT
115
+ AACGTA
116
+ AACGTC
117
+ AACGTG
118
+ AACGTT
119
+ AACTAA
120
+ AACTAC
121
+ AACTAG
122
+ AACTAT
123
+ AACTCA
124
+ AACTCC
125
+ AACTCG
126
+ AACTCT
127
+ AACTGA
128
+ AACTGC
129
+ AACTGG
130
+ AACTGT
131
+ AACTTA
132
+ AACTTC
133
+ AACTTG
134
+ AACTTT
135
+ AAGAAA
136
+ AAGAAC
137
+ AAGAAG
138
+ AAGAAT
139
+ AAGACA
140
+ AAGACC
141
+ AAGACG
142
+ AAGACT
143
+ AAGAGA
144
+ AAGAGC
145
+ AAGAGG
146
+ AAGAGT
147
+ AAGATA
148
+ AAGATC
149
+ AAGATG
150
+ AAGATT
151
+ AAGCAA
152
+ AAGCAC
153
+ AAGCAG
154
+ AAGCAT
155
+ AAGCCA
156
+ AAGCCC
157
+ AAGCCG
158
+ AAGCCT
159
+ AAGCGA
160
+ AAGCGC
161
+ AAGCGG
162
+ AAGCGT
163
+ AAGCTA
164
+ AAGCTC
165
+ AAGCTG
166
+ AAGCTT
167
+ AAGGAA
168
+ AAGGAC
169
+ AAGGAG
170
+ AAGGAT
171
+ AAGGCA
172
+ AAGGCC
173
+ AAGGCG
174
+ AAGGCT
175
+ AAGGGA
176
+ AAGGGC
177
+ AAGGGG
178
+ AAGGGT
179
+ AAGGTA
180
+ AAGGTC
181
+ AAGGTG
182
+ AAGGTT
183
+ AAGTAA
184
+ AAGTAC
185
+ AAGTAG
186
+ AAGTAT
187
+ AAGTCA
188
+ AAGTCC
189
+ AAGTCG
190
+ AAGTCT
191
+ AAGTGA
192
+ AAGTGC
193
+ AAGTGG
194
+ AAGTGT
195
+ AAGTTA
196
+ AAGTTC
197
+ AAGTTG
198
+ AAGTTT
199
+ AATAAA
200
+ AATAAC
201
+ AATAAG
202
+ AATAAT
203
+ AATACA
204
+ AATACC
205
+ AATACG
206
+ AATACT
207
+ AATAGA
208
+ AATAGC
209
+ AATAGG
210
+ AATAGT
211
+ AATATA
212
+ AATATC
213
+ AATATG
214
+ AATATT
215
+ AATCAA
216
+ AATCAC
217
+ AATCAG
218
+ AATCAT
219
+ AATCCA
220
+ AATCCC
221
+ AATCCG
222
+ AATCCT
223
+ AATCGA
224
+ AATCGC
225
+ AATCGG
226
+ AATCGT
227
+ AATCTA
228
+ AATCTC
229
+ AATCTG
230
+ AATCTT
231
+ AATGAA
232
+ AATGAC
233
+ AATGAG
234
+ AATGAT
235
+ AATGCA
236
+ AATGCC
237
+ AATGCG
238
+ AATGCT
239
+ AATGGA
240
+ AATGGC
241
+ AATGGG
242
+ AATGGT
243
+ AATGTA
244
+ AATGTC
245
+ AATGTG
246
+ AATGTT
247
+ AATTAA
248
+ AATTAC
249
+ AATTAG
250
+ AATTAT
251
+ AATTCA
252
+ AATTCC
253
+ AATTCG
254
+ AATTCT
255
+ AATTGA
256
+ AATTGC
257
+ AATTGG
258
+ AATTGT
259
+ AATTTA
260
+ AATTTC
261
+ AATTTG
262
+ AATTTT
263
+ ACAAAA
264
+ ACAAAC
265
+ ACAAAG
266
+ ACAAAT
267
+ ACAACA
268
+ ACAACC
269
+ ACAACG
270
+ ACAACT
271
+ ACAAGA
272
+ ACAAGC
273
+ ACAAGG
274
+ ACAAGT
275
+ ACAATA
276
+ ACAATC
277
+ ACAATG
278
+ ACAATT
279
+ ACACAA
280
+ ACACAC
281
+ ACACAG
282
+ ACACAT
283
+ ACACCA
284
+ ACACCC
285
+ ACACCG
286
+ ACACCT
287
+ ACACGA
288
+ ACACGC
289
+ ACACGG
290
+ ACACGT
291
+ ACACTA
292
+ ACACTC
293
+ ACACTG
294
+ ACACTT
295
+ ACAGAA
296
+ ACAGAC
297
+ ACAGAG
298
+ ACAGAT
299
+ ACAGCA
300
+ ACAGCC
301
+ ACAGCG
302
+ ACAGCT
303
+ ACAGGA
304
+ ACAGGC
305
+ ACAGGG
306
+ ACAGGT
307
+ ACAGTA
308
+ ACAGTC
309
+ ACAGTG
310
+ ACAGTT
311
+ ACATAA
312
+ ACATAC
313
+ ACATAG
314
+ ACATAT
315
+ ACATCA
316
+ ACATCC
317
+ ACATCG
318
+ ACATCT
319
+ ACATGA
320
+ ACATGC
321
+ ACATGG
322
+ ACATGT
323
+ ACATTA
324
+ ACATTC
325
+ ACATTG
326
+ ACATTT
327
+ ACCAAA
328
+ ACCAAC
329
+ ACCAAG
330
+ ACCAAT
331
+ ACCACA
332
+ ACCACC
333
+ ACCACG
334
+ ACCACT
335
+ ACCAGA
336
+ ACCAGC
337
+ ACCAGG
338
+ ACCAGT
339
+ ACCATA
340
+ ACCATC
341
+ ACCATG
342
+ ACCATT
343
+ ACCCAA
344
+ ACCCAC
345
+ ACCCAG
346
+ ACCCAT
347
+ ACCCCA
348
+ ACCCCC
349
+ ACCCCG
350
+ ACCCCT
351
+ ACCCGA
352
+ ACCCGC
353
+ ACCCGG
354
+ ACCCGT
355
+ ACCCTA
356
+ ACCCTC
357
+ ACCCTG
358
+ ACCCTT
359
+ ACCGAA
360
+ ACCGAC
361
+ ACCGAG
362
+ ACCGAT
363
+ ACCGCA
364
+ ACCGCC
365
+ ACCGCG
366
+ ACCGCT
367
+ ACCGGA
368
+ ACCGGC
369
+ ACCGGG
370
+ ACCGGT
371
+ ACCGTA
372
+ ACCGTC
373
+ ACCGTG
374
+ ACCGTT
375
+ ACCTAA
376
+ ACCTAC
377
+ ACCTAG
378
+ ACCTAT
379
+ ACCTCA
380
+ ACCTCC
381
+ ACCTCG
382
+ ACCTCT
383
+ ACCTGA
384
+ ACCTGC
385
+ ACCTGG
386
+ ACCTGT
387
+ ACCTTA
388
+ ACCTTC
389
+ ACCTTG
390
+ ACCTTT
391
+ ACGAAA
392
+ ACGAAC
393
+ ACGAAG
394
+ ACGAAT
395
+ ACGACA
396
+ ACGACC
397
+ ACGACG
398
+ ACGACT
399
+ ACGAGA
400
+ ACGAGC
401
+ ACGAGG
402
+ ACGAGT
403
+ ACGATA
404
+ ACGATC
405
+ ACGATG
406
+ ACGATT
407
+ ACGCAA
408
+ ACGCAC
409
+ ACGCAG
410
+ ACGCAT
411
+ ACGCCA
412
+ ACGCCC
413
+ ACGCCG
414
+ ACGCCT
415
+ ACGCGA
416
+ ACGCGC
417
+ ACGCGG
418
+ ACGCGT
419
+ ACGCTA
420
+ ACGCTC
421
+ ACGCTG
422
+ ACGCTT
423
+ ACGGAA
424
+ ACGGAC
425
+ ACGGAG
426
+ ACGGAT
427
+ ACGGCA
428
+ ACGGCC
429
+ ACGGCG
430
+ ACGGCT
431
+ ACGGGA
432
+ ACGGGC
433
+ ACGGGG
434
+ ACGGGT
435
+ ACGGTA
436
+ ACGGTC
437
+ ACGGTG
438
+ ACGGTT
439
+ ACGTAA
440
+ ACGTAC
441
+ ACGTAG
442
+ ACGTAT
443
+ ACGTCA
444
+ ACGTCC
445
+ ACGTCG
446
+ ACGTCT
447
+ ACGTGA
448
+ ACGTGC
449
+ ACGTGG
450
+ ACGTGT
451
+ ACGTTA
452
+ ACGTTC
453
+ ACGTTG
454
+ ACGTTT
455
+ ACTAAA
456
+ ACTAAC
457
+ ACTAAG
458
+ ACTAAT
459
+ ACTACA
460
+ ACTACC
461
+ ACTACG
462
+ ACTACT
463
+ ACTAGA
464
+ ACTAGC
465
+ ACTAGG
466
+ ACTAGT
467
+ ACTATA
468
+ ACTATC
469
+ ACTATG
470
+ ACTATT
471
+ ACTCAA
472
+ ACTCAC
473
+ ACTCAG
474
+ ACTCAT
475
+ ACTCCA
476
+ ACTCCC
477
+ ACTCCG
478
+ ACTCCT
479
+ ACTCGA
480
+ ACTCGC
481
+ ACTCGG
482
+ ACTCGT
483
+ ACTCTA
484
+ ACTCTC
485
+ ACTCTG
486
+ ACTCTT
487
+ ACTGAA
488
+ ACTGAC
489
+ ACTGAG
490
+ ACTGAT
491
+ ACTGCA
492
+ ACTGCC
493
+ ACTGCG
494
+ ACTGCT
495
+ ACTGGA
496
+ ACTGGC
497
+ ACTGGG
498
+ ACTGGT
499
+ ACTGTA
500
+ ACTGTC
501
+ ACTGTG
502
+ ACTGTT
503
+ ACTTAA
504
+ ACTTAC
505
+ ACTTAG
506
+ ACTTAT
507
+ ACTTCA
508
+ ACTTCC
509
+ ACTTCG
510
+ ACTTCT
511
+ ACTTGA
512
+ ACTTGC
513
+ ACTTGG
514
+ ACTTGT
515
+ ACTTTA
516
+ ACTTTC
517
+ ACTTTG
518
+ ACTTTT
519
+ AGAAAA
520
+ AGAAAC
521
+ AGAAAG
522
+ AGAAAT
523
+ AGAACA
524
+ AGAACC
525
+ AGAACG
526
+ AGAACT
527
+ AGAAGA
528
+ AGAAGC
529
+ AGAAGG
530
+ AGAAGT
531
+ AGAATA
532
+ AGAATC
533
+ AGAATG
534
+ AGAATT
535
+ AGACAA
536
+ AGACAC
537
+ AGACAG
538
+ AGACAT
539
+ AGACCA
540
+ AGACCC
541
+ AGACCG
542
+ AGACCT
543
+ AGACGA
544
+ AGACGC
545
+ AGACGG
546
+ AGACGT
547
+ AGACTA
548
+ AGACTC
549
+ AGACTG
550
+ AGACTT
551
+ AGAGAA
552
+ AGAGAC
553
+ AGAGAG
554
+ AGAGAT
555
+ AGAGCA
556
+ AGAGCC
557
+ AGAGCG
558
+ AGAGCT
559
+ AGAGGA
560
+ AGAGGC
561
+ AGAGGG
562
+ AGAGGT
563
+ AGAGTA
564
+ AGAGTC
565
+ AGAGTG
566
+ AGAGTT
567
+ AGATAA
568
+ AGATAC
569
+ AGATAG
570
+ AGATAT
571
+ AGATCA
572
+ AGATCC
573
+ AGATCG
574
+ AGATCT
575
+ AGATGA
576
+ AGATGC
577
+ AGATGG
578
+ AGATGT
579
+ AGATTA
580
+ AGATTC
581
+ AGATTG
582
+ AGATTT
583
+ AGCAAA
584
+ AGCAAC
585
+ AGCAAG
586
+ AGCAAT
587
+ AGCACA
588
+ AGCACC
589
+ AGCACG
590
+ AGCACT
591
+ AGCAGA
592
+ AGCAGC
593
+ AGCAGG
594
+ AGCAGT
595
+ AGCATA
596
+ AGCATC
597
+ AGCATG
598
+ AGCATT
599
+ AGCCAA
600
+ AGCCAC
601
+ AGCCAG
602
+ AGCCAT
603
+ AGCCCA
604
+ AGCCCC
605
+ AGCCCG
606
+ AGCCCT
607
+ AGCCGA
608
+ AGCCGC
609
+ AGCCGG
610
+ AGCCGT
611
+ AGCCTA
612
+ AGCCTC
613
+ AGCCTG
614
+ AGCCTT
615
+ AGCGAA
616
+ AGCGAC
617
+ AGCGAG
618
+ AGCGAT
619
+ AGCGCA
620
+ AGCGCC
621
+ AGCGCG
622
+ AGCGCT
623
+ AGCGGA
624
+ AGCGGC
625
+ AGCGGG
626
+ AGCGGT
627
+ AGCGTA
628
+ AGCGTC
629
+ AGCGTG
630
+ AGCGTT
631
+ AGCTAA
632
+ AGCTAC
633
+ AGCTAG
634
+ AGCTAT
635
+ AGCTCA
636
+ AGCTCC
637
+ AGCTCG
638
+ AGCTCT
639
+ AGCTGA
640
+ AGCTGC
641
+ AGCTGG
642
+ AGCTGT
643
+ AGCTTA
644
+ AGCTTC
645
+ AGCTTG
646
+ AGCTTT
647
+ AGGAAA
648
+ AGGAAC
649
+ AGGAAG
650
+ AGGAAT
651
+ AGGACA
652
+ AGGACC
653
+ AGGACG
654
+ AGGACT
655
+ AGGAGA
656
+ AGGAGC
657
+ AGGAGG
658
+ AGGAGT
659
+ AGGATA
660
+ AGGATC
661
+ AGGATG
662
+ AGGATT
663
+ AGGCAA
664
+ AGGCAC
665
+ AGGCAG
666
+ AGGCAT
667
+ AGGCCA
668
+ AGGCCC
669
+ AGGCCG
670
+ AGGCCT
671
+ AGGCGA
672
+ AGGCGC
673
+ AGGCGG
674
+ AGGCGT
675
+ AGGCTA
676
+ AGGCTC
677
+ AGGCTG
678
+ AGGCTT
679
+ AGGGAA
680
+ AGGGAC
681
+ AGGGAG
682
+ AGGGAT
683
+ AGGGCA
684
+ AGGGCC
685
+ AGGGCG
686
+ AGGGCT
687
+ AGGGGA
688
+ AGGGGC
689
+ AGGGGG
690
+ AGGGGT
691
+ AGGGTA
692
+ AGGGTC
693
+ AGGGTG
694
+ AGGGTT
695
+ AGGTAA
696
+ AGGTAC
697
+ AGGTAG
698
+ AGGTAT
699
+ AGGTCA
700
+ AGGTCC
701
+ AGGTCG
702
+ AGGTCT
703
+ AGGTGA
704
+ AGGTGC
705
+ AGGTGG
706
+ AGGTGT
707
+ AGGTTA
708
+ AGGTTC
709
+ AGGTTG
710
+ AGGTTT
711
+ AGTAAA
712
+ AGTAAC
713
+ AGTAAG
714
+ AGTAAT
715
+ AGTACA
716
+ AGTACC
717
+ AGTACG
718
+ AGTACT
719
+ AGTAGA
720
+ AGTAGC
721
+ AGTAGG
722
+ AGTAGT
723
+ AGTATA
724
+ AGTATC
725
+ AGTATG
726
+ AGTATT
727
+ AGTCAA
728
+ AGTCAC
729
+ AGTCAG
730
+ AGTCAT
731
+ AGTCCA
732
+ AGTCCC
733
+ AGTCCG
734
+ AGTCCT
735
+ AGTCGA
736
+ AGTCGC
737
+ AGTCGG
738
+ AGTCGT
739
+ AGTCTA
740
+ AGTCTC
741
+ AGTCTG
742
+ AGTCTT
743
+ AGTGAA
744
+ AGTGAC
745
+ AGTGAG
746
+ AGTGAT
747
+ AGTGCA
748
+ AGTGCC
749
+ AGTGCG
750
+ AGTGCT
751
+ AGTGGA
752
+ AGTGGC
753
+ AGTGGG
754
+ AGTGGT
755
+ AGTGTA
756
+ AGTGTC
757
+ AGTGTG
758
+ AGTGTT
759
+ AGTTAA
760
+ AGTTAC
761
+ AGTTAG
762
+ AGTTAT
763
+ AGTTCA
764
+ AGTTCC
765
+ AGTTCG
766
+ AGTTCT
767
+ AGTTGA
768
+ AGTTGC
769
+ AGTTGG
770
+ AGTTGT
771
+ AGTTTA
772
+ AGTTTC
773
+ AGTTTG
774
+ AGTTTT
775
+ ATAAAA
776
+ ATAAAC
777
+ ATAAAG
778
+ ATAAAT
779
+ ATAACA
780
+ ATAACC
781
+ ATAACG
782
+ ATAACT
783
+ ATAAGA
784
+ ATAAGC
785
+ ATAAGG
786
+ ATAAGT
787
+ ATAATA
788
+ ATAATC
789
+ ATAATG
790
+ ATAATT
791
+ ATACAA
792
+ ATACAC
793
+ ATACAG
794
+ ATACAT
795
+ ATACCA
796
+ ATACCC
797
+ ATACCG
798
+ ATACCT
799
+ ATACGA
800
+ ATACGC
801
+ ATACGG
802
+ ATACGT
803
+ ATACTA
804
+ ATACTC
805
+ ATACTG
806
+ ATACTT
807
+ ATAGAA
808
+ ATAGAC
809
+ ATAGAG
810
+ ATAGAT
811
+ ATAGCA
812
+ ATAGCC
813
+ ATAGCG
814
+ ATAGCT
815
+ ATAGGA
816
+ ATAGGC
817
+ ATAGGG
818
+ ATAGGT
819
+ ATAGTA
820
+ ATAGTC
821
+ ATAGTG
822
+ ATAGTT
823
+ ATATAA
824
+ ATATAC
825
+ ATATAG
826
+ ATATAT
827
+ ATATCA
828
+ ATATCC
829
+ ATATCG
830
+ ATATCT
831
+ ATATGA
832
+ ATATGC
833
+ ATATGG
834
+ ATATGT
835
+ ATATTA
836
+ ATATTC
837
+ ATATTG
838
+ ATATTT
839
+ ATCAAA
840
+ ATCAAC
841
+ ATCAAG
842
+ ATCAAT
843
+ ATCACA
844
+ ATCACC
845
+ ATCACG
846
+ ATCACT
847
+ ATCAGA
848
+ ATCAGC
849
+ ATCAGG
850
+ ATCAGT
851
+ ATCATA
852
+ ATCATC
853
+ ATCATG
854
+ ATCATT
855
+ ATCCAA
856
+ ATCCAC
857
+ ATCCAG
858
+ ATCCAT
859
+ ATCCCA
860
+ ATCCCC
861
+ ATCCCG
862
+ ATCCCT
863
+ ATCCGA
864
+ ATCCGC
865
+ ATCCGG
866
+ ATCCGT
867
+ ATCCTA
868
+ ATCCTC
869
+ ATCCTG
870
+ ATCCTT
871
+ ATCGAA
872
+ ATCGAC
873
+ ATCGAG
874
+ ATCGAT
875
+ ATCGCA
876
+ ATCGCC
877
+ ATCGCG
878
+ ATCGCT
879
+ ATCGGA
880
+ ATCGGC
881
+ ATCGGG
882
+ ATCGGT
883
+ ATCGTA
884
+ ATCGTC
885
+ ATCGTG
886
+ ATCGTT
887
+ ATCTAA
888
+ ATCTAC
889
+ ATCTAG
890
+ ATCTAT
891
+ ATCTCA
892
+ ATCTCC
893
+ ATCTCG
894
+ ATCTCT
895
+ ATCTGA
896
+ ATCTGC
897
+ ATCTGG
898
+ ATCTGT
899
+ ATCTTA
900
+ ATCTTC
901
+ ATCTTG
902
+ ATCTTT
903
+ ATGAAA
904
+ ATGAAC
905
+ ATGAAG
906
+ ATGAAT
907
+ ATGACA
908
+ ATGACC
909
+ ATGACG
910
+ ATGACT
911
+ ATGAGA
912
+ ATGAGC
913
+ ATGAGG
914
+ ATGAGT
915
+ ATGATA
916
+ ATGATC
917
+ ATGATG
918
+ ATGATT
919
+ ATGCAA
920
+ ATGCAC
921
+ ATGCAG
922
+ ATGCAT
923
+ ATGCCA
924
+ ATGCCC
925
+ ATGCCG
926
+ ATGCCT
927
+ ATGCGA
928
+ ATGCGC
929
+ ATGCGG
930
+ ATGCGT
931
+ ATGCTA
932
+ ATGCTC
933
+ ATGCTG
934
+ ATGCTT
935
+ ATGGAA
936
+ ATGGAC
937
+ ATGGAG
938
+ ATGGAT
939
+ ATGGCA
940
+ ATGGCC
941
+ ATGGCG
942
+ ATGGCT
943
+ ATGGGA
944
+ ATGGGC
945
+ ATGGGG
946
+ ATGGGT
947
+ ATGGTA
948
+ ATGGTC
949
+ ATGGTG
950
+ ATGGTT
951
+ ATGTAA
952
+ ATGTAC
953
+ ATGTAG
954
+ ATGTAT
955
+ ATGTCA
956
+ ATGTCC
957
+ ATGTCG
958
+ ATGTCT
959
+ ATGTGA
960
+ ATGTGC
961
+ ATGTGG
962
+ ATGTGT
963
+ ATGTTA
964
+ ATGTTC
965
+ ATGTTG
966
+ ATGTTT
967
+ ATTAAA
968
+ ATTAAC
969
+ ATTAAG
970
+ ATTAAT
971
+ ATTACA
972
+ ATTACC
973
+ ATTACG
974
+ ATTACT
975
+ ATTAGA
976
+ ATTAGC
977
+ ATTAGG
978
+ ATTAGT
979
+ ATTATA
980
+ ATTATC
981
+ ATTATG
982
+ ATTATT
983
+ ATTCAA
984
+ ATTCAC
985
+ ATTCAG
986
+ ATTCAT
987
+ ATTCCA
988
+ ATTCCC
989
+ ATTCCG
990
+ ATTCCT
991
+ ATTCGA
992
+ ATTCGC
993
+ ATTCGG
994
+ ATTCGT
995
+ ATTCTA
996
+ ATTCTC
997
+ ATTCTG
998
+ ATTCTT
999
+ ATTGAA
1000
+ ATTGAC
1001
+ ATTGAG
1002
+ ATTGAT
1003
+ ATTGCA
1004
+ ATTGCC
1005
+ ATTGCG
1006
+ ATTGCT
1007
+ ATTGGA
1008
+ ATTGGC
1009
+ ATTGGG
1010
+ ATTGGT
1011
+ ATTGTA
1012
+ ATTGTC
1013
+ ATTGTG
1014
+ ATTGTT
1015
+ ATTTAA
1016
+ ATTTAC
1017
+ ATTTAG
1018
+ ATTTAT
1019
+ ATTTCA
1020
+ ATTTCC
1021
+ ATTTCG
1022
+ ATTTCT
1023
+ ATTTGA
1024
+ ATTTGC
1025
+ ATTTGG
1026
+ ATTTGT
1027
+ ATTTTA
1028
+ ATTTTC
1029
+ ATTTTG
1030
+ ATTTTT
1031
+ CAAAAA
1032
+ CAAAAC
1033
+ CAAAAG
1034
+ CAAAAT
1035
+ CAAACA
1036
+ CAAACC
1037
+ CAAACG
1038
+ CAAACT
1039
+ CAAAGA
1040
+ CAAAGC
1041
+ CAAAGG
1042
+ CAAAGT
1043
+ CAAATA
1044
+ CAAATC
1045
+ CAAATG
1046
+ CAAATT
1047
+ CAACAA
1048
+ CAACAC
1049
+ CAACAG
1050
+ CAACAT
1051
+ CAACCA
1052
+ CAACCC
1053
+ CAACCG
1054
+ CAACCT
1055
+ CAACGA
1056
+ CAACGC
1057
+ CAACGG
1058
+ CAACGT
1059
+ CAACTA
1060
+ CAACTC
1061
+ CAACTG
1062
+ CAACTT
1063
+ CAAGAA
1064
+ CAAGAC
1065
+ CAAGAG
1066
+ CAAGAT
1067
+ CAAGCA
1068
+ CAAGCC
1069
+ CAAGCG
1070
+ CAAGCT
1071
+ CAAGGA
1072
+ CAAGGC
1073
+ CAAGGG
1074
+ CAAGGT
1075
+ CAAGTA
1076
+ CAAGTC
1077
+ CAAGTG
1078
+ CAAGTT
1079
+ CAATAA
1080
+ CAATAC
1081
+ CAATAG
1082
+ CAATAT
1083
+ CAATCA
1084
+ CAATCC
1085
+ CAATCG
1086
+ CAATCT
1087
+ CAATGA
1088
+ CAATGC
1089
+ CAATGG
1090
+ CAATGT
1091
+ CAATTA
1092
+ CAATTC
1093
+ CAATTG
1094
+ CAATTT
1095
+ CACAAA
1096
+ CACAAC
1097
+ CACAAG
1098
+ CACAAT
1099
+ CACACA
1100
+ CACACC
1101
+ CACACG
1102
+ CACACT
1103
+ CACAGA
1104
+ CACAGC
1105
+ CACAGG
1106
+ CACAGT
1107
+ CACATA
1108
+ CACATC
1109
+ CACATG
1110
+ CACATT
1111
+ CACCAA
1112
+ CACCAC
1113
+ CACCAG
1114
+ CACCAT
1115
+ CACCCA
1116
+ CACCCC
1117
+ CACCCG
1118
+ CACCCT
1119
+ CACCGA
1120
+ CACCGC
1121
+ CACCGG
1122
+ CACCGT
1123
+ CACCTA
1124
+ CACCTC
1125
+ CACCTG
1126
+ CACCTT
1127
+ CACGAA
1128
+ CACGAC
1129
+ CACGAG
1130
+ CACGAT
1131
+ CACGCA
1132
+ CACGCC
1133
+ CACGCG
1134
+ CACGCT
1135
+ CACGGA
1136
+ CACGGC
1137
+ CACGGG
1138
+ CACGGT
1139
+ CACGTA
1140
+ CACGTC
1141
+ CACGTG
1142
+ CACGTT
1143
+ CACTAA
1144
+ CACTAC
1145
+ CACTAG
1146
+ CACTAT
1147
+ CACTCA
1148
+ CACTCC
1149
+ CACTCG
1150
+ CACTCT
1151
+ CACTGA
1152
+ CACTGC
1153
+ CACTGG
1154
+ CACTGT
1155
+ CACTTA
1156
+ CACTTC
1157
+ CACTTG
1158
+ CACTTT
1159
+ CAGAAA
1160
+ CAGAAC
1161
+ CAGAAG
1162
+ CAGAAT
1163
+ CAGACA
1164
+ CAGACC
1165
+ CAGACG
1166
+ CAGACT
1167
+ CAGAGA
1168
+ CAGAGC
1169
+ CAGAGG
1170
+ CAGAGT
1171
+ CAGATA
1172
+ CAGATC
1173
+ CAGATG
1174
+ CAGATT
1175
+ CAGCAA
1176
+ CAGCAC
1177
+ CAGCAG
1178
+ CAGCAT
1179
+ CAGCCA
1180
+ CAGCCC
1181
+ CAGCCG
1182
+ CAGCCT
1183
+ CAGCGA
1184
+ CAGCGC
1185
+ CAGCGG
1186
+ CAGCGT
1187
+ CAGCTA
1188
+ CAGCTC
1189
+ CAGCTG
1190
+ CAGCTT
1191
+ CAGGAA
1192
+ CAGGAC
1193
+ CAGGAG
1194
+ CAGGAT
1195
+ CAGGCA
1196
+ CAGGCC
1197
+ CAGGCG
1198
+ CAGGCT
1199
+ CAGGGA
1200
+ CAGGGC
1201
+ CAGGGG
1202
+ CAGGGT
1203
+ CAGGTA
1204
+ CAGGTC
1205
+ CAGGTG
1206
+ CAGGTT
1207
+ CAGTAA
1208
+ CAGTAC
1209
+ CAGTAG
1210
+ CAGTAT
1211
+ CAGTCA
1212
+ CAGTCC
1213
+ CAGTCG
1214
+ CAGTCT
1215
+ CAGTGA
1216
+ CAGTGC
1217
+ CAGTGG
1218
+ CAGTGT
1219
+ CAGTTA
1220
+ CAGTTC
1221
+ CAGTTG
1222
+ CAGTTT
1223
+ CATAAA
1224
+ CATAAC
1225
+ CATAAG
1226
+ CATAAT
1227
+ CATACA
1228
+ CATACC
1229
+ CATACG
1230
+ CATACT
1231
+ CATAGA
1232
+ CATAGC
1233
+ CATAGG
1234
+ CATAGT
1235
+ CATATA
1236
+ CATATC
1237
+ CATATG
1238
+ CATATT
1239
+ CATCAA
1240
+ CATCAC
1241
+ CATCAG
1242
+ CATCAT
1243
+ CATCCA
1244
+ CATCCC
1245
+ CATCCG
1246
+ CATCCT
1247
+ CATCGA
1248
+ CATCGC
1249
+ CATCGG
1250
+ CATCGT
1251
+ CATCTA
1252
+ CATCTC
1253
+ CATCTG
1254
+ CATCTT
1255
+ CATGAA
1256
+ CATGAC
1257
+ CATGAG
1258
+ CATGAT
1259
+ CATGCA
1260
+ CATGCC
1261
+ CATGCG
1262
+ CATGCT
1263
+ CATGGA
1264
+ CATGGC
1265
+ CATGGG
1266
+ CATGGT
1267
+ CATGTA
1268
+ CATGTC
1269
+ CATGTG
1270
+ CATGTT
1271
+ CATTAA
1272
+ CATTAC
1273
+ CATTAG
1274
+ CATTAT
1275
+ CATTCA
1276
+ CATTCC
1277
+ CATTCG
1278
+ CATTCT
1279
+ CATTGA
1280
+ CATTGC
1281
+ CATTGG
1282
+ CATTGT
1283
+ CATTTA
1284
+ CATTTC
1285
+ CATTTG
1286
+ CATTTT
1287
+ CCAAAA
1288
+ CCAAAC
1289
+ CCAAAG
1290
+ CCAAAT
1291
+ CCAACA
1292
+ CCAACC
1293
+ CCAACG
1294
+ CCAACT
1295
+ CCAAGA
1296
+ CCAAGC
1297
+ CCAAGG
1298
+ CCAAGT
1299
+ CCAATA
1300
+ CCAATC
1301
+ CCAATG
1302
+ CCAATT
1303
+ CCACAA
1304
+ CCACAC
1305
+ CCACAG
1306
+ CCACAT
1307
+ CCACCA
1308
+ CCACCC
1309
+ CCACCG
1310
+ CCACCT
1311
+ CCACGA
1312
+ CCACGC
1313
+ CCACGG
1314
+ CCACGT
1315
+ CCACTA
1316
+ CCACTC
1317
+ CCACTG
1318
+ CCACTT
1319
+ CCAGAA
1320
+ CCAGAC
1321
+ CCAGAG
1322
+ CCAGAT
1323
+ CCAGCA
1324
+ CCAGCC
1325
+ CCAGCG
1326
+ CCAGCT
1327
+ CCAGGA
1328
+ CCAGGC
1329
+ CCAGGG
1330
+ CCAGGT
1331
+ CCAGTA
1332
+ CCAGTC
1333
+ CCAGTG
1334
+ CCAGTT
1335
+ CCATAA
1336
+ CCATAC
1337
+ CCATAG
1338
+ CCATAT
1339
+ CCATCA
1340
+ CCATCC
1341
+ CCATCG
1342
+ CCATCT
1343
+ CCATGA
1344
+ CCATGC
1345
+ CCATGG
1346
+ CCATGT
1347
+ CCATTA
1348
+ CCATTC
1349
+ CCATTG
1350
+ CCATTT
1351
+ CCCAAA
1352
+ CCCAAC
1353
+ CCCAAG
1354
+ CCCAAT
1355
+ CCCACA
1356
+ CCCACC
1357
+ CCCACG
1358
+ CCCACT
1359
+ CCCAGA
1360
+ CCCAGC
1361
+ CCCAGG
1362
+ CCCAGT
1363
+ CCCATA
1364
+ CCCATC
1365
+ CCCATG
1366
+ CCCATT
1367
+ CCCCAA
1368
+ CCCCAC
1369
+ CCCCAG
1370
+ CCCCAT
1371
+ CCCCCA
1372
+ CCCCCC
1373
+ CCCCCG
1374
+ CCCCCT
1375
+ CCCCGA
1376
+ CCCCGC
1377
+ CCCCGG
1378
+ CCCCGT
1379
+ CCCCTA
1380
+ CCCCTC
1381
+ CCCCTG
1382
+ CCCCTT
1383
+ CCCGAA
1384
+ CCCGAC
1385
+ CCCGAG
1386
+ CCCGAT
1387
+ CCCGCA
1388
+ CCCGCC
1389
+ CCCGCG
1390
+ CCCGCT
1391
+ CCCGGA
1392
+ CCCGGC
1393
+ CCCGGG
1394
+ CCCGGT
1395
+ CCCGTA
1396
+ CCCGTC
1397
+ CCCGTG
1398
+ CCCGTT
1399
+ CCCTAA
1400
+ CCCTAC
1401
+ CCCTAG
1402
+ CCCTAT
1403
+ CCCTCA
1404
+ CCCTCC
1405
+ CCCTCG
1406
+ CCCTCT
1407
+ CCCTGA
1408
+ CCCTGC
1409
+ CCCTGG
1410
+ CCCTGT
1411
+ CCCTTA
1412
+ CCCTTC
1413
+ CCCTTG
1414
+ CCCTTT
1415
+ CCGAAA
1416
+ CCGAAC
1417
+ CCGAAG
1418
+ CCGAAT
1419
+ CCGACA
1420
+ CCGACC
1421
+ CCGACG
1422
+ CCGACT
1423
+ CCGAGA
1424
+ CCGAGC
1425
+ CCGAGG
1426
+ CCGAGT
1427
+ CCGATA
1428
+ CCGATC
1429
+ CCGATG
1430
+ CCGATT
1431
+ CCGCAA
1432
+ CCGCAC
1433
+ CCGCAG
1434
+ CCGCAT
1435
+ CCGCCA
1436
+ CCGCCC
1437
+ CCGCCG
1438
+ CCGCCT
1439
+ CCGCGA
1440
+ CCGCGC
1441
+ CCGCGG
1442
+ CCGCGT
1443
+ CCGCTA
1444
+ CCGCTC
1445
+ CCGCTG
1446
+ CCGCTT
1447
+ CCGGAA
1448
+ CCGGAC
1449
+ CCGGAG
1450
+ CCGGAT
1451
+ CCGGCA
1452
+ CCGGCC
1453
+ CCGGCG
1454
+ CCGGCT
1455
+ CCGGGA
1456
+ CCGGGC
1457
+ CCGGGG
1458
+ CCGGGT
1459
+ CCGGTA
1460
+ CCGGTC
1461
+ CCGGTG
1462
+ CCGGTT
1463
+ CCGTAA
1464
+ CCGTAC
1465
+ CCGTAG
1466
+ CCGTAT
1467
+ CCGTCA
1468
+ CCGTCC
1469
+ CCGTCG
1470
+ CCGTCT
1471
+ CCGTGA
1472
+ CCGTGC
1473
+ CCGTGG
1474
+ CCGTGT
1475
+ CCGTTA
1476
+ CCGTTC
1477
+ CCGTTG
1478
+ CCGTTT
1479
+ CCTAAA
1480
+ CCTAAC
1481
+ CCTAAG
1482
+ CCTAAT
1483
+ CCTACA
1484
+ CCTACC
1485
+ CCTACG
1486
+ CCTACT
1487
+ CCTAGA
1488
+ CCTAGC
1489
+ CCTAGG
1490
+ CCTAGT
1491
+ CCTATA
1492
+ CCTATC
1493
+ CCTATG
1494
+ CCTATT
1495
+ CCTCAA
1496
+ CCTCAC
1497
+ CCTCAG
1498
+ CCTCAT
1499
+ CCTCCA
1500
+ CCTCCC
1501
+ CCTCCG
1502
+ CCTCCT
1503
+ CCTCGA
1504
+ CCTCGC
1505
+ CCTCGG
1506
+ CCTCGT
1507
+ CCTCTA
1508
+ CCTCTC
1509
+ CCTCTG
1510
+ CCTCTT
1511
+ CCTGAA
1512
+ CCTGAC
1513
+ CCTGAG
1514
+ CCTGAT
1515
+ CCTGCA
1516
+ CCTGCC
1517
+ CCTGCG
1518
+ CCTGCT
1519
+ CCTGGA
1520
+ CCTGGC
1521
+ CCTGGG
1522
+ CCTGGT
1523
+ CCTGTA
1524
+ CCTGTC
1525
+ CCTGTG
1526
+ CCTGTT
1527
+ CCTTAA
1528
+ CCTTAC
1529
+ CCTTAG
1530
+ CCTTAT
1531
+ CCTTCA
1532
+ CCTTCC
1533
+ CCTTCG
1534
+ CCTTCT
1535
+ CCTTGA
1536
+ CCTTGC
1537
+ CCTTGG
1538
+ CCTTGT
1539
+ CCTTTA
1540
+ CCTTTC
1541
+ CCTTTG
1542
+ CCTTTT
1543
+ CGAAAA
1544
+ CGAAAC
1545
+ CGAAAG
1546
+ CGAAAT
1547
+ CGAACA
1548
+ CGAACC
1549
+ CGAACG
1550
+ CGAACT
1551
+ CGAAGA
1552
+ CGAAGC
1553
+ CGAAGG
1554
+ CGAAGT
1555
+ CGAATA
1556
+ CGAATC
1557
+ CGAATG
1558
+ CGAATT
1559
+ CGACAA
1560
+ CGACAC
1561
+ CGACAG
1562
+ CGACAT
1563
+ CGACCA
1564
+ CGACCC
1565
+ CGACCG
1566
+ CGACCT
1567
+ CGACGA
1568
+ CGACGC
1569
+ CGACGG
1570
+ CGACGT
1571
+ CGACTA
1572
+ CGACTC
1573
+ CGACTG
1574
+ CGACTT
1575
+ CGAGAA
1576
+ CGAGAC
1577
+ CGAGAG
1578
+ CGAGAT
1579
+ CGAGCA
1580
+ CGAGCC
1581
+ CGAGCG
1582
+ CGAGCT
1583
+ CGAGGA
1584
+ CGAGGC
1585
+ CGAGGG
1586
+ CGAGGT
1587
+ CGAGTA
1588
+ CGAGTC
1589
+ CGAGTG
1590
+ CGAGTT
1591
+ CGATAA
1592
+ CGATAC
1593
+ CGATAG
1594
+ CGATAT
1595
+ CGATCA
1596
+ CGATCC
1597
+ CGATCG
1598
+ CGATCT
1599
+ CGATGA
1600
+ CGATGC
1601
+ CGATGG
1602
+ CGATGT
1603
+ CGATTA
1604
+ CGATTC
1605
+ CGATTG
1606
+ CGATTT
1607
+ CGCAAA
1608
+ CGCAAC
1609
+ CGCAAG
1610
+ CGCAAT
1611
+ CGCACA
1612
+ CGCACC
1613
+ CGCACG
1614
+ CGCACT
1615
+ CGCAGA
1616
+ CGCAGC
1617
+ CGCAGG
1618
+ CGCAGT
1619
+ CGCATA
1620
+ CGCATC
1621
+ CGCATG
1622
+ CGCATT
1623
+ CGCCAA
1624
+ CGCCAC
1625
+ CGCCAG
1626
+ CGCCAT
1627
+ CGCCCA
1628
+ CGCCCC
1629
+ CGCCCG
1630
+ CGCCCT
1631
+ CGCCGA
1632
+ CGCCGC
1633
+ CGCCGG
1634
+ CGCCGT
1635
+ CGCCTA
1636
+ CGCCTC
1637
+ CGCCTG
1638
+ CGCCTT
1639
+ CGCGAA
1640
+ CGCGAC
1641
+ CGCGAG
1642
+ CGCGAT
1643
+ CGCGCA
1644
+ CGCGCC
1645
+ CGCGCG
1646
+ CGCGCT
1647
+ CGCGGA
1648
+ CGCGGC
1649
+ CGCGGG
1650
+ CGCGGT
1651
+ CGCGTA
1652
+ CGCGTC
1653
+ CGCGTG
1654
+ CGCGTT
1655
+ CGCTAA
1656
+ CGCTAC
1657
+ CGCTAG
1658
+ CGCTAT
1659
+ CGCTCA
1660
+ CGCTCC
1661
+ CGCTCG
1662
+ CGCTCT
1663
+ CGCTGA
1664
+ CGCTGC
1665
+ CGCTGG
1666
+ CGCTGT
1667
+ CGCTTA
1668
+ CGCTTC
1669
+ CGCTTG
1670
+ CGCTTT
1671
+ CGGAAA
1672
+ CGGAAC
1673
+ CGGAAG
1674
+ CGGAAT
1675
+ CGGACA
1676
+ CGGACC
1677
+ CGGACG
1678
+ CGGACT
1679
+ CGGAGA
1680
+ CGGAGC
1681
+ CGGAGG
1682
+ CGGAGT
1683
+ CGGATA
1684
+ CGGATC
1685
+ CGGATG
1686
+ CGGATT
1687
+ CGGCAA
1688
+ CGGCAC
1689
+ CGGCAG
1690
+ CGGCAT
1691
+ CGGCCA
1692
+ CGGCCC
1693
+ CGGCCG
1694
+ CGGCCT
1695
+ CGGCGA
1696
+ CGGCGC
1697
+ CGGCGG
1698
+ CGGCGT
1699
+ CGGCTA
1700
+ CGGCTC
1701
+ CGGCTG
1702
+ CGGCTT
1703
+ CGGGAA
1704
+ CGGGAC
1705
+ CGGGAG
1706
+ CGGGAT
1707
+ CGGGCA
1708
+ CGGGCC
1709
+ CGGGCG
1710
+ CGGGCT
1711
+ CGGGGA
1712
+ CGGGGC
1713
+ CGGGGG
1714
+ CGGGGT
1715
+ CGGGTA
1716
+ CGGGTC
1717
+ CGGGTG
1718
+ CGGGTT
1719
+ CGGTAA
1720
+ CGGTAC
1721
+ CGGTAG
1722
+ CGGTAT
1723
+ CGGTCA
1724
+ CGGTCC
1725
+ CGGTCG
1726
+ CGGTCT
1727
+ CGGTGA
1728
+ CGGTGC
1729
+ CGGTGG
1730
+ CGGTGT
1731
+ CGGTTA
1732
+ CGGTTC
1733
+ CGGTTG
1734
+ CGGTTT
1735
+ CGTAAA
1736
+ CGTAAC
1737
+ CGTAAG
1738
+ CGTAAT
1739
+ CGTACA
1740
+ CGTACC
1741
+ CGTACG
1742
+ CGTACT
1743
+ CGTAGA
1744
+ CGTAGC
1745
+ CGTAGG
1746
+ CGTAGT
1747
+ CGTATA
1748
+ CGTATC
1749
+ CGTATG
1750
+ CGTATT
1751
+ CGTCAA
1752
+ CGTCAC
1753
+ CGTCAG
1754
+ CGTCAT
1755
+ CGTCCA
1756
+ CGTCCC
1757
+ CGTCCG
1758
+ CGTCCT
1759
+ CGTCGA
1760
+ CGTCGC
1761
+ CGTCGG
1762
+ CGTCGT
1763
+ CGTCTA
1764
+ CGTCTC
1765
+ CGTCTG
1766
+ CGTCTT
1767
+ CGTGAA
1768
+ CGTGAC
1769
+ CGTGAG
1770
+ CGTGAT
1771
+ CGTGCA
1772
+ CGTGCC
1773
+ CGTGCG
1774
+ CGTGCT
1775
+ CGTGGA
1776
+ CGTGGC
1777
+ CGTGGG
1778
+ CGTGGT
1779
+ CGTGTA
1780
+ CGTGTC
1781
+ CGTGTG
1782
+ CGTGTT
1783
+ CGTTAA
1784
+ CGTTAC
1785
+ CGTTAG
1786
+ CGTTAT
1787
+ CGTTCA
1788
+ CGTTCC
1789
+ CGTTCG
1790
+ CGTTCT
1791
+ CGTTGA
1792
+ CGTTGC
1793
+ CGTTGG
1794
+ CGTTGT
1795
+ CGTTTA
1796
+ CGTTTC
1797
+ CGTTTG
1798
+ CGTTTT
1799
+ CTAAAA
1800
+ CTAAAC
1801
+ CTAAAG
1802
+ CTAAAT
1803
+ CTAACA
1804
+ CTAACC
1805
+ CTAACG
1806
+ CTAACT
1807
+ CTAAGA
1808
+ CTAAGC
1809
+ CTAAGG
1810
+ CTAAGT
1811
+ CTAATA
1812
+ CTAATC
1813
+ CTAATG
1814
+ CTAATT
1815
+ CTACAA
1816
+ CTACAC
1817
+ CTACAG
1818
+ CTACAT
1819
+ CTACCA
1820
+ CTACCC
1821
+ CTACCG
1822
+ CTACCT
1823
+ CTACGA
1824
+ CTACGC
1825
+ CTACGG
1826
+ CTACGT
1827
+ CTACTA
1828
+ CTACTC
1829
+ CTACTG
1830
+ CTACTT
1831
+ CTAGAA
1832
+ CTAGAC
1833
+ CTAGAG
1834
+ CTAGAT
1835
+ CTAGCA
1836
+ CTAGCC
1837
+ CTAGCG
1838
+ CTAGCT
1839
+ CTAGGA
1840
+ CTAGGC
1841
+ CTAGGG
1842
+ CTAGGT
1843
+ CTAGTA
1844
+ CTAGTC
1845
+ CTAGTG
1846
+ CTAGTT
1847
+ CTATAA
1848
+ CTATAC
1849
+ CTATAG
1850
+ CTATAT
1851
+ CTATCA
1852
+ CTATCC
1853
+ CTATCG
1854
+ CTATCT
1855
+ CTATGA
1856
+ CTATGC
1857
+ CTATGG
1858
+ CTATGT
1859
+ CTATTA
1860
+ CTATTC
1861
+ CTATTG
1862
+ CTATTT
1863
+ CTCAAA
1864
+ CTCAAC
1865
+ CTCAAG
1866
+ CTCAAT
1867
+ CTCACA
1868
+ CTCACC
1869
+ CTCACG
1870
+ CTCACT
1871
+ CTCAGA
1872
+ CTCAGC
1873
+ CTCAGG
1874
+ CTCAGT
1875
+ CTCATA
1876
+ CTCATC
1877
+ CTCATG
1878
+ CTCATT
1879
+ CTCCAA
1880
+ CTCCAC
1881
+ CTCCAG
1882
+ CTCCAT
1883
+ CTCCCA
1884
+ CTCCCC
1885
+ CTCCCG
1886
+ CTCCCT
1887
+ CTCCGA
1888
+ CTCCGC
1889
+ CTCCGG
1890
+ CTCCGT
1891
+ CTCCTA
1892
+ CTCCTC
1893
+ CTCCTG
1894
+ CTCCTT
1895
+ CTCGAA
1896
+ CTCGAC
1897
+ CTCGAG
1898
+ CTCGAT
1899
+ CTCGCA
1900
+ CTCGCC
1901
+ CTCGCG
1902
+ CTCGCT
1903
+ CTCGGA
1904
+ CTCGGC
1905
+ CTCGGG
1906
+ CTCGGT
1907
+ CTCGTA
1908
+ CTCGTC
1909
+ CTCGTG
1910
+ CTCGTT
1911
+ CTCTAA
1912
+ CTCTAC
1913
+ CTCTAG
1914
+ CTCTAT
1915
+ CTCTCA
1916
+ CTCTCC
1917
+ CTCTCG
1918
+ CTCTCT
1919
+ CTCTGA
1920
+ CTCTGC
1921
+ CTCTGG
1922
+ CTCTGT
1923
+ CTCTTA
1924
+ CTCTTC
1925
+ CTCTTG
1926
+ CTCTTT
1927
+ CTGAAA
1928
+ CTGAAC
1929
+ CTGAAG
1930
+ CTGAAT
1931
+ CTGACA
1932
+ CTGACC
1933
+ CTGACG
1934
+ CTGACT
1935
+ CTGAGA
1936
+ CTGAGC
1937
+ CTGAGG
1938
+ CTGAGT
1939
+ CTGATA
1940
+ CTGATC
1941
+ CTGATG
1942
+ CTGATT
1943
+ CTGCAA
1944
+ CTGCAC
1945
+ CTGCAG
1946
+ CTGCAT
1947
+ CTGCCA
1948
+ CTGCCC
1949
+ CTGCCG
1950
+ CTGCCT
1951
+ CTGCGA
1952
+ CTGCGC
1953
+ CTGCGG
1954
+ CTGCGT
1955
+ CTGCTA
1956
+ CTGCTC
1957
+ CTGCTG
1958
+ CTGCTT
1959
+ CTGGAA
1960
+ CTGGAC
1961
+ CTGGAG
1962
+ CTGGAT
1963
+ CTGGCA
1964
+ CTGGCC
1965
+ CTGGCG
1966
+ CTGGCT
1967
+ CTGGGA
1968
+ CTGGGC
1969
+ CTGGGG
1970
+ CTGGGT
1971
+ CTGGTA
1972
+ CTGGTC
1973
+ CTGGTG
1974
+ CTGGTT
1975
+ CTGTAA
1976
+ CTGTAC
1977
+ CTGTAG
1978
+ CTGTAT
1979
+ CTGTCA
1980
+ CTGTCC
1981
+ CTGTCG
1982
+ CTGTCT
1983
+ CTGTGA
1984
+ CTGTGC
1985
+ CTGTGG
1986
+ CTGTGT
1987
+ CTGTTA
1988
+ CTGTTC
1989
+ CTGTTG
1990
+ CTGTTT
1991
+ CTTAAA
1992
+ CTTAAC
1993
+ CTTAAG
1994
+ CTTAAT
1995
+ CTTACA
1996
+ CTTACC
1997
+ CTTACG
1998
+ CTTACT
1999
+ CTTAGA
2000
+ CTTAGC
2001
+ CTTAGG
2002
+ CTTAGT
2003
+ CTTATA
2004
+ CTTATC
2005
+ CTTATG
2006
+ CTTATT
2007
+ CTTCAA
2008
+ CTTCAC
2009
+ CTTCAG
2010
+ CTTCAT
2011
+ CTTCCA
2012
+ CTTCCC
2013
+ CTTCCG
2014
+ CTTCCT
2015
+ CTTCGA
2016
+ CTTCGC
2017
+ CTTCGG
2018
+ CTTCGT
2019
+ CTTCTA
2020
+ CTTCTC
2021
+ CTTCTG
2022
+ CTTCTT
2023
+ CTTGAA
2024
+ CTTGAC
2025
+ CTTGAG
2026
+ CTTGAT
2027
+ CTTGCA
2028
+ CTTGCC
2029
+ CTTGCG
2030
+ CTTGCT
2031
+ CTTGGA
2032
+ CTTGGC
2033
+ CTTGGG
2034
+ CTTGGT
2035
+ CTTGTA
2036
+ CTTGTC
2037
+ CTTGTG
2038
+ CTTGTT
2039
+ CTTTAA
2040
+ CTTTAC
2041
+ CTTTAG
2042
+ CTTTAT
2043
+ CTTTCA
2044
+ CTTTCC
2045
+ CTTTCG
2046
+ CTTTCT
2047
+ CTTTGA
2048
+ CTTTGC
2049
+ CTTTGG
2050
+ CTTTGT
2051
+ CTTTTA
2052
+ CTTTTC
2053
+ CTTTTG
2054
+ CTTTTT
2055
+ GAAAAA
2056
+ GAAAAC
2057
+ GAAAAG
2058
+ GAAAAT
2059
+ GAAACA
2060
+ GAAACC
2061
+ GAAACG
2062
+ GAAACT
2063
+ GAAAGA
2064
+ GAAAGC
2065
+ GAAAGG
2066
+ GAAAGT
2067
+ GAAATA
2068
+ GAAATC
2069
+ GAAATG
2070
+ GAAATT
2071
+ GAACAA
2072
+ GAACAC
2073
+ GAACAG
2074
+ GAACAT
2075
+ GAACCA
2076
+ GAACCC
2077
+ GAACCG
2078
+ GAACCT
2079
+ GAACGA
2080
+ GAACGC
2081
+ GAACGG
2082
+ GAACGT
2083
+ GAACTA
2084
+ GAACTC
2085
+ GAACTG
2086
+ GAACTT
2087
+ GAAGAA
2088
+ GAAGAC
2089
+ GAAGAG
2090
+ GAAGAT
2091
+ GAAGCA
2092
+ GAAGCC
2093
+ GAAGCG
2094
+ GAAGCT
2095
+ GAAGGA
2096
+ GAAGGC
2097
+ GAAGGG
2098
+ GAAGGT
2099
+ GAAGTA
2100
+ GAAGTC
2101
+ GAAGTG
2102
+ GAAGTT
2103
+ GAATAA
2104
+ GAATAC
2105
+ GAATAG
2106
+ GAATAT
2107
+ GAATCA
2108
+ GAATCC
2109
+ GAATCG
2110
+ GAATCT
2111
+ GAATGA
2112
+ GAATGC
2113
+ GAATGG
2114
+ GAATGT
2115
+ GAATTA
2116
+ GAATTC
2117
+ GAATTG
2118
+ GAATTT
2119
+ GACAAA
2120
+ GACAAC
2121
+ GACAAG
2122
+ GACAAT
2123
+ GACACA
2124
+ GACACC
2125
+ GACACG
2126
+ GACACT
2127
+ GACAGA
2128
+ GACAGC
2129
+ GACAGG
2130
+ GACAGT
2131
+ GACATA
2132
+ GACATC
2133
+ GACATG
2134
+ GACATT
2135
+ GACCAA
2136
+ GACCAC
2137
+ GACCAG
2138
+ GACCAT
2139
+ GACCCA
2140
+ GACCCC
2141
+ GACCCG
2142
+ GACCCT
2143
+ GACCGA
2144
+ GACCGC
2145
+ GACCGG
2146
+ GACCGT
2147
+ GACCTA
2148
+ GACCTC
2149
+ GACCTG
2150
+ GACCTT
2151
+ GACGAA
2152
+ GACGAC
2153
+ GACGAG
2154
+ GACGAT
2155
+ GACGCA
2156
+ GACGCC
2157
+ GACGCG
2158
+ GACGCT
2159
+ GACGGA
2160
+ GACGGC
2161
+ GACGGG
2162
+ GACGGT
2163
+ GACGTA
2164
+ GACGTC
2165
+ GACGTG
2166
+ GACGTT
2167
+ GACTAA
2168
+ GACTAC
2169
+ GACTAG
2170
+ GACTAT
2171
+ GACTCA
2172
+ GACTCC
2173
+ GACTCG
2174
+ GACTCT
2175
+ GACTGA
2176
+ GACTGC
2177
+ GACTGG
2178
+ GACTGT
2179
+ GACTTA
2180
+ GACTTC
2181
+ GACTTG
2182
+ GACTTT
2183
+ GAGAAA
2184
+ GAGAAC
2185
+ GAGAAG
2186
+ GAGAAT
2187
+ GAGACA
2188
+ GAGACC
2189
+ GAGACG
2190
+ GAGACT
2191
+ GAGAGA
2192
+ GAGAGC
2193
+ GAGAGG
2194
+ GAGAGT
2195
+ GAGATA
2196
+ GAGATC
2197
+ GAGATG
2198
+ GAGATT
2199
+ GAGCAA
2200
+ GAGCAC
2201
+ GAGCAG
2202
+ GAGCAT
2203
+ GAGCCA
2204
+ GAGCCC
2205
+ GAGCCG
2206
+ GAGCCT
2207
+ GAGCGA
2208
+ GAGCGC
2209
+ GAGCGG
2210
+ GAGCGT
2211
+ GAGCTA
2212
+ GAGCTC
2213
+ GAGCTG
2214
+ GAGCTT
2215
+ GAGGAA
2216
+ GAGGAC
2217
+ GAGGAG
2218
+ GAGGAT
2219
+ GAGGCA
2220
+ GAGGCC
2221
+ GAGGCG
2222
+ GAGGCT
2223
+ GAGGGA
2224
+ GAGGGC
2225
+ GAGGGG
2226
+ GAGGGT
2227
+ GAGGTA
2228
+ GAGGTC
2229
+ GAGGTG
2230
+ GAGGTT
2231
+ GAGTAA
2232
+ GAGTAC
2233
+ GAGTAG
2234
+ GAGTAT
2235
+ GAGTCA
2236
+ GAGTCC
2237
+ GAGTCG
2238
+ GAGTCT
2239
+ GAGTGA
2240
+ GAGTGC
2241
+ GAGTGG
2242
+ GAGTGT
2243
+ GAGTTA
2244
+ GAGTTC
2245
+ GAGTTG
2246
+ GAGTTT
2247
+ GATAAA
2248
+ GATAAC
2249
+ GATAAG
2250
+ GATAAT
2251
+ GATACA
2252
+ GATACC
2253
+ GATACG
2254
+ GATACT
2255
+ GATAGA
2256
+ GATAGC
2257
+ GATAGG
2258
+ GATAGT
2259
+ GATATA
2260
+ GATATC
2261
+ GATATG
2262
+ GATATT
2263
+ GATCAA
2264
+ GATCAC
2265
+ GATCAG
2266
+ GATCAT
2267
+ GATCCA
2268
+ GATCCC
2269
+ GATCCG
2270
+ GATCCT
2271
+ GATCGA
2272
+ GATCGC
2273
+ GATCGG
2274
+ GATCGT
2275
+ GATCTA
2276
+ GATCTC
2277
+ GATCTG
2278
+ GATCTT
2279
+ GATGAA
2280
+ GATGAC
2281
+ GATGAG
2282
+ GATGAT
2283
+ GATGCA
2284
+ GATGCC
2285
+ GATGCG
2286
+ GATGCT
2287
+ GATGGA
2288
+ GATGGC
2289
+ GATGGG
2290
+ GATGGT
2291
+ GATGTA
2292
+ GATGTC
2293
+ GATGTG
2294
+ GATGTT
2295
+ GATTAA
2296
+ GATTAC
2297
+ GATTAG
2298
+ GATTAT
2299
+ GATTCA
2300
+ GATTCC
2301
+ GATTCG
2302
+ GATTCT
2303
+ GATTGA
2304
+ GATTGC
2305
+ GATTGG
2306
+ GATTGT
2307
+ GATTTA
2308
+ GATTTC
2309
+ GATTTG
2310
+ GATTTT
2311
+ GCAAAA
2312
+ GCAAAC
2313
+ GCAAAG
2314
+ GCAAAT
2315
+ GCAACA
2316
+ GCAACC
2317
+ GCAACG
2318
+ GCAACT
2319
+ GCAAGA
2320
+ GCAAGC
2321
+ GCAAGG
2322
+ GCAAGT
2323
+ GCAATA
2324
+ GCAATC
2325
+ GCAATG
2326
+ GCAATT
2327
+ GCACAA
2328
+ GCACAC
2329
+ GCACAG
2330
+ GCACAT
2331
+ GCACCA
2332
+ GCACCC
2333
+ GCACCG
2334
+ GCACCT
2335
+ GCACGA
2336
+ GCACGC
2337
+ GCACGG
2338
+ GCACGT
2339
+ GCACTA
2340
+ GCACTC
2341
+ GCACTG
2342
+ GCACTT
2343
+ GCAGAA
2344
+ GCAGAC
2345
+ GCAGAG
2346
+ GCAGAT
2347
+ GCAGCA
2348
+ GCAGCC
2349
+ GCAGCG
2350
+ GCAGCT
2351
+ GCAGGA
2352
+ GCAGGC
2353
+ GCAGGG
2354
+ GCAGGT
2355
+ GCAGTA
2356
+ GCAGTC
2357
+ GCAGTG
2358
+ GCAGTT
2359
+ GCATAA
2360
+ GCATAC
2361
+ GCATAG
2362
+ GCATAT
2363
+ GCATCA
2364
+ GCATCC
2365
+ GCATCG
2366
+ GCATCT
2367
+ GCATGA
2368
+ GCATGC
2369
+ GCATGG
2370
+ GCATGT
2371
+ GCATTA
2372
+ GCATTC
2373
+ GCATTG
2374
+ GCATTT
2375
+ GCCAAA
2376
+ GCCAAC
2377
+ GCCAAG
2378
+ GCCAAT
2379
+ GCCACA
2380
+ GCCACC
2381
+ GCCACG
2382
+ GCCACT
2383
+ GCCAGA
2384
+ GCCAGC
2385
+ GCCAGG
2386
+ GCCAGT
2387
+ GCCATA
2388
+ GCCATC
2389
+ GCCATG
2390
+ GCCATT
2391
+ GCCCAA
2392
+ GCCCAC
2393
+ GCCCAG
2394
+ GCCCAT
2395
+ GCCCCA
2396
+ GCCCCC
2397
+ GCCCCG
2398
+ GCCCCT
2399
+ GCCCGA
2400
+ GCCCGC
2401
+ GCCCGG
2402
+ GCCCGT
2403
+ GCCCTA
2404
+ GCCCTC
2405
+ GCCCTG
2406
+ GCCCTT
2407
+ GCCGAA
2408
+ GCCGAC
2409
+ GCCGAG
2410
+ GCCGAT
2411
+ GCCGCA
2412
+ GCCGCC
2413
+ GCCGCG
2414
+ GCCGCT
2415
+ GCCGGA
2416
+ GCCGGC
2417
+ GCCGGG
2418
+ GCCGGT
2419
+ GCCGTA
2420
+ GCCGTC
2421
+ GCCGTG
2422
+ GCCGTT
2423
+ GCCTAA
2424
+ GCCTAC
2425
+ GCCTAG
2426
+ GCCTAT
2427
+ GCCTCA
2428
+ GCCTCC
2429
+ GCCTCG
2430
+ GCCTCT
2431
+ GCCTGA
2432
+ GCCTGC
2433
+ GCCTGG
2434
+ GCCTGT
2435
+ GCCTTA
2436
+ GCCTTC
2437
+ GCCTTG
2438
+ GCCTTT
2439
+ GCGAAA
2440
+ GCGAAC
2441
+ GCGAAG
2442
+ GCGAAT
2443
+ GCGACA
2444
+ GCGACC
2445
+ GCGACG
2446
+ GCGACT
2447
+ GCGAGA
2448
+ GCGAGC
2449
+ GCGAGG
2450
+ GCGAGT
2451
+ GCGATA
2452
+ GCGATC
2453
+ GCGATG
2454
+ GCGATT
2455
+ GCGCAA
2456
+ GCGCAC
2457
+ GCGCAG
2458
+ GCGCAT
2459
+ GCGCCA
2460
+ GCGCCC
2461
+ GCGCCG
2462
+ GCGCCT
2463
+ GCGCGA
2464
+ GCGCGC
2465
+ GCGCGG
2466
+ GCGCGT
2467
+ GCGCTA
2468
+ GCGCTC
2469
+ GCGCTG
2470
+ GCGCTT
2471
+ GCGGAA
2472
+ GCGGAC
2473
+ GCGGAG
2474
+ GCGGAT
2475
+ GCGGCA
2476
+ GCGGCC
2477
+ GCGGCG
2478
+ GCGGCT
2479
+ GCGGGA
2480
+ GCGGGC
2481
+ GCGGGG
2482
+ GCGGGT
2483
+ GCGGTA
2484
+ GCGGTC
2485
+ GCGGTG
2486
+ GCGGTT
2487
+ GCGTAA
2488
+ GCGTAC
2489
+ GCGTAG
2490
+ GCGTAT
2491
+ GCGTCA
2492
+ GCGTCC
2493
+ GCGTCG
2494
+ GCGTCT
2495
+ GCGTGA
2496
+ GCGTGC
2497
+ GCGTGG
2498
+ GCGTGT
2499
+ GCGTTA
2500
+ GCGTTC
2501
+ GCGTTG
2502
+ GCGTTT
2503
+ GCTAAA
2504
+ GCTAAC
2505
+ GCTAAG
2506
+ GCTAAT
2507
+ GCTACA
2508
+ GCTACC
2509
+ GCTACG
2510
+ GCTACT
2511
+ GCTAGA
2512
+ GCTAGC
2513
+ GCTAGG
2514
+ GCTAGT
2515
+ GCTATA
2516
+ GCTATC
2517
+ GCTATG
2518
+ GCTATT
2519
+ GCTCAA
2520
+ GCTCAC
2521
+ GCTCAG
2522
+ GCTCAT
2523
+ GCTCCA
2524
+ GCTCCC
2525
+ GCTCCG
2526
+ GCTCCT
2527
+ GCTCGA
2528
+ GCTCGC
2529
+ GCTCGG
2530
+ GCTCGT
2531
+ GCTCTA
2532
+ GCTCTC
2533
+ GCTCTG
2534
+ GCTCTT
2535
+ GCTGAA
2536
+ GCTGAC
2537
+ GCTGAG
2538
+ GCTGAT
2539
+ GCTGCA
2540
+ GCTGCC
2541
+ GCTGCG
2542
+ GCTGCT
2543
+ GCTGGA
2544
+ GCTGGC
2545
+ GCTGGG
2546
+ GCTGGT
2547
+ GCTGTA
2548
+ GCTGTC
2549
+ GCTGTG
2550
+ GCTGTT
2551
+ GCTTAA
2552
+ GCTTAC
2553
+ GCTTAG
2554
+ GCTTAT
2555
+ GCTTCA
2556
+ GCTTCC
2557
+ GCTTCG
2558
+ GCTTCT
2559
+ GCTTGA
2560
+ GCTTGC
2561
+ GCTTGG
2562
+ GCTTGT
2563
+ GCTTTA
2564
+ GCTTTC
2565
+ GCTTTG
2566
+ GCTTTT
2567
+ GGAAAA
2568
+ GGAAAC
2569
+ GGAAAG
2570
+ GGAAAT
2571
+ GGAACA
2572
+ GGAACC
2573
+ GGAACG
2574
+ GGAACT
2575
+ GGAAGA
2576
+ GGAAGC
2577
+ GGAAGG
2578
+ GGAAGT
2579
+ GGAATA
2580
+ GGAATC
2581
+ GGAATG
2582
+ GGAATT
2583
+ GGACAA
2584
+ GGACAC
2585
+ GGACAG
2586
+ GGACAT
2587
+ GGACCA
2588
+ GGACCC
2589
+ GGACCG
2590
+ GGACCT
2591
+ GGACGA
2592
+ GGACGC
2593
+ GGACGG
2594
+ GGACGT
2595
+ GGACTA
2596
+ GGACTC
2597
+ GGACTG
2598
+ GGACTT
2599
+ GGAGAA
2600
+ GGAGAC
2601
+ GGAGAG
2602
+ GGAGAT
2603
+ GGAGCA
2604
+ GGAGCC
2605
+ GGAGCG
2606
+ GGAGCT
2607
+ GGAGGA
2608
+ GGAGGC
2609
+ GGAGGG
2610
+ GGAGGT
2611
+ GGAGTA
2612
+ GGAGTC
2613
+ GGAGTG
2614
+ GGAGTT
2615
+ GGATAA
2616
+ GGATAC
2617
+ GGATAG
2618
+ GGATAT
2619
+ GGATCA
2620
+ GGATCC
2621
+ GGATCG
2622
+ GGATCT
2623
+ GGATGA
2624
+ GGATGC
2625
+ GGATGG
2626
+ GGATGT
2627
+ GGATTA
2628
+ GGATTC
2629
+ GGATTG
2630
+ GGATTT
2631
+ GGCAAA
2632
+ GGCAAC
2633
+ GGCAAG
2634
+ GGCAAT
2635
+ GGCACA
2636
+ GGCACC
2637
+ GGCACG
2638
+ GGCACT
2639
+ GGCAGA
2640
+ GGCAGC
2641
+ GGCAGG
2642
+ GGCAGT
2643
+ GGCATA
2644
+ GGCATC
2645
+ GGCATG
2646
+ GGCATT
2647
+ GGCCAA
2648
+ GGCCAC
2649
+ GGCCAG
2650
+ GGCCAT
2651
+ GGCCCA
2652
+ GGCCCC
2653
+ GGCCCG
2654
+ GGCCCT
2655
+ GGCCGA
2656
+ GGCCGC
2657
+ GGCCGG
2658
+ GGCCGT
2659
+ GGCCTA
2660
+ GGCCTC
2661
+ GGCCTG
2662
+ GGCCTT
2663
+ GGCGAA
2664
+ GGCGAC
2665
+ GGCGAG
2666
+ GGCGAT
2667
+ GGCGCA
2668
+ GGCGCC
2669
+ GGCGCG
2670
+ GGCGCT
2671
+ GGCGGA
2672
+ GGCGGC
2673
+ GGCGGG
2674
+ GGCGGT
2675
+ GGCGTA
2676
+ GGCGTC
2677
+ GGCGTG
2678
+ GGCGTT
2679
+ GGCTAA
2680
+ GGCTAC
2681
+ GGCTAG
2682
+ GGCTAT
2683
+ GGCTCA
2684
+ GGCTCC
2685
+ GGCTCG
2686
+ GGCTCT
2687
+ GGCTGA
2688
+ GGCTGC
2689
+ GGCTGG
2690
+ GGCTGT
2691
+ GGCTTA
2692
+ GGCTTC
2693
+ GGCTTG
2694
+ GGCTTT
2695
+ GGGAAA
2696
+ GGGAAC
2697
+ GGGAAG
2698
+ GGGAAT
2699
+ GGGACA
2700
+ GGGACC
2701
+ GGGACG
2702
+ GGGACT
2703
+ GGGAGA
2704
+ GGGAGC
2705
+ GGGAGG
2706
+ GGGAGT
2707
+ GGGATA
2708
+ GGGATC
2709
+ GGGATG
2710
+ GGGATT
2711
+ GGGCAA
2712
+ GGGCAC
2713
+ GGGCAG
2714
+ GGGCAT
2715
+ GGGCCA
2716
+ GGGCCC
2717
+ GGGCCG
2718
+ GGGCCT
2719
+ GGGCGA
2720
+ GGGCGC
2721
+ GGGCGG
2722
+ GGGCGT
2723
+ GGGCTA
2724
+ GGGCTC
2725
+ GGGCTG
2726
+ GGGCTT
2727
+ GGGGAA
2728
+ GGGGAC
2729
+ GGGGAG
2730
+ GGGGAT
2731
+ GGGGCA
2732
+ GGGGCC
2733
+ GGGGCG
2734
+ GGGGCT
2735
+ GGGGGA
2736
+ GGGGGC
2737
+ GGGGGG
2738
+ GGGGGT
2739
+ GGGGTA
2740
+ GGGGTC
2741
+ GGGGTG
2742
+ GGGGTT
2743
+ GGGTAA
2744
+ GGGTAC
2745
+ GGGTAG
2746
+ GGGTAT
2747
+ GGGTCA
2748
+ GGGTCC
2749
+ GGGTCG
2750
+ GGGTCT
2751
+ GGGTGA
2752
+ GGGTGC
2753
+ GGGTGG
2754
+ GGGTGT
2755
+ GGGTTA
2756
+ GGGTTC
2757
+ GGGTTG
2758
+ GGGTTT
2759
+ GGTAAA
2760
+ GGTAAC
2761
+ GGTAAG
2762
+ GGTAAT
2763
+ GGTACA
2764
+ GGTACC
2765
+ GGTACG
2766
+ GGTACT
2767
+ GGTAGA
2768
+ GGTAGC
2769
+ GGTAGG
2770
+ GGTAGT
2771
+ GGTATA
2772
+ GGTATC
2773
+ GGTATG
2774
+ GGTATT
2775
+ GGTCAA
2776
+ GGTCAC
2777
+ GGTCAG
2778
+ GGTCAT
2779
+ GGTCCA
2780
+ GGTCCC
2781
+ GGTCCG
2782
+ GGTCCT
2783
+ GGTCGA
2784
+ GGTCGC
2785
+ GGTCGG
2786
+ GGTCGT
2787
+ GGTCTA
2788
+ GGTCTC
2789
+ GGTCTG
2790
+ GGTCTT
2791
+ GGTGAA
2792
+ GGTGAC
2793
+ GGTGAG
2794
+ GGTGAT
2795
+ GGTGCA
2796
+ GGTGCC
2797
+ GGTGCG
2798
+ GGTGCT
2799
+ GGTGGA
2800
+ GGTGGC
2801
+ GGTGGG
2802
+ GGTGGT
2803
+ GGTGTA
2804
+ GGTGTC
2805
+ GGTGTG
2806
+ GGTGTT
2807
+ GGTTAA
2808
+ GGTTAC
2809
+ GGTTAG
2810
+ GGTTAT
2811
+ GGTTCA
2812
+ GGTTCC
2813
+ GGTTCG
2814
+ GGTTCT
2815
+ GGTTGA
2816
+ GGTTGC
2817
+ GGTTGG
2818
+ GGTTGT
2819
+ GGTTTA
2820
+ GGTTTC
2821
+ GGTTTG
2822
+ GGTTTT
2823
+ GTAAAA
2824
+ GTAAAC
2825
+ GTAAAG
2826
+ GTAAAT
2827
+ GTAACA
2828
+ GTAACC
2829
+ GTAACG
2830
+ GTAACT
2831
+ GTAAGA
2832
+ GTAAGC
2833
+ GTAAGG
2834
+ GTAAGT
2835
+ GTAATA
2836
+ GTAATC
2837
+ GTAATG
2838
+ GTAATT
2839
+ GTACAA
2840
+ GTACAC
2841
+ GTACAG
2842
+ GTACAT
2843
+ GTACCA
2844
+ GTACCC
2845
+ GTACCG
2846
+ GTACCT
2847
+ GTACGA
2848
+ GTACGC
2849
+ GTACGG
2850
+ GTACGT
2851
+ GTACTA
2852
+ GTACTC
2853
+ GTACTG
2854
+ GTACTT
2855
+ GTAGAA
2856
+ GTAGAC
2857
+ GTAGAG
2858
+ GTAGAT
2859
+ GTAGCA
2860
+ GTAGCC
2861
+ GTAGCG
2862
+ GTAGCT
2863
+ GTAGGA
2864
+ GTAGGC
2865
+ GTAGGG
2866
+ GTAGGT
2867
+ GTAGTA
2868
+ GTAGTC
2869
+ GTAGTG
2870
+ GTAGTT
2871
+ GTATAA
2872
+ GTATAC
2873
+ GTATAG
2874
+ GTATAT
2875
+ GTATCA
2876
+ GTATCC
2877
+ GTATCG
2878
+ GTATCT
2879
+ GTATGA
2880
+ GTATGC
2881
+ GTATGG
2882
+ GTATGT
2883
+ GTATTA
2884
+ GTATTC
2885
+ GTATTG
2886
+ GTATTT
2887
+ GTCAAA
2888
+ GTCAAC
2889
+ GTCAAG
2890
+ GTCAAT
2891
+ GTCACA
2892
+ GTCACC
2893
+ GTCACG
2894
+ GTCACT
2895
+ GTCAGA
2896
+ GTCAGC
2897
+ GTCAGG
2898
+ GTCAGT
2899
+ GTCATA
2900
+ GTCATC
2901
+ GTCATG
2902
+ GTCATT
2903
+ GTCCAA
2904
+ GTCCAC
2905
+ GTCCAG
2906
+ GTCCAT
2907
+ GTCCCA
2908
+ GTCCCC
2909
+ GTCCCG
2910
+ GTCCCT
2911
+ GTCCGA
2912
+ GTCCGC
2913
+ GTCCGG
2914
+ GTCCGT
2915
+ GTCCTA
2916
+ GTCCTC
2917
+ GTCCTG
2918
+ GTCCTT
2919
+ GTCGAA
2920
+ GTCGAC
2921
+ GTCGAG
2922
+ GTCGAT
2923
+ GTCGCA
2924
+ GTCGCC
2925
+ GTCGCG
2926
+ GTCGCT
2927
+ GTCGGA
2928
+ GTCGGC
2929
+ GTCGGG
2930
+ GTCGGT
2931
+ GTCGTA
2932
+ GTCGTC
2933
+ GTCGTG
2934
+ GTCGTT
2935
+ GTCTAA
2936
+ GTCTAC
2937
+ GTCTAG
2938
+ GTCTAT
2939
+ GTCTCA
2940
+ GTCTCC
2941
+ GTCTCG
2942
+ GTCTCT
2943
+ GTCTGA
2944
+ GTCTGC
2945
+ GTCTGG
2946
+ GTCTGT
2947
+ GTCTTA
2948
+ GTCTTC
2949
+ GTCTTG
2950
+ GTCTTT
2951
+ GTGAAA
2952
+ GTGAAC
2953
+ GTGAAG
2954
+ GTGAAT
2955
+ GTGACA
2956
+ GTGACC
2957
+ GTGACG
2958
+ GTGACT
2959
+ GTGAGA
2960
+ GTGAGC
2961
+ GTGAGG
2962
+ GTGAGT
2963
+ GTGATA
2964
+ GTGATC
2965
+ GTGATG
2966
+ GTGATT
2967
+ GTGCAA
2968
+ GTGCAC
2969
+ GTGCAG
2970
+ GTGCAT
2971
+ GTGCCA
2972
+ GTGCCC
2973
+ GTGCCG
2974
+ GTGCCT
2975
+ GTGCGA
2976
+ GTGCGC
2977
+ GTGCGG
2978
+ GTGCGT
2979
+ GTGCTA
2980
+ GTGCTC
2981
+ GTGCTG
2982
+ GTGCTT
2983
+ GTGGAA
2984
+ GTGGAC
2985
+ GTGGAG
2986
+ GTGGAT
2987
+ GTGGCA
2988
+ GTGGCC
2989
+ GTGGCG
2990
+ GTGGCT
2991
+ GTGGGA
2992
+ GTGGGC
2993
+ GTGGGG
2994
+ GTGGGT
2995
+ GTGGTA
2996
+ GTGGTC
2997
+ GTGGTG
2998
+ GTGGTT
2999
+ GTGTAA
3000
+ GTGTAC
3001
+ GTGTAG
3002
+ GTGTAT
3003
+ GTGTCA
3004
+ GTGTCC
3005
+ GTGTCG
3006
+ GTGTCT
3007
+ GTGTGA
3008
+ GTGTGC
3009
+ GTGTGG
3010
+ GTGTGT
3011
+ GTGTTA
3012
+ GTGTTC
3013
+ GTGTTG
3014
+ GTGTTT
3015
+ GTTAAA
3016
+ GTTAAC
3017
+ GTTAAG
3018
+ GTTAAT
3019
+ GTTACA
3020
+ GTTACC
3021
+ GTTACG
3022
+ GTTACT
3023
+ GTTAGA
3024
+ GTTAGC
3025
+ GTTAGG
3026
+ GTTAGT
3027
+ GTTATA
3028
+ GTTATC
3029
+ GTTATG
3030
+ GTTATT
3031
+ GTTCAA
3032
+ GTTCAC
3033
+ GTTCAG
3034
+ GTTCAT
3035
+ GTTCCA
3036
+ GTTCCC
3037
+ GTTCCG
3038
+ GTTCCT
3039
+ GTTCGA
3040
+ GTTCGC
3041
+ GTTCGG
3042
+ GTTCGT
3043
+ GTTCTA
3044
+ GTTCTC
3045
+ GTTCTG
3046
+ GTTCTT
3047
+ GTTGAA
3048
+ GTTGAC
3049
+ GTTGAG
3050
+ GTTGAT
3051
+ GTTGCA
3052
+ GTTGCC
3053
+ GTTGCG
3054
+ GTTGCT
3055
+ GTTGGA
3056
+ GTTGGC
3057
+ GTTGGG
3058
+ GTTGGT
3059
+ GTTGTA
3060
+ GTTGTC
3061
+ GTTGTG
3062
+ GTTGTT
3063
+ GTTTAA
3064
+ GTTTAC
3065
+ GTTTAG
3066
+ GTTTAT
3067
+ GTTTCA
3068
+ GTTTCC
3069
+ GTTTCG
3070
+ GTTTCT
3071
+ GTTTGA
3072
+ GTTTGC
3073
+ GTTTGG
3074
+ GTTTGT
3075
+ GTTTTA
3076
+ GTTTTC
3077
+ GTTTTG
3078
+ GTTTTT
3079
+ TAAAAA
3080
+ TAAAAC
3081
+ TAAAAG
3082
+ TAAAAT
3083
+ TAAACA
3084
+ TAAACC
3085
+ TAAACG
3086
+ TAAACT
3087
+ TAAAGA
3088
+ TAAAGC
3089
+ TAAAGG
3090
+ TAAAGT
3091
+ TAAATA
3092
+ TAAATC
3093
+ TAAATG
3094
+ TAAATT
3095
+ TAACAA
3096
+ TAACAC
3097
+ TAACAG
3098
+ TAACAT
3099
+ TAACCA
3100
+ TAACCC
3101
+ TAACCG
3102
+ TAACCT
3103
+ TAACGA
3104
+ TAACGC
3105
+ TAACGG
3106
+ TAACGT
3107
+ TAACTA
3108
+ TAACTC
3109
+ TAACTG
3110
+ TAACTT
3111
+ TAAGAA
3112
+ TAAGAC
3113
+ TAAGAG
3114
+ TAAGAT
3115
+ TAAGCA
3116
+ TAAGCC
3117
+ TAAGCG
3118
+ TAAGCT
3119
+ TAAGGA
3120
+ TAAGGC
3121
+ TAAGGG
3122
+ TAAGGT
3123
+ TAAGTA
3124
+ TAAGTC
3125
+ TAAGTG
3126
+ TAAGTT
3127
+ TAATAA
3128
+ TAATAC
3129
+ TAATAG
3130
+ TAATAT
3131
+ TAATCA
3132
+ TAATCC
3133
+ TAATCG
3134
+ TAATCT
3135
+ TAATGA
3136
+ TAATGC
3137
+ TAATGG
3138
+ TAATGT
3139
+ TAATTA
3140
+ TAATTC
3141
+ TAATTG
3142
+ TAATTT
3143
+ TACAAA
3144
+ TACAAC
3145
+ TACAAG
3146
+ TACAAT
3147
+ TACACA
3148
+ TACACC
3149
+ TACACG
3150
+ TACACT
3151
+ TACAGA
3152
+ TACAGC
3153
+ TACAGG
3154
+ TACAGT
3155
+ TACATA
3156
+ TACATC
3157
+ TACATG
3158
+ TACATT
3159
+ TACCAA
3160
+ TACCAC
3161
+ TACCAG
3162
+ TACCAT
3163
+ TACCCA
3164
+ TACCCC
3165
+ TACCCG
3166
+ TACCCT
3167
+ TACCGA
3168
+ TACCGC
3169
+ TACCGG
3170
+ TACCGT
3171
+ TACCTA
3172
+ TACCTC
3173
+ TACCTG
3174
+ TACCTT
3175
+ TACGAA
3176
+ TACGAC
3177
+ TACGAG
3178
+ TACGAT
3179
+ TACGCA
3180
+ TACGCC
3181
+ TACGCG
3182
+ TACGCT
3183
+ TACGGA
3184
+ TACGGC
3185
+ TACGGG
3186
+ TACGGT
3187
+ TACGTA
3188
+ TACGTC
3189
+ TACGTG
3190
+ TACGTT
3191
+ TACTAA
3192
+ TACTAC
3193
+ TACTAG
3194
+ TACTAT
3195
+ TACTCA
3196
+ TACTCC
3197
+ TACTCG
3198
+ TACTCT
3199
+ TACTGA
3200
+ TACTGC
3201
+ TACTGG
3202
+ TACTGT
3203
+ TACTTA
3204
+ TACTTC
3205
+ TACTTG
3206
+ TACTTT
3207
+ TAGAAA
3208
+ TAGAAC
3209
+ TAGAAG
3210
+ TAGAAT
3211
+ TAGACA
3212
+ TAGACC
3213
+ TAGACG
3214
+ TAGACT
3215
+ TAGAGA
3216
+ TAGAGC
3217
+ TAGAGG
3218
+ TAGAGT
3219
+ TAGATA
3220
+ TAGATC
3221
+ TAGATG
3222
+ TAGATT
3223
+ TAGCAA
3224
+ TAGCAC
3225
+ TAGCAG
3226
+ TAGCAT
3227
+ TAGCCA
3228
+ TAGCCC
3229
+ TAGCCG
3230
+ TAGCCT
3231
+ TAGCGA
3232
+ TAGCGC
3233
+ TAGCGG
3234
+ TAGCGT
3235
+ TAGCTA
3236
+ TAGCTC
3237
+ TAGCTG
3238
+ TAGCTT
3239
+ TAGGAA
3240
+ TAGGAC
3241
+ TAGGAG
3242
+ TAGGAT
3243
+ TAGGCA
3244
+ TAGGCC
3245
+ TAGGCG
3246
+ TAGGCT
3247
+ TAGGGA
3248
+ TAGGGC
3249
+ TAGGGG
3250
+ TAGGGT
3251
+ TAGGTA
3252
+ TAGGTC
3253
+ TAGGTG
3254
+ TAGGTT
3255
+ TAGTAA
3256
+ TAGTAC
3257
+ TAGTAG
3258
+ TAGTAT
3259
+ TAGTCA
3260
+ TAGTCC
3261
+ TAGTCG
3262
+ TAGTCT
3263
+ TAGTGA
3264
+ TAGTGC
3265
+ TAGTGG
3266
+ TAGTGT
3267
+ TAGTTA
3268
+ TAGTTC
3269
+ TAGTTG
3270
+ TAGTTT
3271
+ TATAAA
3272
+ TATAAC
3273
+ TATAAG
3274
+ TATAAT
3275
+ TATACA
3276
+ TATACC
3277
+ TATACG
3278
+ TATACT
3279
+ TATAGA
3280
+ TATAGC
3281
+ TATAGG
3282
+ TATAGT
3283
+ TATATA
3284
+ TATATC
3285
+ TATATG
3286
+ TATATT
3287
+ TATCAA
3288
+ TATCAC
3289
+ TATCAG
3290
+ TATCAT
3291
+ TATCCA
3292
+ TATCCC
3293
+ TATCCG
3294
+ TATCCT
3295
+ TATCGA
3296
+ TATCGC
3297
+ TATCGG
3298
+ TATCGT
3299
+ TATCTA
3300
+ TATCTC
3301
+ TATCTG
3302
+ TATCTT
3303
+ TATGAA
3304
+ TATGAC
3305
+ TATGAG
3306
+ TATGAT
3307
+ TATGCA
3308
+ TATGCC
3309
+ TATGCG
3310
+ TATGCT
3311
+ TATGGA
3312
+ TATGGC
3313
+ TATGGG
3314
+ TATGGT
3315
+ TATGTA
3316
+ TATGTC
3317
+ TATGTG
3318
+ TATGTT
3319
+ TATTAA
3320
+ TATTAC
3321
+ TATTAG
3322
+ TATTAT
3323
+ TATTCA
3324
+ TATTCC
3325
+ TATTCG
3326
+ TATTCT
3327
+ TATTGA
3328
+ TATTGC
3329
+ TATTGG
3330
+ TATTGT
3331
+ TATTTA
3332
+ TATTTC
3333
+ TATTTG
3334
+ TATTTT
3335
+ TCAAAA
3336
+ TCAAAC
3337
+ TCAAAG
3338
+ TCAAAT
3339
+ TCAACA
3340
+ TCAACC
3341
+ TCAACG
3342
+ TCAACT
3343
+ TCAAGA
3344
+ TCAAGC
3345
+ TCAAGG
3346
+ TCAAGT
3347
+ TCAATA
3348
+ TCAATC
3349
+ TCAATG
3350
+ TCAATT
3351
+ TCACAA
3352
+ TCACAC
3353
+ TCACAG
3354
+ TCACAT
3355
+ TCACCA
3356
+ TCACCC
3357
+ TCACCG
3358
+ TCACCT
3359
+ TCACGA
3360
+ TCACGC
3361
+ TCACGG
3362
+ TCACGT
3363
+ TCACTA
3364
+ TCACTC
3365
+ TCACTG
3366
+ TCACTT
3367
+ TCAGAA
3368
+ TCAGAC
3369
+ TCAGAG
3370
+ TCAGAT
3371
+ TCAGCA
3372
+ TCAGCC
3373
+ TCAGCG
3374
+ TCAGCT
3375
+ TCAGGA
3376
+ TCAGGC
3377
+ TCAGGG
3378
+ TCAGGT
3379
+ TCAGTA
3380
+ TCAGTC
3381
+ TCAGTG
3382
+ TCAGTT
3383
+ TCATAA
3384
+ TCATAC
3385
+ TCATAG
3386
+ TCATAT
3387
+ TCATCA
3388
+ TCATCC
3389
+ TCATCG
3390
+ TCATCT
3391
+ TCATGA
3392
+ TCATGC
3393
+ TCATGG
3394
+ TCATGT
3395
+ TCATTA
3396
+ TCATTC
3397
+ TCATTG
3398
+ TCATTT
3399
+ TCCAAA
3400
+ TCCAAC
3401
+ TCCAAG
3402
+ TCCAAT
3403
+ TCCACA
3404
+ TCCACC
3405
+ TCCACG
3406
+ TCCACT
3407
+ TCCAGA
3408
+ TCCAGC
3409
+ TCCAGG
3410
+ TCCAGT
3411
+ TCCATA
3412
+ TCCATC
3413
+ TCCATG
3414
+ TCCATT
3415
+ TCCCAA
3416
+ TCCCAC
3417
+ TCCCAG
3418
+ TCCCAT
3419
+ TCCCCA
3420
+ TCCCCC
3421
+ TCCCCG
3422
+ TCCCCT
3423
+ TCCCGA
3424
+ TCCCGC
3425
+ TCCCGG
3426
+ TCCCGT
3427
+ TCCCTA
3428
+ TCCCTC
3429
+ TCCCTG
3430
+ TCCCTT
3431
+ TCCGAA
3432
+ TCCGAC
3433
+ TCCGAG
3434
+ TCCGAT
3435
+ TCCGCA
3436
+ TCCGCC
3437
+ TCCGCG
3438
+ TCCGCT
3439
+ TCCGGA
3440
+ TCCGGC
3441
+ TCCGGG
3442
+ TCCGGT
3443
+ TCCGTA
3444
+ TCCGTC
3445
+ TCCGTG
3446
+ TCCGTT
3447
+ TCCTAA
3448
+ TCCTAC
3449
+ TCCTAG
3450
+ TCCTAT
3451
+ TCCTCA
3452
+ TCCTCC
3453
+ TCCTCG
3454
+ TCCTCT
3455
+ TCCTGA
3456
+ TCCTGC
3457
+ TCCTGG
3458
+ TCCTGT
3459
+ TCCTTA
3460
+ TCCTTC
3461
+ TCCTTG
3462
+ TCCTTT
3463
+ TCGAAA
3464
+ TCGAAC
3465
+ TCGAAG
3466
+ TCGAAT
3467
+ TCGACA
3468
+ TCGACC
3469
+ TCGACG
3470
+ TCGACT
3471
+ TCGAGA
3472
+ TCGAGC
3473
+ TCGAGG
3474
+ TCGAGT
3475
+ TCGATA
3476
+ TCGATC
3477
+ TCGATG
3478
+ TCGATT
3479
+ TCGCAA
3480
+ TCGCAC
3481
+ TCGCAG
3482
+ TCGCAT
3483
+ TCGCCA
3484
+ TCGCCC
3485
+ TCGCCG
3486
+ TCGCCT
3487
+ TCGCGA
3488
+ TCGCGC
3489
+ TCGCGG
3490
+ TCGCGT
3491
+ TCGCTA
3492
+ TCGCTC
3493
+ TCGCTG
3494
+ TCGCTT
3495
+ TCGGAA
3496
+ TCGGAC
3497
+ TCGGAG
3498
+ TCGGAT
3499
+ TCGGCA
3500
+ TCGGCC
3501
+ TCGGCG
3502
+ TCGGCT
3503
+ TCGGGA
3504
+ TCGGGC
3505
+ TCGGGG
3506
+ TCGGGT
3507
+ TCGGTA
3508
+ TCGGTC
3509
+ TCGGTG
3510
+ TCGGTT
3511
+ TCGTAA
3512
+ TCGTAC
3513
+ TCGTAG
3514
+ TCGTAT
3515
+ TCGTCA
3516
+ TCGTCC
3517
+ TCGTCG
3518
+ TCGTCT
3519
+ TCGTGA
3520
+ TCGTGC
3521
+ TCGTGG
3522
+ TCGTGT
3523
+ TCGTTA
3524
+ TCGTTC
3525
+ TCGTTG
3526
+ TCGTTT
3527
+ TCTAAA
3528
+ TCTAAC
3529
+ TCTAAG
3530
+ TCTAAT
3531
+ TCTACA
3532
+ TCTACC
3533
+ TCTACG
3534
+ TCTACT
3535
+ TCTAGA
3536
+ TCTAGC
3537
+ TCTAGG
3538
+ TCTAGT
3539
+ TCTATA
3540
+ TCTATC
3541
+ TCTATG
3542
+ TCTATT
3543
+ TCTCAA
3544
+ TCTCAC
3545
+ TCTCAG
3546
+ TCTCAT
3547
+ TCTCCA
3548
+ TCTCCC
3549
+ TCTCCG
3550
+ TCTCCT
3551
+ TCTCGA
3552
+ TCTCGC
3553
+ TCTCGG
3554
+ TCTCGT
3555
+ TCTCTA
3556
+ TCTCTC
3557
+ TCTCTG
3558
+ TCTCTT
3559
+ TCTGAA
3560
+ TCTGAC
3561
+ TCTGAG
3562
+ TCTGAT
3563
+ TCTGCA
3564
+ TCTGCC
3565
+ TCTGCG
3566
+ TCTGCT
3567
+ TCTGGA
3568
+ TCTGGC
3569
+ TCTGGG
3570
+ TCTGGT
3571
+ TCTGTA
3572
+ TCTGTC
3573
+ TCTGTG
3574
+ TCTGTT
3575
+ TCTTAA
3576
+ TCTTAC
3577
+ TCTTAG
3578
+ TCTTAT
3579
+ TCTTCA
3580
+ TCTTCC
3581
+ TCTTCG
3582
+ TCTTCT
3583
+ TCTTGA
3584
+ TCTTGC
3585
+ TCTTGG
3586
+ TCTTGT
3587
+ TCTTTA
3588
+ TCTTTC
3589
+ TCTTTG
3590
+ TCTTTT
3591
+ TGAAAA
3592
+ TGAAAC
3593
+ TGAAAG
3594
+ TGAAAT
3595
+ TGAACA
3596
+ TGAACC
3597
+ TGAACG
3598
+ TGAACT
3599
+ TGAAGA
3600
+ TGAAGC
3601
+ TGAAGG
3602
+ TGAAGT
3603
+ TGAATA
3604
+ TGAATC
3605
+ TGAATG
3606
+ TGAATT
3607
+ TGACAA
3608
+ TGACAC
3609
+ TGACAG
3610
+ TGACAT
3611
+ TGACCA
3612
+ TGACCC
3613
+ TGACCG
3614
+ TGACCT
3615
+ TGACGA
3616
+ TGACGC
3617
+ TGACGG
3618
+ TGACGT
3619
+ TGACTA
3620
+ TGACTC
3621
+ TGACTG
3622
+ TGACTT
3623
+ TGAGAA
3624
+ TGAGAC
3625
+ TGAGAG
3626
+ TGAGAT
3627
+ TGAGCA
3628
+ TGAGCC
3629
+ TGAGCG
3630
+ TGAGCT
3631
+ TGAGGA
3632
+ TGAGGC
3633
+ TGAGGG
3634
+ TGAGGT
3635
+ TGAGTA
3636
+ TGAGTC
3637
+ TGAGTG
3638
+ TGAGTT
3639
+ TGATAA
3640
+ TGATAC
3641
+ TGATAG
3642
+ TGATAT
3643
+ TGATCA
3644
+ TGATCC
3645
+ TGATCG
3646
+ TGATCT
3647
+ TGATGA
3648
+ TGATGC
3649
+ TGATGG
3650
+ TGATGT
3651
+ TGATTA
3652
+ TGATTC
3653
+ TGATTG
3654
+ TGATTT
3655
+ TGCAAA
3656
+ TGCAAC
3657
+ TGCAAG
3658
+ TGCAAT
3659
+ TGCACA
3660
+ TGCACC
3661
+ TGCACG
3662
+ TGCACT
3663
+ TGCAGA
3664
+ TGCAGC
3665
+ TGCAGG
3666
+ TGCAGT
3667
+ TGCATA
3668
+ TGCATC
3669
+ TGCATG
3670
+ TGCATT
3671
+ TGCCAA
3672
+ TGCCAC
3673
+ TGCCAG
3674
+ TGCCAT
3675
+ TGCCCA
3676
+ TGCCCC
3677
+ TGCCCG
3678
+ TGCCCT
3679
+ TGCCGA
3680
+ TGCCGC
3681
+ TGCCGG
3682
+ TGCCGT
3683
+ TGCCTA
3684
+ TGCCTC
3685
+ TGCCTG
3686
+ TGCCTT
3687
+ TGCGAA
3688
+ TGCGAC
3689
+ TGCGAG
3690
+ TGCGAT
3691
+ TGCGCA
3692
+ TGCGCC
3693
+ TGCGCG
3694
+ TGCGCT
3695
+ TGCGGA
3696
+ TGCGGC
3697
+ TGCGGG
3698
+ TGCGGT
3699
+ TGCGTA
3700
+ TGCGTC
3701
+ TGCGTG
3702
+ TGCGTT
3703
+ TGCTAA
3704
+ TGCTAC
3705
+ TGCTAG
3706
+ TGCTAT
3707
+ TGCTCA
3708
+ TGCTCC
3709
+ TGCTCG
3710
+ TGCTCT
3711
+ TGCTGA
3712
+ TGCTGC
3713
+ TGCTGG
3714
+ TGCTGT
3715
+ TGCTTA
3716
+ TGCTTC
3717
+ TGCTTG
3718
+ TGCTTT
3719
+ TGGAAA
3720
+ TGGAAC
3721
+ TGGAAG
3722
+ TGGAAT
3723
+ TGGACA
3724
+ TGGACC
3725
+ TGGACG
3726
+ TGGACT
3727
+ TGGAGA
3728
+ TGGAGC
3729
+ TGGAGG
3730
+ TGGAGT
3731
+ TGGATA
3732
+ TGGATC
3733
+ TGGATG
3734
+ TGGATT
3735
+ TGGCAA
3736
+ TGGCAC
3737
+ TGGCAG
3738
+ TGGCAT
3739
+ TGGCCA
3740
+ TGGCCC
3741
+ TGGCCG
3742
+ TGGCCT
3743
+ TGGCGA
3744
+ TGGCGC
3745
+ TGGCGG
3746
+ TGGCGT
3747
+ TGGCTA
3748
+ TGGCTC
3749
+ TGGCTG
3750
+ TGGCTT
3751
+ TGGGAA
3752
+ TGGGAC
3753
+ TGGGAG
3754
+ TGGGAT
3755
+ TGGGCA
3756
+ TGGGCC
3757
+ TGGGCG
3758
+ TGGGCT
3759
+ TGGGGA
3760
+ TGGGGC
3761
+ TGGGGG
3762
+ TGGGGT
3763
+ TGGGTA
3764
+ TGGGTC
3765
+ TGGGTG
3766
+ TGGGTT
3767
+ TGGTAA
3768
+ TGGTAC
3769
+ TGGTAG
3770
+ TGGTAT
3771
+ TGGTCA
3772
+ TGGTCC
3773
+ TGGTCG
3774
+ TGGTCT
3775
+ TGGTGA
3776
+ TGGTGC
3777
+ TGGTGG
3778
+ TGGTGT
3779
+ TGGTTA
3780
+ TGGTTC
3781
+ TGGTTG
3782
+ TGGTTT
3783
+ TGTAAA
3784
+ TGTAAC
3785
+ TGTAAG
3786
+ TGTAAT
3787
+ TGTACA
3788
+ TGTACC
3789
+ TGTACG
3790
+ TGTACT
3791
+ TGTAGA
3792
+ TGTAGC
3793
+ TGTAGG
3794
+ TGTAGT
3795
+ TGTATA
3796
+ TGTATC
3797
+ TGTATG
3798
+ TGTATT
3799
+ TGTCAA
3800
+ TGTCAC
3801
+ TGTCAG
3802
+ TGTCAT
3803
+ TGTCCA
3804
+ TGTCCC
3805
+ TGTCCG
3806
+ TGTCCT
3807
+ TGTCGA
3808
+ TGTCGC
3809
+ TGTCGG
3810
+ TGTCGT
3811
+ TGTCTA
3812
+ TGTCTC
3813
+ TGTCTG
3814
+ TGTCTT
3815
+ TGTGAA
3816
+ TGTGAC
3817
+ TGTGAG
3818
+ TGTGAT
3819
+ TGTGCA
3820
+ TGTGCC
3821
+ TGTGCG
3822
+ TGTGCT
3823
+ TGTGGA
3824
+ TGTGGC
3825
+ TGTGGG
3826
+ TGTGGT
3827
+ TGTGTA
3828
+ TGTGTC
3829
+ TGTGTG
3830
+ TGTGTT
3831
+ TGTTAA
3832
+ TGTTAC
3833
+ TGTTAG
3834
+ TGTTAT
3835
+ TGTTCA
3836
+ TGTTCC
3837
+ TGTTCG
3838
+ TGTTCT
3839
+ TGTTGA
3840
+ TGTTGC
3841
+ TGTTGG
3842
+ TGTTGT
3843
+ TGTTTA
3844
+ TGTTTC
3845
+ TGTTTG
3846
+ TGTTTT
3847
+ TTAAAA
3848
+ TTAAAC
3849
+ TTAAAG
3850
+ TTAAAT
3851
+ TTAACA
3852
+ TTAACC
3853
+ TTAACG
3854
+ TTAACT
3855
+ TTAAGA
3856
+ TTAAGC
3857
+ TTAAGG
3858
+ TTAAGT
3859
+ TTAATA
3860
+ TTAATC
3861
+ TTAATG
3862
+ TTAATT
3863
+ TTACAA
3864
+ TTACAC
3865
+ TTACAG
3866
+ TTACAT
3867
+ TTACCA
3868
+ TTACCC
3869
+ TTACCG
3870
+ TTACCT
3871
+ TTACGA
3872
+ TTACGC
3873
+ TTACGG
3874
+ TTACGT
3875
+ TTACTA
3876
+ TTACTC
3877
+ TTACTG
3878
+ TTACTT
3879
+ TTAGAA
3880
+ TTAGAC
3881
+ TTAGAG
3882
+ TTAGAT
3883
+ TTAGCA
3884
+ TTAGCC
3885
+ TTAGCG
3886
+ TTAGCT
3887
+ TTAGGA
3888
+ TTAGGC
3889
+ TTAGGG
3890
+ TTAGGT
3891
+ TTAGTA
3892
+ TTAGTC
3893
+ TTAGTG
3894
+ TTAGTT
3895
+ TTATAA
3896
+ TTATAC
3897
+ TTATAG
3898
+ TTATAT
3899
+ TTATCA
3900
+ TTATCC
3901
+ TTATCG
3902
+ TTATCT
3903
+ TTATGA
3904
+ TTATGC
3905
+ TTATGG
3906
+ TTATGT
3907
+ TTATTA
3908
+ TTATTC
3909
+ TTATTG
3910
+ TTATTT
3911
+ TTCAAA
3912
+ TTCAAC
3913
+ TTCAAG
3914
+ TTCAAT
3915
+ TTCACA
3916
+ TTCACC
3917
+ TTCACG
3918
+ TTCACT
3919
+ TTCAGA
3920
+ TTCAGC
3921
+ TTCAGG
3922
+ TTCAGT
3923
+ TTCATA
3924
+ TTCATC
3925
+ TTCATG
3926
+ TTCATT
3927
+ TTCCAA
3928
+ TTCCAC
3929
+ TTCCAG
3930
+ TTCCAT
3931
+ TTCCCA
3932
+ TTCCCC
3933
+ TTCCCG
3934
+ TTCCCT
3935
+ TTCCGA
3936
+ TTCCGC
3937
+ TTCCGG
3938
+ TTCCGT
3939
+ TTCCTA
3940
+ TTCCTC
3941
+ TTCCTG
3942
+ TTCCTT
3943
+ TTCGAA
3944
+ TTCGAC
3945
+ TTCGAG
3946
+ TTCGAT
3947
+ TTCGCA
3948
+ TTCGCC
3949
+ TTCGCG
3950
+ TTCGCT
3951
+ TTCGGA
3952
+ TTCGGC
3953
+ TTCGGG
3954
+ TTCGGT
3955
+ TTCGTA
3956
+ TTCGTC
3957
+ TTCGTG
3958
+ TTCGTT
3959
+ TTCTAA
3960
+ TTCTAC
3961
+ TTCTAG
3962
+ TTCTAT
3963
+ TTCTCA
3964
+ TTCTCC
3965
+ TTCTCG
3966
+ TTCTCT
3967
+ TTCTGA
3968
+ TTCTGC
3969
+ TTCTGG
3970
+ TTCTGT
3971
+ TTCTTA
3972
+ TTCTTC
3973
+ TTCTTG
3974
+ TTCTTT
3975
+ TTGAAA
3976
+ TTGAAC
3977
+ TTGAAG
3978
+ TTGAAT
3979
+ TTGACA
3980
+ TTGACC
3981
+ TTGACG
3982
+ TTGACT
3983
+ TTGAGA
3984
+ TTGAGC
3985
+ TTGAGG
3986
+ TTGAGT
3987
+ TTGATA
3988
+ TTGATC
3989
+ TTGATG
3990
+ TTGATT
3991
+ TTGCAA
3992
+ TTGCAC
3993
+ TTGCAG
3994
+ TTGCAT
3995
+ TTGCCA
3996
+ TTGCCC
3997
+ TTGCCG
3998
+ TTGCCT
3999
+ TTGCGA
4000
+ TTGCGC
4001
+ TTGCGG
4002
+ TTGCGT
4003
+ TTGCTA
4004
+ TTGCTC
4005
+ TTGCTG
4006
+ TTGCTT
4007
+ TTGGAA
4008
+ TTGGAC
4009
+ TTGGAG
4010
+ TTGGAT
4011
+ TTGGCA
4012
+ TTGGCC
4013
+ TTGGCG
4014
+ TTGGCT
4015
+ TTGGGA
4016
+ TTGGGC
4017
+ TTGGGG
4018
+ TTGGGT
4019
+ TTGGTA
4020
+ TTGGTC
4021
+ TTGGTG
4022
+ TTGGTT
4023
+ TTGTAA
4024
+ TTGTAC
4025
+ TTGTAG
4026
+ TTGTAT
4027
+ TTGTCA
4028
+ TTGTCC
4029
+ TTGTCG
4030
+ TTGTCT
4031
+ TTGTGA
4032
+ TTGTGC
4033
+ TTGTGG
4034
+ TTGTGT
4035
+ TTGTTA
4036
+ TTGTTC
4037
+ TTGTTG
4038
+ TTGTTT
4039
+ TTTAAA
4040
+ TTTAAC
4041
+ TTTAAG
4042
+ TTTAAT
4043
+ TTTACA
4044
+ TTTACC
4045
+ TTTACG
4046
+ TTTACT
4047
+ TTTAGA
4048
+ TTTAGC
4049
+ TTTAGG
4050
+ TTTAGT
4051
+ TTTATA
4052
+ TTTATC
4053
+ TTTATG
4054
+ TTTATT
4055
+ TTTCAA
4056
+ TTTCAC
4057
+ TTTCAG
4058
+ TTTCAT
4059
+ TTTCCA
4060
+ TTTCCC
4061
+ TTTCCG
4062
+ TTTCCT
4063
+ TTTCGA
4064
+ TTTCGC
4065
+ TTTCGG
4066
+ TTTCGT
4067
+ TTTCTA
4068
+ TTTCTC
4069
+ TTTCTG
4070
+ TTTCTT
4071
+ TTTGAA
4072
+ TTTGAC
4073
+ TTTGAG
4074
+ TTTGAT
4075
+ TTTGCA
4076
+ TTTGCC
4077
+ TTTGCG
4078
+ TTTGCT
4079
+ TTTGGA
4080
+ TTTGGC
4081
+ TTTGGG
4082
+ TTTGGT
4083
+ TTTGTA
4084
+ TTTGTC
4085
+ TTTGTG
4086
+ TTTGTT
4087
+ TTTTAA
4088
+ TTTTAC
4089
+ TTTTAG
4090
+ TTTTAT
4091
+ TTTTCA
4092
+ TTTTCC
4093
+ TTTTCG
4094
+ TTTTCT
4095
+ TTTTGA
4096
+ TTTTGC
4097
+ TTTTGG
4098
+ TTTTGT
4099
+ TTTTTA
4100
+ TTTTTC
4101
+ TTTTTG
4102
+ TTTTTT