mikemayuare commited on
Commit
8a20937
·
verified ·
1 Parent(s): bbfcbe1

Upload tokenizer

Browse files
Files changed (3) hide show
  1. special_tokens_map.json +37 -0
  2. tokenizer.json +4528 -0
  3. tokenizer_config.json +52 -0
special_tokens_map.json ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token": {
3
+ "content": "<s>",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "eos_token": {
10
+ "content": "</s>",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "mask_token": {
17
+ "content": "<mask>",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "pad_token": {
24
+ "content": "<pad>",
25
+ "lstrip": false,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "unk_token": {
31
+ "content": "<unk>",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ }
37
+ }
tokenizer.json ADDED
@@ -0,0 +1,4528 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "version": "1.0",
3
+ "truncation": null,
4
+ "padding": null,
5
+ "added_tokens": [
6
+ {
7
+ "id": 0,
8
+ "content": "<pad>",
9
+ "single_word": false,
10
+ "lstrip": false,
11
+ "rstrip": false,
12
+ "normalized": false,
13
+ "special": true
14
+ },
15
+ {
16
+ "id": 1,
17
+ "content": "<s>",
18
+ "single_word": false,
19
+ "lstrip": false,
20
+ "rstrip": false,
21
+ "normalized": false,
22
+ "special": true
23
+ },
24
+ {
25
+ "id": 2,
26
+ "content": "<unk>",
27
+ "single_word": false,
28
+ "lstrip": false,
29
+ "rstrip": false,
30
+ "normalized": false,
31
+ "special": true
32
+ },
33
+ {
34
+ "id": 3,
35
+ "content": "<mask>",
36
+ "single_word": false,
37
+ "lstrip": false,
38
+ "rstrip": false,
39
+ "normalized": false,
40
+ "special": true
41
+ },
42
+ {
43
+ "id": 2260,
44
+ "content": "</s>",
45
+ "single_word": false,
46
+ "lstrip": false,
47
+ "rstrip": false,
48
+ "normalized": false,
49
+ "special": true
50
+ }
51
+ ],
52
+ "normalizer": null,
53
+ "pre_tokenizer": null,
54
+ "post_processor": null,
55
+ "decoder": null,
56
+ "model": {
57
+ "type": "BPE",
58
+ "dropout": null,
59
+ "unk_token": "<unk>",
60
+ "continuing_subword_prefix": null,
61
+ "end_of_word_suffix": null,
62
+ "fuse_unk": false,
63
+ "byte_fallback": false,
64
+ "vocab": {
65
+ "<pad>": 0,
66
+ "<s>": 1,
67
+ "<unk>": 2,
68
+ "<mask>": 3,
69
+ "#": 4,
70
+ "%": 5,
71
+ "(": 6,
72
+ ")": 7,
73
+ "+": 8,
74
+ "-": 9,
75
+ "0": 10,
76
+ "1": 11,
77
+ "2": 12,
78
+ "3": 13,
79
+ "4": 14,
80
+ "5": 15,
81
+ "6": 16,
82
+ "7": 17,
83
+ "8": 18,
84
+ "9": 19,
85
+ "=": 20,
86
+ "A": 21,
87
+ "B": 22,
88
+ "C": 23,
89
+ "E": 24,
90
+ "F": 25,
91
+ "G": 26,
92
+ "H": 27,
93
+ "I": 28,
94
+ "M": 29,
95
+ "N": 30,
96
+ "O": 31,
97
+ "P": 32,
98
+ "R": 33,
99
+ "S": 34,
100
+ "T": 35,
101
+ "U": 36,
102
+ "V": 37,
103
+ "W": 38,
104
+ "X": 39,
105
+ "Y": 40,
106
+ "Z": 41,
107
+ "[": 42,
108
+ "]": 43,
109
+ "a": 44,
110
+ "b": 45,
111
+ "c": 46,
112
+ "d": 47,
113
+ "e": 48,
114
+ "g": 49,
115
+ "h": 50,
116
+ "i": 51,
117
+ "l": 52,
118
+ "m": 53,
119
+ "n": 54,
120
+ "o": 55,
121
+ "p": 56,
122
+ "r": 57,
123
+ "s": 58,
124
+ "t": 59,
125
+ "u": 60,
126
+ "cc": 61,
127
+ "CC": 62,
128
+ "(C": 63,
129
+ "c1": 64,
130
+ "O)": 65,
131
+ "=O)": 66,
132
+ "(=O)": 67,
133
+ "ccc": 68,
134
+ "(C)": 69,
135
+ "c2": 70,
136
+ "C(=O)": 71,
137
+ ")cc": 72,
138
+ "+]": 73,
139
+ "[N": 74,
140
+ "CCC": 75,
141
+ "c1cc": 76,
142
+ "[NH": 77,
143
+ "c1ccc": 78,
144
+ "c(": 79,
145
+ "C(": 80,
146
+ "c3": 81,
147
+ "2)": 82,
148
+ "F)": 83,
149
+ "C1": 84,
150
+ "CCCC": 85,
151
+ "c2cc": 86,
152
+ "OC": 87,
153
+ "c1cccc": 88,
154
+ "NC(=O)": 89,
155
+ ")cc1": 90,
156
+ "CC1": 91,
157
+ "(=O)N": 92,
158
+ "(C)C": 93,
159
+ "-]": 94,
160
+ "CO": 95,
161
+ "c1ccc(": 96,
162
+ "[O": 97,
163
+ "[O-]": 98,
164
+ "n1": 99,
165
+ "[NH+]": 100,
166
+ "c2ccc": 101,
167
+ "3)": 102,
168
+ "(Cl": 103,
169
+ "(F)": 104,
170
+ "c1ccccc1": 105,
171
+ "ccccc": 106,
172
+ "CCO": 107,
173
+ "C(=O)N": 108,
174
+ "2+]": 109,
175
+ "[NH2+]": 110,
176
+ "c2ccccc": 111,
177
+ "(CC": 112,
178
+ "C2": 113,
179
+ "[O-])": 114,
180
+ "cn": 115,
181
+ "c1n": 116,
182
+ "S(=O)": 117,
183
+ "[n": 118,
184
+ "N)": 119,
185
+ "O=": 120,
186
+ "CCN": 121,
187
+ "(C(=O)": 122,
188
+ "[nH": 123,
189
+ "(C(=O)N": 124,
190
+ "c4": 125,
191
+ "(Cl)": 126,
192
+ "Br": 127,
193
+ "CC(C)": 128,
194
+ "C(C)": 129,
195
+ "[nH]": 130,
196
+ "(C)C)": 131,
197
+ "CC(": 132,
198
+ "2)cc1": 133,
199
+ "c(C": 134,
200
+ "3+]": 135,
201
+ "[NH3+]": 136,
202
+ "c3ccc": 137,
203
+ "c2ccc(": 138,
204
+ "CN": 139,
205
+ "C(C": 140,
206
+ "c(C)": 141,
207
+ "c3ccccc": 142,
208
+ "Cl": 143,
209
+ "CCCCC": 144,
210
+ "C=": 145,
211
+ "cc(": 146,
212
+ "c2)": 147,
213
+ "c2n": 148,
214
+ "cc1": 149,
215
+ "OC)": 150,
216
+ "c2ccccc2": 151,
217
+ "O=C(": 152,
218
+ "c1cc(": 153,
219
+ "F)cc": 154,
220
+ "c1ccc(C": 155,
221
+ "CC(=O)N": 156,
222
+ ")N": 157,
223
+ "n2": 158,
224
+ "CC2": 159,
225
+ "[N+]": 160,
226
+ "2)c1": 161,
227
+ "C)": 162,
228
+ "[NH3+])": 163,
229
+ "CC[NH+]": 164,
230
+ "Br)": 165,
231
+ "4)": 166,
232
+ "c(N": 167,
233
+ "CCC(": 168,
234
+ "=O": 169,
235
+ "(Cl)cc": 170,
236
+ "(F)(F)": 171,
237
+ "c1)": 172,
238
+ "c(=O)": 173,
239
+ "c3cc": 174,
240
+ "[N+](=O)": 175,
241
+ "Cc1ccc(": 176,
242
+ "CC(=O)": 177,
243
+ "c2cccc": 178,
244
+ "c1ccc2": 179,
245
+ "c1cccc(": 180,
246
+ "CC2)": 181,
247
+ "N1": 182,
248
+ "C(F)": 183,
249
+ "C3": 184,
250
+ "s1": 185,
251
+ "c3ccccc3": 186,
252
+ "C[NH+]": 187,
253
+ "CCC1": 188,
254
+ "ccc2": 189,
255
+ "Cc1": 190,
256
+ "nc(": 191,
257
+ "nc1": 192,
258
+ "OCC": 193,
259
+ "Cc1cc": 194,
260
+ "CCCCCCCC": 195,
261
+ "C(O)": 196,
262
+ "N2": 197,
263
+ "=C": 198,
264
+ "c3ccc(": 199,
265
+ "OC(C)": 200,
266
+ "Cc1n": 201,
267
+ "c3)": 202,
268
+ "COC(=O)": 203,
269
+ "Cl)": 204,
270
+ "c(Cl)": 205,
271
+ "#N)": 206,
272
+ "C(F)(F)": 207,
273
+ "c5": 208,
274
+ "2)CC1": 209,
275
+ "(CC)": 210,
276
+ "OC(=O)": 211,
277
+ "(O)": 212,
278
+ "CC[NH2+]": 213,
279
+ "1)": 214,
280
+ "cc2": 215,
281
+ "=C(": 216,
282
+ "C[NH2+]": 217,
283
+ ")ccc1": 218,
284
+ "CCN(": 219,
285
+ "O=C(N": 220,
286
+ "F)cc1": 221,
287
+ "(F)(F)F)": 222,
288
+ "nn": 223,
289
+ "=N": 224,
290
+ ")cc2": 225,
291
+ "COc1ccc(": 226,
292
+ "c4ccccc": 227,
293
+ "2)C1": 228,
294
+ "CS": 229,
295
+ "CC(C)(C)": 230,
296
+ "CCCC1": 231,
297
+ "c(F)": 232,
298
+ "c1cn": 233,
299
+ "CCOC(=O)": 234,
300
+ "c2cc(": 235,
301
+ "CCCN": 236,
302
+ "CCC(C)": 237,
303
+ "CC3)": 238,
304
+ "nc2": 239,
305
+ "NC(=O)N": 240,
306
+ "C(C)C": 241,
307
+ "=S": 242,
308
+ "c4ccc": 243,
309
+ "CC(O)": 244,
310
+ "CC3": 245,
311
+ "o1": 246,
312
+ "cs": 247,
313
+ "CCCO": 248,
314
+ "CCC2": 249,
315
+ "(C(C)": 250,
316
+ "(Cl)cc1": 251,
317
+ "c1ccc2c(": 252,
318
+ "cn1": 253,
319
+ "CC(C": 254,
320
+ "C(=O)N1": 255,
321
+ "(N)": 256,
322
+ "c2c(": 257,
323
+ "[S": 258,
324
+ "Cn1": 259,
325
+ "=[NH+]": 260,
326
+ "Cc1ccc": 261,
327
+ "CCCCC1": 262,
328
+ "n3": 263,
329
+ "Cc1cc(": 264,
330
+ "O=C(C": 265,
331
+ "c2c1": 266,
332
+ "ncc": 267,
333
+ "c1cc(C": 268,
334
+ "2)n1": 269,
335
+ "c1cccc(C": 270,
336
+ "CCC(C": 271,
337
+ "c2ccc3": 272,
338
+ "CC)": 273,
339
+ "c2cn": 274,
340
+ "c(C(=O)N": 275,
341
+ "c12": 276,
342
+ ")N1": 277,
343
+ "[nH+]": 278,
344
+ "[Si": 279,
345
+ "(CC(=O)N": 280,
346
+ "c3cccc": 281,
347
+ "ccc3": 282,
348
+ "CNC(=O)": 283,
349
+ "[NH+]1": 284,
350
+ "CC=": 285,
351
+ ")cc(": 286,
352
+ "CC(C)C": 287,
353
+ "OC1": 288,
354
+ "n(": 289,
355
+ "c2cccc(": 290,
356
+ "[Si]": 291,
357
+ "[NH+]2": 292,
358
+ "OC2": 293,
359
+ "CC1)": 294,
360
+ "c4ccccc4": 295,
361
+ "CCn1": 296,
362
+ "cccc": 297,
363
+ "c2ccc(C": 298,
364
+ "c(C)c1": 299,
365
+ "(C)cc": 300,
366
+ "N#": 301,
367
+ ")cc2)": 302,
368
+ "CCNC(=O)": 303,
369
+ "c1c(": 304,
370
+ "CC2)cc1": 305,
371
+ "CCS": 306,
372
+ "3)n": 307,
373
+ "OC(C": 308,
374
+ "=O)cc1": 309,
375
+ "c1cc2": 310,
376
+ "c2)cc1": 311,
377
+ "n(C)": 312,
378
+ "5)": 313,
379
+ "NS(=O)": 314,
380
+ "NC(=O)C(": 315,
381
+ "c1C": 316,
382
+ "c[nH]": 317,
383
+ "NC(": 318,
384
+ "([O-])": 319,
385
+ "c3n": 320,
386
+ "(C)C(=O)": 321,
387
+ "c(OC)": 322,
388
+ "#N": 323,
389
+ ")cc3)": 324,
390
+ "CCCC2": 325,
391
+ "CN1": 326,
392
+ "c(N)": 327,
393
+ "Cc1ccc(C": 328,
394
+ "(C)(=O)": 329,
395
+ "C(C)C)": 330,
396
+ "c6": 331,
397
+ "O=C1": 332,
398
+ "nc(N": 333,
399
+ "C[NH+]1": 334,
400
+ ")cc3": 335,
401
+ ")C(=O)": 336,
402
+ "c(C(=O)": 337,
403
+ "C2)": 338,
404
+ "CC2)c1": 339,
405
+ "c1cccc2": 340,
406
+ "Br)cc1": 341,
407
+ "N(": 342,
408
+ "cc(C": 343,
409
+ "C1CC1": 344,
410
+ "S(C)(=O)": 345,
411
+ "nc(C": 346,
412
+ "CCC(=O)": 347,
413
+ "ccc(": 348,
414
+ "CCC(=O)N": 349,
415
+ "[O-])cc1": 350,
416
+ "c(NC(=O)": 351,
417
+ "C(N)": 352,
418
+ "CN(": 353,
419
+ "CCN1": 354,
420
+ "c1ccc(N": 355,
421
+ "c3c(": 356,
422
+ "C4": 357,
423
+ "[O-])c1": 358,
424
+ "OC(": 359,
425
+ "c1ccc(C)": 360,
426
+ "(C(=O)N2": 361,
427
+ ")cc2)cc1": 362,
428
+ "[nH]1": 363,
429
+ "c(Cl)cc": 364,
430
+ "OCCO": 365,
431
+ "C1=O": 366,
432
+ "Cc1cccc(": 367,
433
+ "=C2": 368,
434
+ "n(C": 369,
435
+ "CCC[NH+]": 370,
436
+ "CCCC(": 371,
437
+ "=S)": 372,
438
+ "O1": 373,
439
+ "nn1": 374,
440
+ "CCC3": 375,
441
+ "Br)c1": 376,
442
+ "NC(=O)C1": 377,
443
+ "[Si](C)": 378,
444
+ "(CC(=O)": 379,
445
+ "cc3": 380,
446
+ "OCO": 381,
447
+ ")C1": 382,
448
+ "c4ccc(": 383,
449
+ "N1C(=O)": 384,
450
+ "n2)": 385,
451
+ "c2)c1": 386,
452
+ "C(C)(C)C": 387,
453
+ "nc3": 388,
454
+ "OCC(=O)N": 389,
455
+ "c2ccc3c(": 390,
456
+ "c4)": 391,
457
+ "=S)N": 392,
458
+ "Nc1n": 393,
459
+ "Cc1cn": 394,
460
+ "c5ccccc": 395,
461
+ "NC(=O)C2": 396,
462
+ "(N)=O)": 397,
463
+ "CCS(=O)": 398,
464
+ "F)cc2": 399,
465
+ "P(=O)": 400,
466
+ "ccccc2": 401,
467
+ "(Cl)c1": 402,
468
+ "O)cc1": 403,
469
+ "c1ccc(C2": 404,
470
+ "CCc1n": 405,
471
+ "C(C)(C)": 406,
472
+ "c(Cl)c1": 407,
473
+ "c2ccc(N": 408,
474
+ "C(N": 409,
475
+ "ncn": 410,
476
+ "(C2": 411,
477
+ "c(S": 412,
478
+ "c3cc(": 413,
479
+ "(CCC": 414,
480
+ "C#": 415,
481
+ "c(F)c1": 416,
482
+ "c2s": 417,
483
+ "3)C2": 418,
484
+ "CS(=O)": 419,
485
+ "CCOCC1": 420,
486
+ "CC1(C)": 421,
487
+ "OCC)": 422,
488
+ "CN(C(=O)": 423,
489
+ "c(O)": 424,
490
+ "ncc1": 425,
491
+ "ccc1": 426,
492
+ "COc1cc(": 427,
493
+ "3CCCC": 428,
494
+ "Cc1cc(C)": 429,
495
+ "N2C(=O)": 430,
496
+ "CC(CC": 431,
497
+ "CC[NH+]1": 432,
498
+ "c1=O": 433,
499
+ "N=": 434,
500
+ "C3)": 435,
501
+ "cs1": 436,
502
+ "n3)": 437,
503
+ "c3ccc4": 438,
504
+ "I)": 439,
505
+ "c2cc3": 440,
506
+ "CC(C)C)": 441,
507
+ "CC4": 442,
508
+ "C)cc1": 443,
509
+ "c2nc(": 444,
510
+ "s2)": 445,
511
+ "C(F)(F)F": 446,
512
+ "C=C": 447,
513
+ "C(=O)NC(": 448,
514
+ "c(C2": 449,
515
+ "c2)CC1": 450,
516
+ "c1ncc": 451,
517
+ "(C)C1": 452,
518
+ "(CO)": 453,
519
+ "CC(=O)N1": 454,
520
+ "(C)c1": 455,
521
+ "CCC(O)": 456,
522
+ "c4cc": 457,
523
+ "C(=O)N2": 458,
524
+ "sc1": 459,
525
+ "([NH3+])": 460,
526
+ "COC1": 461,
527
+ "[O-])C1": 462,
528
+ "OC)cc1": 463,
529
+ "c1ccc(O": 464,
530
+ "C(=O)N(": 465,
531
+ "COc1ccc": 466,
532
+ "(=O)N2": 467,
533
+ "Cc1cccc": 468,
534
+ "(C)C)cc1": 469,
535
+ "n1)": 470,
536
+ "3)cc1": 471,
537
+ "=C(N": 472,
538
+ "l)": 473,
539
+ "CCC=": 474,
540
+ "(F)c1": 475,
541
+ "c(C)cc": 476,
542
+ "c2ncc": 477,
543
+ "(Cl)cc2": 478,
544
+ "(C#N)": 479,
545
+ "OC3": 480,
546
+ "n2)cc1": 481,
547
+ "ccc21": 482,
548
+ "c1s": 483,
549
+ "(C)C(C)": 484,
550
+ "(C(=O)NC": 485,
551
+ "CN(C)": 486,
552
+ "[NH2+]C": 487,
553
+ "OC)c1": 488,
554
+ "C(C#N)": 489,
555
+ "c1nc(": 490,
556
+ "CC4)": 491,
557
+ "COc1cc": 492,
558
+ "(N": 493,
559
+ "CCCCC2": 494,
560
+ "C1=": 495,
561
+ "F)cc2)": 496,
562
+ "C1)": 497,
563
+ "s1)": 498,
564
+ "nc(C)": 499,
565
+ "ccccc3": 500,
566
+ "=O)c1": 501,
567
+ "COC": 502,
568
+ "o2)": 503,
569
+ "COc1cc(C": 504,
570
+ "c2cc(Cl)": 505,
571
+ "CCOCCO": 506,
572
+ "CCCCO": 507,
573
+ "c3cccc(": 508,
574
+ "CCN(CC)": 509,
575
+ "c2ccc(OC": 510,
576
+ "c(C(C)": 511,
577
+ "N(C": 512,
578
+ "N(C)": 513,
579
+ "F)ccc1": 514,
580
+ "C(CO)": 515,
581
+ "N(C(=O)": 516,
582
+ "[NH2+]C1": 517,
583
+ "c7": 518,
584
+ "OC(C)=O)": 519,
585
+ "c3cn": 520,
586
+ "n4": 521,
587
+ "CCN(C": 522,
588
+ "CN(C": 523,
589
+ "(CCO)": 524,
590
+ "SC": 525,
591
+ "c5ccccc5": 526,
592
+ "=C1": 527,
593
+ "c1cs": 528,
594
+ "c1ccc(F)": 529,
595
+ "oc(": 530,
596
+ "(C)C2": 531,
597
+ "C(C(=O)": 532,
598
+ "c(N2": 533,
599
+ "CCOCC2)": 534,
600
+ "CCc1ccc(": 535,
601
+ "(CCCC": 536,
602
+ "c2cc(C": 537,
603
+ "c2ccc(Br": 538,
604
+ "c3ccc(C": 539,
605
+ "[nH]c(": 540,
606
+ "3)CC2)": 541,
607
+ "NC(=O)C": 542,
608
+ "OCCC": 543,
609
+ "(c2ccccc": 544,
610
+ "no1": 545,
611
+ "(=O)N1": 546,
612
+ "c(OC)c1": 547,
613
+ "c2cc(C)": 548,
614
+ "ccc4": 549,
615
+ "C(O)C(O)": 550,
616
+ "CCOC1": 551,
617
+ "OCC1": 552,
618
+ "c2ccc(C)": 553,
619
+ "c1cc2c(": 554,
620
+ "c2cccc3": 555,
621
+ "O)cc": 556,
622
+ "o1)": 557,
623
+ "=O)cc": 558,
624
+ "c[nH+]": 559,
625
+ "CCOC": 560,
626
+ "O=S(=O)": 561,
627
+ "CCCC(C)": 562,
628
+ "N=C": 563,
629
+ "CCCn1": 564,
630
+ "3CCO": 565,
631
+ "c4cccc": 566,
632
+ "c2C)": 567,
633
+ "c2ncn": 568,
634
+ "C(=": 569,
635
+ "c(Br)": 570,
636
+ "CCC4": 571,
637
+ "c2cs": 572,
638
+ "c2cccc(C": 573,
639
+ "c(O": 574,
640
+ "[n+]": 575,
641
+ "CCCCC2)": 576,
642
+ "(C)C)c1": 577,
643
+ "C1CCCCC1": 578,
644
+ "F)cc3)": 579,
645
+ ")C(=O)N": 580,
646
+ "Cc1cc(C": 581,
647
+ "(Cl)ccc1": 582,
648
+ "CCCC2)": 583,
649
+ "c1nnc(": 584,
650
+ "c12)": 585,
651
+ "nc2)": 586,
652
+ "C(=C": 587,
653
+ "c1c(C)": 588,
654
+ "(Cl)cc2)": 589,
655
+ "ccccc6": 590,
656
+ "C12": 591,
657
+ "%1": 592,
658
+ "C(NC(=O)": 593,
659
+ "OC(CO)": 594,
660
+ "(CC2": 595,
661
+ "c2cc(F)": 596,
662
+ "c1ccc(N2": 597,
663
+ "o2)cc1": 598,
664
+ "c1cccs1": 599,
665
+ "[O-])cc": 600,
666
+ "C2=O)": 601,
667
+ "c1cccnc1": 602,
668
+ "=C(N)": 603,
669
+ "C=C1": 604,
670
+ "c1cc(N": 605,
671
+ "3CCCCC": 606,
672
+ "CCC)": 607,
673
+ "C(=S)N": 608,
674
+ "c(C#N)": 609,
675
+ "c21": 610,
676
+ "[N-]": 611,
677
+ "CCO)": 612,
678
+ "n2cc": 613,
679
+ "c(S(=O)": 614,
680
+ "CCCN(": 615,
681
+ "C(C(=O)N": 616,
682
+ "c1ncn": 617,
683
+ "n1C": 618,
684
+ "c2ccc(F)": 619,
685
+ "C[NH+]2": 620,
686
+ "NC(=O)CS": 621,
687
+ "c2nnc(": 622,
688
+ "(O": 623,
689
+ "n2cn": 624,
690
+ "(C(C)C)": 625,
691
+ "c3ccccc2": 626,
692
+ "n2)c1": 627,
693
+ "[NH2+]C2": 628,
694
+ "Cc1cc2": 629,
695
+ "N)=O)": 630,
696
+ "s3)": 631,
697
+ "OC(=O)N": 632,
698
+ "C1CCC": 633,
699
+ "F)cc3": 634,
700
+ "CCCCC1)": 635,
701
+ "Oc1ccc(": 636,
702
+ "(C)C)cc": 637,
703
+ "NC(C)": 638,
704
+ "CN1C(=O)": 639,
705
+ "=[NH2+]": 640,
706
+ "C1O": 641,
707
+ "c(OC": 642,
708
+ "c6ccccc6": 643,
709
+ "S1": 644,
710
+ "CC1(": 645,
711
+ "SCC(=O)N": 646,
712
+ "c1[nH]": 647,
713
+ "c2)C1": 648,
714
+ "c2c(C)": 649,
715
+ "=CC(=O)": 650,
716
+ "c3ccc4c(": 651,
717
+ "OC(F)(F)": 652,
718
+ "(N)(=O)": 653,
719
+ "CCC(CC)": 654,
720
+ "c1c[nH]": 655,
721
+ "co": 656,
722
+ "CC(C(=O)": 657,
723
+ "CO)": 658,
724
+ "no": 659,
725
+ "CCN(CC": 660,
726
+ "s2)cc1": 661,
727
+ "OCCCC": 662,
728
+ "C(=O)OC": 663,
729
+ "c2n1": 664,
730
+ "C2)cc1": 665,
731
+ "F)c1": 666,
732
+ "nc12": 667,
733
+ "Br)cc": 668,
734
+ "NC1": 669,
735
+ "CCNC(N": 670,
736
+ "3)CC1": 671,
737
+ "c2)n1": 672,
738
+ "c2[nH]": 673,
739
+ "C=C(": 674,
740
+ "3CCOCC3)": 675,
741
+ "=C(O)": 676,
742
+ "ncc2": 677,
743
+ "C#N)": 678,
744
+ "c1ccncc1": 679,
745
+ "c(Br)c1": 680,
746
+ "CCCCCC1": 681,
747
+ ")cc21": 682,
748
+ "-2": 683,
749
+ "C2)c1": 684,
750
+ "Cc1cs": 685,
751
+ "N3": 686,
752
+ "O=[N+]": 687,
753
+ "Br)ccc1": 688,
754
+ "c2=O)": 689,
755
+ "Cc1ccc2": 690,
756
+ "Cc2ccc": 691,
757
+ "NC(=O)CO": 692,
758
+ "C1CCCC1": 693,
759
+ "3)c1": 694,
760
+ "c(F)c(F)": 695,
761
+ "C[NH+](C": 696,
762
+ "C)c1": 697,
763
+ "c1cc(C)": 698,
764
+ "C#N": 699,
765
+ "NC(=S)N": 700,
766
+ "F)cc1)": 701,
767
+ "nnc1": 702,
768
+ "CC(C)O": 703,
769
+ "c5ccc": 704,
770
+ "O)c1": 705,
771
+ ")cc2)c1": 706,
772
+ "S(N)(=O)": 707,
773
+ "CC2)CC1": 708,
774
+ "C5": 709,
775
+ "CC#": 710,
776
+ "4)CC3)": 711,
777
+ "O[Si](C)": 712,
778
+ "CCCC(C": 713,
779
+ "CCN2": 714,
780
+ "CC12": 715,
781
+ "c1c(F)cc": 716,
782
+ "nn2": 717,
783
+ "COc1ccc2": 718,
784
+ "CC(N": 719,
785
+ "c2nc(C": 720,
786
+ "O=C(NC1": 721,
787
+ "C=C(C)": 722,
788
+ "cc(N": 723,
789
+ "n(CC": 724,
790
+ "3CC3)": 725,
791
+ "n2)CC1": 726,
792
+ "oc(C": 727,
793
+ "nc3)": 728,
794
+ "c4c(": 729,
795
+ "CC1CCC": 730,
796
+ "(Cl)cc3": 731,
797
+ "Cl)cc1": 732,
798
+ "c2cc(Br)": 733,
799
+ "OC(F)": 734,
800
+ "c2ccncc": 735,
801
+ "n[nH]": 736,
802
+ "OCC2": 737,
803
+ "(Cl)cc3)": 738,
804
+ "Cc1nc(": 739,
805
+ "CN2": 740,
806
+ "nc(N)": 741,
807
+ "C2=O)cc1": 742,
808
+ "nc2c1": 743,
809
+ "CCCC(=O)": 744,
810
+ "(F)F)": 745,
811
+ "Cc2ccccc": 746,
812
+ "(C)CC1": 747,
813
+ "c3s": 748,
814
+ "NC(=O)N2": 749,
815
+ "CCCC1)": 750,
816
+ "OP(=O)": 751,
817
+ "Nc1ccc(": 752,
818
+ "C(N)=O": 753,
819
+ ")cc2)CC1": 754,
820
+ "6)": 755,
821
+ "CC2)n1": 756,
822
+ "ncn1": 757,
823
+ "CCCS": 758,
824
+ "OCC(=O)": 759,
825
+ "2)C1=O": 760,
826
+ "CC2)ccc1": 761,
827
+ ")cc4": 762,
828
+ ")cccc1": 763,
829
+ "C(=O)N(C": 764,
830
+ "sc2c1": 765,
831
+ "C(=O)C1": 766,
832
+ "o3)": 767,
833
+ "c1cc(Cl)": 768,
834
+ "OC)c(OC)": 769,
835
+ "Cc1ccc(N": 770,
836
+ "c3ccc(OC": 771,
837
+ "c2cc1": 772,
838
+ "CC1=": 773,
839
+ "C(CC)": 774,
840
+ "CC=C": 775,
841
+ "c2c(F)cc": 776,
842
+ "C(OC": 777,
843
+ "c1)OCO": 778,
844
+ "c3ncc": 779,
845
+ "N(CC": 780,
846
+ "c1nc(N": 781,
847
+ "c(=O)n2": 782,
848
+ "c3ccc(N": 783,
849
+ "c8": 784,
850
+ "c3C)": 785,
851
+ "CC1(C": 786,
852
+ "CC1C": 787,
853
+ "c1)C(=O)": 788,
854
+ "cc4": 789,
855
+ "CN(CC": 790,
856
+ "ccc2c1": 791,
857
+ "ccccc4": 792,
858
+ "c4ccc5": 793,
859
+ "C(=O)NC1": 794,
860
+ "(C)C3": 795,
861
+ "CC(=": 796,
862
+ "CC(F)(F)": 797,
863
+ "cc(Br)": 798,
864
+ "(C(=O)C2": 799,
865
+ "CC[NH+]2": 800,
866
+ "CC(O": 801,
867
+ "C(C)O": 802,
868
+ "3)C1": 803,
869
+ "CCC(CC": 804,
870
+ "[O-])CC1": 805,
871
+ "CCC=CCC=": 806,
872
+ "[NH3+]C1": 807,
873
+ "c(=O)n1": 808,
874
+ "cn2": 809,
875
+ "[NH+]3": 810,
876
+ "[NH2+]1": 811,
877
+ "CCC2)": 812,
878
+ "(OC)": 813,
879
+ "ccnc1": 814,
880
+ "c5ccc(": 815,
881
+ "CC2)C1": 816,
882
+ "NC(=O)N1": 817,
883
+ "c1N": 818,
884
+ "(C(C)=O)": 819,
885
+ "CCC(N": 820,
886
+ "c3c2": 821,
887
+ "CCl)": 822,
888
+ "C(Cl)": 823,
889
+ "c2cccs2)": 824,
890
+ "Cc1c(": 825,
891
+ "CC(C)C1": 826,
892
+ "3CCCC3": 827,
893
+ "C(N)=O)": 828,
894
+ "[O-])cc2": 829,
895
+ "nc4": 830,
896
+ "(c3ccccc": 831,
897
+ "[NH2+]C)": 832,
898
+ "(C)C)C1": 833,
899
+ "n1cn": 834,
900
+ "O2": 835,
901
+ "c3nn": 836,
902
+ "OC(C)(C)": 837,
903
+ "c1nc(C": 838,
904
+ "c3cccc4": 839,
905
+ "c2cccnc2": 840,
906
+ ")c1ccc(": 841,
907
+ "CC1(C)C": 842,
908
+ "c3cc4": 843,
909
+ "nc1)": 844,
910
+ "COc1cccc": 845,
911
+ "NC2": 846,
912
+ "Cc1s": 847,
913
+ "CCCC3": 848,
914
+ "CC(=O)N2": 849,
915
+ "=NNC(=O)": 850,
916
+ "=C(C)": 851,
917
+ "[NH+](C)": 852,
918
+ "CC(CC(C": 853,
919
+ "=C(C": 854,
920
+ "#C": 855,
921
+ "F)CC1": 856,
922
+ "=C(C(=O)": 857,
923
+ "C(=O)C(": 858,
924
+ "sc2": 859,
925
+ "c1ccco1": 860,
926
+ "[nH]c1": 861,
927
+ "(=O)C1": 862,
928
+ "[NH2+]C(": 863,
929
+ "c2cc3c(": 864,
930
+ "=C([O-])": 865,
931
+ "c3cc(Cl)": 866,
932
+ "c1ccccn1": 867,
933
+ "oc1": 868,
934
+ "(NC(=O)": 869,
935
+ "CC(C)(O)": 870,
936
+ "c1ncccc1": 871,
937
+ "OCC(C": 872,
938
+ "CCO1": 873,
939
+ "3C)": 874,
940
+ "Cl)c1": 875,
941
+ "CCC(C)C": 876,
942
+ "Sc1n": 877,
943
+ "ccccc7": 878,
944
+ "ccccc12": 879,
945
+ "#N)cc1": 880,
946
+ "Oc2ccc(": 881,
947
+ "CCOC2": 882,
948
+ "c3nc(": 883,
949
+ "C(=O)C2": 884,
950
+ "Cc1no": 885,
951
+ "c5)": 886,
952
+ "n2ccc": 887,
953
+ "Cc1[nH]": 888,
954
+ "3)n2)cc1": 889,
955
+ "(Cl)c2": 890,
956
+ "C1C": 891,
957
+ "CC(CO)": 892,
958
+ "OCC(O)": 893,
959
+ "=C(C#N)": 894,
960
+ "CCCO1": 895,
961
+ ")cc4)": 896,
962
+ "n12": 897,
963
+ "[nH]c2c1": 898,
964
+ "cccc1": 899,
965
+ "CCCN1": 900,
966
+ "c1c(C": 901,
967
+ "Cc1nn(C)": 902,
968
+ "c3cc(C)": 903,
969
+ "NC": 904,
970
+ "2)nc1": 905,
971
+ "C3=O)": 906,
972
+ "OC(C)C)": 907,
973
+ "CCOCC1)": 908,
974
+ "OC(C)C": 909,
975
+ "(Cl)c2)": 910,
976
+ "4CCCC": 911,
977
+ "c2c[nH]": 912,
978
+ "C=CC(=O)": 913,
979
+ "OC(=O)N1": 914,
980
+ "C1CCN": 915,
981
+ "S)": 916,
982
+ "c2nc(N": 917,
983
+ "[NH2+]CC": 918,
984
+ "C=N": 919,
985
+ "c%1": 920,
986
+ "CCN(C)": 921,
987
+ "=[NH2+])": 922,
988
+ "c(C)n1": 923,
989
+ "C(=[NH+]": 924,
990
+ "n[nH]1": 925,
991
+ "(Cl)cc1)": 926,
992
+ "=CC=": 927,
993
+ "c2)OCO": 928,
994
+ "cc21": 929,
995
+ "c1cc(Br)": 930,
996
+ "F)cc1F": 931,
997
+ "c1cc(F)": 932,
998
+ "CCCC)": 933,
999
+ "(F)c2)": 934,
1000
+ "CCC(O": 935,
1001
+ "C(=O)O": 936,
1002
+ ")N(C": 937,
1003
+ "(C(=O)C": 938,
1004
+ "=C2S": 939,
1005
+ "=C)": 940,
1006
+ "NC(=O)CC": 941,
1007
+ "NC(=O)C3": 942,
1008
+ "c(Cl)c2)": 943,
1009
+ "s2)c1": 944,
1010
+ "C=CC=": 945,
1011
+ "CCNS(=O)": 946,
1012
+ "CC(N)=O)": 947,
1013
+ "c(Cl)c3)": 948,
1014
+ "OC)c1OC": 949,
1015
+ "(C)(C)": 950,
1016
+ "OCCO2": 951,
1017
+ "FC(F)(F)": 952,
1018
+ "c4cc(": 953,
1019
+ "3CCCC3)": 954,
1020
+ "Cl)CC1": 955,
1021
+ "ccccc2c1": 956,
1022
+ "ccc(C": 957,
1023
+ "cn2)": 958,
1024
+ "COCCO": 959,
1025
+ "CC1(O)": 960,
1026
+ "nc(N2": 961,
1027
+ "[nH]1)": 962,
1028
+ "CC(C)C(": 963,
1029
+ "o2)c1": 964,
1030
+ "F)C1": 965,
1031
+ "c7ccccc7": 966,
1032
+ "cc(C)": 967,
1033
+ "F)cc(": 968,
1034
+ "c3c(C)": 969,
1035
+ "[nH]2)": 970,
1036
+ "(Cl)c3)": 971,
1037
+ "c2ccco2)": 972,
1038
+ "SC1": 973,
1039
+ "n2c(": 974,
1040
+ "C2=O": 975,
1041
+ "nn3": 976,
1042
+ "on1": 977,
1043
+ "[nH]c2": 978,
1044
+ "CC5": 979,
1045
+ "4)c3)": 980,
1046
+ "C)CC1": 981,
1047
+ "C2CCCC": 982,
1048
+ "c3ccc(Br": 983,
1049
+ "=C2C(=O)": 984,
1050
+ "3CCCCC3)": 985,
1051
+ "c2ccc(O": 986,
1052
+ "2)ccc1": 987,
1053
+ "(C(=O)N3": 988,
1054
+ "c3ccncc": 989,
1055
+ "CC1CCCC": 990,
1056
+ "n2c(=O)": 991,
1057
+ "N)c1": 992,
1058
+ ")NC1": 993,
1059
+ "CNS(=O)": 994,
1060
+ "C2CC2)": 995,
1061
+ "Cn1cn": 996,
1062
+ "CC(C)(C": 997,
1063
+ "Cc1cccc2": 998,
1064
+ "(C)C)CC1": 999,
1065
+ "cccc(": 1000,
1066
+ "cnc1": 1001,
1067
+ "[C": 1002,
1068
+ "4CCO": 1003,
1069
+ "nc2cccc": 1004,
1070
+ "[nH]c1=O": 1005,
1071
+ "CCc1cc": 1006,
1072
+ "C=C(C": 1007,
1073
+ "c1)C1": 1008,
1074
+ "F)cc(C": 1009,
1075
+ "CCc1": 1010,
1076
+ "c(C)c(C)": 1011,
1077
+ "C2)n1": 1012,
1078
+ "CCCS(=O)": 1013,
1079
+ "OC)c(": 1014,
1080
+ "N=C(": 1015,
1081
+ "(C)(C)C)": 1016,
1082
+ "(C)O": 1017,
1083
+ "=CC(=O)N": 1018,
1084
+ "CCC12": 1019,
1085
+ "c1cccc(N": 1020,
1086
+ "Cc1nc(C": 1021,
1087
+ "=C3": 1022,
1088
+ "4)c3": 1023,
1089
+ "c2cc(N": 1024,
1090
+ "C3CC3)": 1025,
1091
+ "NC(=S)": 1026,
1092
+ "C=CC1": 1027,
1093
+ "CCCO)": 1028,
1094
+ "CCCCCC": 1029,
1095
+ "Cc1cc(N": 1030,
1096
+ "c2ccs": 1031,
1097
+ "CC(C)CC(": 1032,
1098
+ "SC)": 1033,
1099
+ "C(C)=O)": 1034,
1100
+ "=[N+]": 1035,
1101
+ "[NH+](C": 1036,
1102
+ "COCC1": 1037,
1103
+ "3)c2": 1038,
1104
+ "c2c(C)cc": 1039,
1105
+ "s2)CC1": 1040,
1106
+ "CCl": 1041,
1107
+ "CCC2(": 1042,
1108
+ ")(": 1043,
1109
+ "c2nn": 1044,
1110
+ "Cc2ccc(": 1045,
1111
+ "=C(N)N)": 1046,
1112
+ "2)C(=O)": 1047,
1113
+ "c1cc(C2": 1048,
1114
+ "(C(=O)OC": 1049,
1115
+ "cc1Cl": 1050,
1116
+ "COc1": 1051,
1117
+ "C4)": 1052,
1118
+ "CCC3)": 1053,
1119
+ ")ccn1": 1054,
1120
+ "c3cccc(C": 1055,
1121
+ "CC(=O)N(": 1056,
1122
+ "c1ccc(OC": 1057,
1123
+ "CCCc1n": 1058,
1124
+ "c3cccs3)": 1059,
1125
+ "CC(N)": 1060,
1126
+ "ccn1": 1061,
1127
+ "Br)cc(": 1062,
1128
+ "[O-])c(": 1063,
1129
+ "c2o": 1064,
1130
+ "C(OC(=O)": 1065,
1131
+ ")NC(=O)": 1066,
1132
+ "2)cc1OC": 1067,
1133
+ "C=CCO": 1068,
1134
+ ")ccc1OC": 1069,
1135
+ "c2ncccc2": 1070,
1136
+ "2)s1": 1071,
1137
+ "O=S(=O)(": 1072,
1138
+ "c2ccc(N3": 1073,
1139
+ "4)cc3": 1074,
1140
+ "[nH]c3": 1075,
1141
+ "(C)C)n1": 1076,
1142
+ "CSc1n": 1077,
1143
+ "C2CCC": 1078,
1144
+ "C=CC": 1079,
1145
+ "c3ncn": 1080,
1146
+ "C2)CC1": 1081,
1147
+ "C2O)": 1082,
1148
+ "c3ccc(C)": 1083,
1149
+ "c(=O)o": 1084,
1150
+ "O=C(C1": 1085,
1151
+ "[nH+]1": 1086,
1152
+ ")cc2c1": 1087,
1153
+ "C(OC)": 1088,
1154
+ "c(Cl)cc1": 1089,
1155
+ "[n-]": 1090,
1156
+ "C2)C1": 1091,
1157
+ "NN": 1092,
1158
+ "(C(=O)O": 1093,
1159
+ "c1cccs1)": 1094,
1160
+ "[P": 1095,
1161
+ "c1ccco1)": 1096,
1162
+ "OCCCO": 1097,
1163
+ "(F)c3)": 1098,
1164
+ "Cc1o": 1099,
1165
+ "CC(C)n1": 1100,
1166
+ "C=CCn1": 1101,
1167
+ "cc1C": 1102,
1168
+ "c4n": 1103,
1169
+ "CC1CC1": 1104,
1170
+ "c2nnc(C": 1105,
1171
+ "c(N)c1": 1106,
1172
+ "CCCCCCC": 1107,
1173
+ ")N1CCN(": 1108,
1174
+ "c(N3": 1109,
1175
+ "c(O)c1": 1110,
1176
+ "c3cc(F)": 1111,
1177
+ "(C)CC": 1112,
1178
+ "C)C1": 1113,
1179
+ "2)cc1)": 1114,
1180
+ ")cc(C": 1115,
1181
+ "C3CCCCC": 1116,
1182
+ "c3ccc(F)": 1117,
1183
+ "c(=O)c1": 1118,
1184
+ "c1)OCO2": 1119,
1185
+ "c4cn": 1120,
1186
+ "c2ccc(O)": 1121,
1187
+ "n2)C1": 1122,
1188
+ "cc(Cl)": 1123,
1189
+ "c(F)c2)": 1124,
1190
+ "C1(": 1125,
1191
+ "OS(=O)": 1126,
1192
+ "NC(C)=O)": 1127,
1193
+ "C(CC(=O)": 1128,
1194
+ "Cn1cc(": 1129,
1195
+ "(C)cc2)": 1130,
1196
+ "S1(=O)": 1131,
1197
+ "c2ccc(CC": 1132,
1198
+ "(CC)CC": 1133,
1199
+ "2)c1C": 1134,
1200
+ "c1ncc(": 1135,
1201
+ "(C)cc3)": 1136,
1202
+ "CC(=O)O": 1137,
1203
+ "(F)c2": 1138,
1204
+ "4)ccc3": 1139,
1205
+ "ccc5": 1140,
1206
+ "c1ccc(CN": 1141,
1207
+ "c(=O)c2": 1142,
1208
+ "c1cncc(": 1143,
1209
+ "c2nc(C)": 1144,
1210
+ "[P+]": 1145,
1211
+ "c3[nH]": 1146,
1212
+ "[n+]1": 1147,
1213
+ "CCOc1ccc": 1148,
1214
+ "C[NH3+]": 1149,
1215
+ "4CCCCC": 1150,
1216
+ "4)cc3)": 1151,
1217
+ "C(=O)NC": 1152,
1218
+ "ncc(": 1153,
1219
+ "3)C1)": 1154,
1220
+ "CC(F)": 1155,
1221
+ "C(C)C1": 1156,
1222
+ "Nc1cc": 1157,
1223
+ "[O-])c2)": 1158,
1224
+ "C1=O)": 1159,
1225
+ "(C)C(": 1160,
1226
+ "O2)": 1161,
1227
+ "=NN": 1162,
1228
+ "CCCC3)": 1163,
1229
+ "c2c(C": 1164,
1230
+ "c2nccc": 1165,
1231
+ "nc(C2": 1166,
1232
+ "C1(C": 1167,
1233
+ "c(NC": 1168,
1234
+ "nc2n1": 1169,
1235
+ "c2sccc2": 1170,
1236
+ "n5": 1171,
1237
+ "=[N-]": 1172,
1238
+ "[S-]": 1173,
1239
+ "CCOC(C": 1174,
1240
+ "COP(=O)": 1175,
1241
+ "c1ccc(n2": 1176,
1242
+ "(Br)": 1177,
1243
+ "c3cc(C": 1178,
1244
+ "c1o": 1179,
1245
+ "=[NH+]O)": 1180,
1246
+ "CC1CCN": 1181,
1247
+ "CC2CCC1": 1182,
1248
+ ")ccc1Cl": 1183,
1249
+ "C(=O)NC2": 1184,
1250
+ "I)c1": 1185,
1251
+ "c3ccco3)": 1186,
1252
+ "nnn1": 1187,
1253
+ "CCC(CO)": 1188,
1254
+ "n2c1": 1189,
1255
+ "nc21": 1190,
1256
+ "n3cn": 1191,
1257
+ "cnn1": 1192,
1258
+ "n(C2": 1193,
1259
+ "c4ccc(C": 1194,
1260
+ ")S(=O)": 1195,
1261
+ "cc12": 1196,
1262
+ "Nc1cc(": 1197,
1263
+ "I)cc1": 1198,
1264
+ "nc1C": 1199,
1265
+ "NC(N)": 1200,
1266
+ "cnc2": 1201,
1267
+ "(CCC(=O)": 1202,
1268
+ "n4)": 1203,
1269
+ "c2cc(OC)": 1204,
1270
+ "2)cn1": 1205,
1271
+ "CCc1cccc": 1206,
1272
+ "Cn1c(=O)": 1207,
1273
+ ")cc2)n1": 1208,
1274
+ "C1C2": 1209,
1275
+ "(CC)CC)": 1210,
1276
+ "(CC(C)C)": 1211,
1277
+ "C1CCN(": 1212,
1278
+ "C(=N": 1213,
1279
+ "P(": 1214,
1280
+ "O=C(CO": 1215,
1281
+ "(F)c1)": 1216,
1282
+ "c2ncc(": 1217,
1283
+ "C=C(C)C": 1218,
1284
+ "NC(=O)CN": 1219,
1285
+ "CCc1cn": 1220,
1286
+ "n2)n1": 1221,
1287
+ "c([O-])": 1222,
1288
+ "CC1O": 1223,
1289
+ "(C)cc(C)": 1224,
1290
+ "C(=O)OCC": 1225,
1291
+ "(C)ccc1": 1226,
1292
+ "Cc1c(C": 1227,
1293
+ "SC2": 1228,
1294
+ ")ccc1F": 1229,
1295
+ "cn2)cc1": 1230,
1296
+ "c1nc2c(": 1231,
1297
+ "COc1cc2": 1232,
1298
+ "CC21": 1233,
1299
+ "CCc1ccc": 1234,
1300
+ "C(F)F": 1235,
1301
+ "CC1CN(": 1236,
1302
+ "c2nnn": 1237,
1303
+ "(Cl)c1)": 1238,
1304
+ "c4cccc(": 1239,
1305
+ "CCOC(": 1240,
1306
+ "c(F)c1F": 1241,
1307
+ "Cl)C1": 1242,
1308
+ "c2C)cc1": 1243,
1309
+ "(Cl)c(": 1244,
1310
+ "CCCC(O)": 1245,
1311
+ "OC4": 1246,
1312
+ "c1csc(": 1247,
1313
+ "CC5)": 1248,
1314
+ "4CCOCC4)": 1249,
1315
+ "CC(Cl)": 1250,
1316
+ "C(O)C1": 1251,
1317
+ "3)cc2": 1252,
1318
+ "CCCCn1": 1253,
1319
+ "C(CS": 1254,
1320
+ "OC)C1": 1255,
1321
+ "OC(C)=O": 1256,
1322
+ "CC=C(": 1257,
1323
+ "NC(=O)OC": 1258,
1324
+ "CC[NH+](": 1259,
1325
+ "c4ccccc3": 1260,
1326
+ "3CC4": 1261,
1327
+ "C=O": 1262,
1328
+ "C1=C(C)": 1263,
1329
+ "F)cc2)c1": 1264,
1330
+ "CC[NH3+]": 1265,
1331
+ "CCc1cc(": 1266,
1332
+ "c9": 1267,
1333
+ "O)cc2)": 1268,
1334
+ ")cc2)C1": 1269,
1335
+ "c1ccsc1": 1270,
1336
+ "nccc1": 1271,
1337
+ "O[Si]": 1272,
1338
+ "c1nc(C)": 1273,
1339
+ "=[NH+]C": 1274,
1340
+ "c2ccc(C3": 1275,
1341
+ "CCCC2)c1": 1276,
1342
+ "c1)N": 1277,
1343
+ "CCCCN": 1278,
1344
+ "(=O)C": 1279,
1345
+ "nc(Cl)": 1280,
1346
+ "-3": 1281,
1347
+ "c1c(C)cc": 1282,
1348
+ "Cc1cc2c(": 1283,
1349
+ "C(OC2": 1284,
1350
+ "c4ccc5c(": 1285,
1351
+ "2)c(C)c1": 1286,
1352
+ "[O-])cc(": 1287,
1353
+ "C(CCCC": 1288,
1354
+ "CC1CC1)": 1289,
1355
+ "cc(C(=O)": 1290,
1356
+ "cc2c1": 1291,
1357
+ "CC1CO": 1292,
1358
+ "CC2)c1C": 1293,
1359
+ "N)cc1": 1294,
1360
+ ")ccc1O": 1295,
1361
+ "C)n1": 1296,
1362
+ "O=C(CS": 1297,
1363
+ "(C)O)": 1298,
1364
+ "(Cl)c(C": 1299,
1365
+ "c1c(N": 1300,
1366
+ "3)c2)": 1301,
1367
+ "C1CCCC": 1302,
1368
+ "(OCC)": 1303,
1369
+ "CC1CCN(": 1304,
1370
+ "c(=S)": 1305,
1371
+ "c(C3": 1306,
1372
+ "c2noc(C": 1307,
1373
+ "F)cc4)": 1308,
1374
+ "C(C(C)C)": 1309,
1375
+ "(CCC)": 1310,
1376
+ "OC)cc(": 1311,
1377
+ "O)cc1)": 1312,
1378
+ "cn1)": 1313,
1379
+ "c2n[nH]": 1314,
1380
+ "(C)CCC": 1315,
1381
+ "1)N": 1316,
1382
+ "c(Cl)c1)": 1317,
1383
+ "[O-])n1": 1318,
1384
+ "C(C)=O": 1319,
1385
+ "c(=O)n(": 1320,
1386
+ "O=C(Cn1": 1321,
1387
+ "OC)cc": 1322,
1388
+ "2)o1": 1323,
1389
+ "cc2Cl)": 1324,
1390
+ "c3cccnc3": 1325,
1391
+ "C(CC": 1326,
1392
+ "c(F)c3)": 1327,
1393
+ "CCc2c(": 1328,
1394
+ "CC(C)=": 1329,
1395
+ "c2nc3c(": 1330,
1396
+ "n3cc": 1331,
1397
+ "cn3": 1332,
1398
+ "Oc2ccc": 1333,
1399
+ "o2)CC1": 1334,
1400
+ "2)c(": 1335,
1401
+ "c(SC": 1336,
1402
+ "3)CC2": 1337,
1403
+ "C6": 1338,
1404
+ "=O)C1": 1339,
1405
+ "ccc3C)": 1340,
1406
+ "Cc1nn(": 1341,
1407
+ "nc(S": 1342,
1408
+ "O=c1": 1343,
1409
+ "=O)N": 1344,
1410
+ "sc3": 1345,
1411
+ "CCCOC1": 1346,
1412
+ "nn1C": 1347,
1413
+ "CCC(C)C(": 1348,
1414
+ "n(C)c1": 1349,
1415
+ "O=C1N": 1350,
1416
+ "c1ccc(S": 1351,
1417
+ "ncnc1": 1352,
1418
+ "2)N1": 1353,
1419
+ "F)n1": 1354,
1420
+ "CC=CC=": 1355,
1421
+ "c[nH]1)": 1356,
1422
+ "CCC3(CC": 1357,
1423
+ "(N)=S)": 1358,
1424
+ "[NH3+])N": 1359,
1425
+ "e]": 1360,
1426
+ "(=O)[nH]": 1361,
1427
+ "c1nn": 1362,
1428
+ "ncc3": 1363,
1429
+ "(Cc2ccc": 1364,
1430
+ "OCC3": 1365,
1431
+ "sc(": 1366,
1432
+ "CCC1)": 1367,
1433
+ "7)": 1368,
1434
+ "cccc3": 1369,
1435
+ "c5ccc6": 1370,
1436
+ "CCCN2": 1371,
1437
+ "3)ccc2": 1372,
1438
+ "c2=O)cc1": 1373,
1439
+ "C=CC(": 1374,
1440
+ "(C)C(C": 1375,
1441
+ "n2cc(": 1376,
1442
+ "2)C2": 1377,
1443
+ "n2nn": 1378,
1444
+ "C=CCN": 1379,
1445
+ ")N1CCC(": 1380,
1446
+ "cccn1": 1381,
1447
+ "CC1CC(": 1382,
1448
+ "3CCCCC3": 1383,
1449
+ "C[Si](C)": 1384,
1450
+ "Cn1cc(C": 1385,
1451
+ "n2n": 1386,
1452
+ "nnc2": 1387,
1453
+ "3)C2)cc1": 1388,
1454
+ "nc1N": 1389,
1455
+ "CC1CCC(": 1390,
1456
+ "(C(=O)N1": 1391,
1457
+ "B(O)": 1392,
1458
+ "COC(=O)N": 1393,
1459
+ "CC3)cc2": 1394,
1460
+ "N=N": 1395,
1461
+ "c5cccc": 1396,
1462
+ "c1cc(N2": 1397,
1463
+ "ccccc5": 1398,
1464
+ "(CC(O)": 1399,
1465
+ "4)C3": 1400,
1466
+ "cn2)CC1": 1401,
1467
+ "3CCC": 1402,
1468
+ "COC(": 1403,
1469
+ "c1c(N)": 1404,
1470
+ "H]": 1405,
1471
+ "(=O)o": 1406,
1472
+ "c2cncc": 1407,
1473
+ "c-2": 1408,
1474
+ "C[NH3+])": 1409,
1475
+ "ccccc8": 1410,
1476
+ "3)C2)": 1411,
1477
+ "CCC2C3": 1412,
1478
+ "(=O)[O-]": 1413,
1479
+ "F)cc1Cl": 1414,
1480
+ "CCCCCO": 1415,
1481
+ ")cc5": 1416,
1482
+ "CC1CCCC1": 1417,
1483
+ "COCCC": 1418,
1484
+ "3C(=O)": 1419,
1485
+ "Nc1ccc": 1420,
1486
+ "OCO4)": 1421,
1487
+ "CCCl)": 1422,
1488
+ "C(Cc1cn": 1423,
1489
+ "c3cccs": 1424,
1490
+ "c2sc(": 1425,
1491
+ "(CC(C)": 1426,
1492
+ "CCOCC2": 1427,
1493
+ "O=c1[nH]": 1428,
1494
+ "Oc2ccccc": 1429,
1495
+ "CCCN(C": 1430,
1496
+ "c1ccc(C(": 1431,
1497
+ "C(F)F)": 1432,
1498
+ "c1)OCCO2": 1433,
1499
+ "c(Cl)cc2": 1434,
1500
+ "sc1C": 1435,
1501
+ ")cc12": 1436,
1502
+ "c6ccc": 1437,
1503
+ "c2csc(": 1438,
1504
+ "c2cc(N)": 1439,
1505
+ "c4c3": 1440,
1506
+ "c3cs": 1441,
1507
+ "(CCl)": 1442,
1508
+ "c(C)cc1": 1443,
1509
+ "N#C": 1444,
1510
+ "C(O)C1O": 1445,
1511
+ "c2cnn(C)": 1446,
1512
+ "c2)OCCO": 1447,
1513
+ "c2)OCO3)": 1448,
1514
+ "C(C2": 1449,
1515
+ "F)cc4": 1450,
1516
+ "4CC4)": 1451,
1517
+ "c5cc": 1452,
1518
+ "c3c(c2": 1453,
1519
+ "Cn1cc": 1454,
1520
+ "CC2CCCO": 1455,
1521
+ "C(C)C)c1": 1456,
1522
+ "[C-]": 1457,
1523
+ "Cc1c(C)": 1458,
1524
+ "(=O)c1cc": 1459,
1525
+ "c1ccc(O)": 1460,
1526
+ "c2cco": 1461,
1527
+ "[O-])c2": 1462,
1528
+ ")cc1)": 1463,
1529
+ "CC1CCC(C": 1464,
1530
+ "c(C)c3)": 1465,
1531
+ "cc5": 1466,
1532
+ "(C)c2": 1467,
1533
+ "ccc2n1": 1468,
1534
+ "OC(F)F": 1469,
1535
+ "CCCC(C)C": 1470,
1536
+ "COC(C": 1471,
1537
+ "[O-])c3)": 1472,
1538
+ "[N+]1": 1473,
1539
+ "n2C": 1474,
1540
+ "Cc1cc(N2": 1475,
1541
+ "CNC(=O)N": 1476,
1542
+ "C(C)C2": 1477,
1543
+ "CCCO1)": 1478,
1544
+ "Cc1ccc(O": 1479,
1545
+ "CC(C)c1n": 1480,
1546
+ "n(C)n1": 1481,
1547
+ "COCCn1": 1482,
1548
+ "C(C)CC": 1483,
1549
+ "Br)cc2": 1484,
1550
+ "Cl)N": 1485,
1551
+ "c1sccc1": 1486,
1552
+ "(C)cc3": 1487,
1553
+ "CC(=N": 1488,
1554
+ "C2CCCCC": 1489,
1555
+ "c4cccc5": 1490,
1556
+ "C)C(=O)": 1491,
1557
+ "SC(=C": 1492,
1558
+ "(C3": 1493,
1559
+ "c(=O)n3": 1494,
1560
+ "CC(C#N)": 1495,
1561
+ "c(F)c2": 1496,
1562
+ "C1CCC(": 1497,
1563
+ "(C)CC2": 1498,
1564
+ "C=C2": 1499,
1565
+ "2)N": 1500,
1566
+ "cc1Cl)": 1501,
1567
+ "c1cnc(": 1502,
1568
+ "c1C#N": 1503,
1569
+ "N1CCCC1": 1504,
1570
+ "=O)CC1": 1505,
1571
+ "CC(C)(": 1506,
1572
+ "CCCCC=": 1507,
1573
+ "2)cc1C": 1508,
1574
+ "(c2ccc(": 1509,
1575
+ "(CCCC)": 1510,
1576
+ "C(C1": 1511,
1577
+ "4)C2)": 1512,
1578
+ "c1c(Cl)": 1513,
1579
+ "c(C(C)C)": 1514,
1580
+ "n2C)": 1515,
1581
+ "CCc2ccc(": 1516,
1582
+ "COCCN": 1517,
1583
+ "OCc1ccc(": 1518,
1584
+ "c(=O)n(C": 1519,
1585
+ "N(C)C": 1520,
1586
+ "3)C1)C2": 1521,
1587
+ "C(Br)": 1522,
1588
+ "c1nc(N)": 1523,
1589
+ "Br)c2)": 1524,
1590
+ "c(C)c(": 1525,
1591
+ "ccc2Cl)": 1526,
1592
+ "nc2s": 1527,
1593
+ "[nH]3)": 1528,
1594
+ "CSCCC(": 1529,
1595
+ "c8ccccc8": 1530,
1596
+ "CCC(C)C)": 1531,
1597
+ "CCCCC)": 1532,
1598
+ "(C)c(": 1533,
1599
+ "CO1": 1534,
1600
+ "=NC(=O)": 1535,
1601
+ "OC)C(=O)": 1536,
1602
+ "CCCCCC)": 1537,
1603
+ "O)C(O)": 1538,
1604
+ "c2c(c1)": 1539,
1605
+ "n2nc(": 1540,
1606
+ "c(F)cc2": 1541,
1607
+ "C2CC3": 1542,
1608
+ "3)n2)": 1543,
1609
+ "#[N+]": 1544,
1610
+ "c(C)c1C": 1545,
1611
+ "Cc1ccsc1": 1546,
1612
+ "O1)": 1547,
1613
+ "C2CCCCC2": 1548,
1614
+ "s2)C1": 1549,
1615
+ "F)cccc3": 1550,
1616
+ "c(=O)n": 1551,
1617
+ "C1CC": 1552,
1618
+ "[NH3+]C(": 1553,
1619
+ "C2=O)c1": 1554,
1620
+ "(Cl)s1": 1555,
1621
+ "[n+]2": 1556,
1622
+ "[O-])cc3": 1557,
1623
+ "CC(S": 1558,
1624
+ "Cc1ccco1": 1559,
1625
+ ")N1CCCC1": 1560,
1626
+ "cc2C)": 1561,
1627
+ "C(C)O)": 1562,
1628
+ "C1(C)": 1563,
1629
+ "(CNC(=O)": 1564,
1630
+ "C1CC1)": 1565,
1631
+ "O3)": 1566,
1632
+ "COc1c(": 1567,
1633
+ "Cc1nc(N": 1568,
1634
+ "(c2ccc": 1569,
1635
+ "N1CCN": 1570,
1636
+ "(CC[NH+]": 1571,
1637
+ "(C)(C)C": 1572,
1638
+ "c1nc(Cl)": 1573,
1639
+ "c2cc(O)": 1574,
1640
+ "cs2)cc1": 1575,
1641
+ "c2noc(": 1576,
1642
+ "c1cc(O)": 1577,
1643
+ "nc2)cc1": 1578,
1644
+ "ccccc23)": 1579,
1645
+ "2)CC1)": 1580,
1646
+ "N1CCN(": 1581,
1647
+ "(=O)NC": 1582,
1648
+ "O=C(C=C": 1583,
1649
+ "OC)c(C": 1584,
1650
+ "OC(F)F)": 1585,
1651
+ "nc4)": 1586,
1652
+ "c1cnn(": 1587,
1653
+ "(C)cc1": 1588,
1654
+ "S2": 1589,
1655
+ "c1nc(C2": 1590,
1656
+ "c(C)c2)": 1591,
1657
+ "C1=C(": 1592,
1658
+ "COC(C)": 1593,
1659
+ "c3ccc(O": 1594,
1660
+ "CCC(C2": 1595,
1661
+ ")ccc12": 1596,
1662
+ "ncn2": 1597,
1663
+ "c1cncc": 1598,
1664
+ "c3)OCO4)": 1599,
1665
+ "N2CCN": 1600,
1666
+ "CC1CC": 1601,
1667
+ "CC(C)N": 1602,
1668
+ "s2)n1": 1603,
1669
+ "(=O)N(C)": 1604,
1670
+ "Nc1ncn": 1605,
1671
+ "c2cn3": 1606,
1672
+ "(Cl)c3": 1607,
1673
+ "CCC1CCCC": 1608,
1674
+ "CC(=C": 1609,
1675
+ "Oc1ccc(C": 1610,
1676
+ "CC(O)C1": 1611,
1677
+ "=CN": 1612,
1678
+ "(=O)NC2": 1613,
1679
+ "[O-])C(": 1614,
1680
+ "CCCOC": 1615,
1681
+ "(C)C)C2": 1616,
1682
+ "2)cc1Cl": 1617,
1683
+ ")Nc1ccc(": 1618,
1684
+ "c1n[nH]": 1619,
1685
+ ")ccc1N": 1620,
1686
+ "cc(S(=O)": 1621,
1687
+ "CCOCC": 1622,
1688
+ "cn2)c1": 1623,
1689
+ "c4cc5": 1624,
1690
+ "3CCOCC3": 1625,
1691
+ "(C)=O)": 1626,
1692
+ "nnc(": 1627,
1693
+ "c2c(c1": 1628,
1694
+ "CN2C(=O)": 1629,
1695
+ "C1CC2": 1630,
1696
+ "2)C(=O)N": 1631,
1697
+ "NC(N": 1632,
1698
+ "c1nccs1": 1633,
1699
+ "c2C1": 1634,
1700
+ "ccccc12)": 1635,
1701
+ "CCc1nc(": 1636,
1702
+ "nnnc1": 1637,
1703
+ "c2c1C": 1638,
1704
+ "3)n2)c1": 1639,
1705
+ "c5c(": 1640,
1706
+ "=C(N)N": 1641,
1707
+ "2)C(": 1642,
1708
+ "cn3)": 1643,
1709
+ "CCC#N)": 1644,
1710
+ "c1ccc(N)": 1645,
1711
+ "c1ncc(C": 1646,
1712
+ "c3ccco": 1647,
1713
+ "C2C3": 1648,
1714
+ "c2no": 1649,
1715
+ "c2C)CC1": 1650,
1716
+ "c1no": 1651,
1717
+ "c2c(Cl)": 1652,
1718
+ "Br)C1": 1653,
1719
+ "=CC2": 1654,
1720
+ "(C)n1": 1655,
1721
+ "=CC": 1656,
1722
+ "CCn1cn": 1657,
1723
+ "CCCCC(": 1658,
1724
+ "OCC1OC(": 1659,
1725
+ "cs1)": 1660,
1726
+ "[O-])C2": 1661,
1727
+ "=C(Cl)": 1662,
1728
+ "=CC1": 1663,
1729
+ "oc(=O)": 1664,
1730
+ "Oc1ccc": 1665,
1731
+ "C(=S)": 1666,
1732
+ "C=CCCC": 1667,
1733
+ "cc1)": 1668,
1734
+ "c3=O)": 1669,
1735
+ "[nH]2)c1": 1670,
1736
+ "(C1": 1671,
1737
+ "(C)c3)": 1672,
1738
+ "c(F)c1)": 1673,
1739
+ "C=CC(C": 1674,
1740
+ "ccc(N": 1675,
1741
+ "C(=O)OC1": 1676,
1742
+ "CC)c1": 1677,
1743
+ "C12CC3": 1678,
1744
+ "CCC(C(C)": 1679,
1745
+ "Cc1nnc(": 1680,
1746
+ "[O-])c1)": 1681,
1747
+ "O=C(NCC1": 1682,
1748
+ "CCSC1": 1683,
1749
+ "c2)cc1OC": 1684,
1750
+ "s4)": 1685,
1751
+ "c2ccnc(N": 1686,
1752
+ "C(=O)C3": 1687,
1753
+ "c6)": 1688,
1754
+ "(C(N)=O)": 1689,
1755
+ "[NH3+])C": 1690,
1756
+ "=N)": 1691,
1757
+ "(CCCCC": 1692,
1758
+ "3)C2=O)": 1693,
1759
+ "2)n": 1694,
1760
+ "CCOC(C)": 1695,
1761
+ "C(N)=S": 1696,
1762
+ "ccc3Cl)": 1697,
1763
+ "ccccc34)": 1698,
1764
+ "CCCCCCO": 1699,
1765
+ "CC(=O)OC": 1700,
1766
+ "Cn1ccnc1": 1701,
1767
+ "3CC[NH+]": 1702,
1768
+ "(=O)O": 1703,
1769
+ "C21": 1704,
1770
+ "O=C(CC": 1705,
1771
+ "CCCCC3": 1706,
1772
+ "F)cc2)n1": 1707,
1773
+ "c2ccncc2": 1708,
1774
+ "C1(C)C": 1709,
1775
+ "C(=C)": 1710,
1776
+ "c2c(N": 1711,
1777
+ "n2n1": 1712,
1778
+ "c1cnc(N": 1713,
1779
+ "OCC)c1": 1714,
1780
+ "c(CO": 1715,
1781
+ "c1cn2": 1716,
1782
+ "CC1CCCO1": 1717,
1783
+ "nc2)c1": 1718,
1784
+ "c1[nH+]": 1719,
1785
+ "c(F)cc1": 1720,
1786
+ "CC(O)CO": 1721,
1787
+ "ccc2F)": 1722,
1788
+ "c3cc4c(": 1723,
1789
+ "CCOC3": 1724,
1790
+ "(Cc2ccc(": 1725,
1791
+ "c(CO)": 1726,
1792
+ "c2nc(=O)": 1727,
1793
+ "cc1F": 1728,
1794
+ "C(=O)NCC": 1729,
1795
+ "n2nc(C)": 1730,
1796
+ ")ccc1Br": 1731,
1797
+ "N1CCC(": 1732,
1798
+ "c4)CC3)": 1733,
1799
+ "c2nc(N)": 1734,
1800
+ "COCCN1": 1735,
1801
+ "F)ccc1F": 1736,
1802
+ "=[N-])": 1737,
1803
+ "C3)cc1": 1738,
1804
+ "c1ccc(CO": 1739,
1805
+ "c(C)c2": 1740,
1806
+ "cc(O)": 1741,
1807
+ "Cl)n1": 1742,
1808
+ "3)c2)cc1": 1743,
1809
+ "nc5": 1744,
1810
+ "c2C)c1": 1745,
1811
+ "O=C(CC1": 1746,
1812
+ "C(C)S": 1747,
1813
+ "(CCO": 1748,
1814
+ "NC1=O": 1749,
1815
+ "c2ncc(C": 1750,
1816
+ "#N)c1": 1751,
1817
+ "C1CCCN": 1752,
1818
+ "c2n(": 1753,
1819
+ "N1CC": 1754,
1820
+ "C(O)=C(": 1755,
1821
+ "Cc1nn": 1756,
1822
+ "Nc1": 1757,
1823
+ "SC(C)": 1758,
1824
+ "CCC1(C": 1759,
1825
+ "OCO2": 1760,
1826
+ "CCCCCC=": 1761,
1827
+ "CC#CC#": 1762,
1828
+ "c1cc(N)": 1763,
1829
+ "F)cc2F)": 1764,
1830
+ "2)cc(": 1765,
1831
+ "(C)C(C)C": 1766,
1832
+ "cs2)": 1767,
1833
+ "c3cc(OC)": 1768,
1834
+ "CCC21": 1769,
1835
+ "c1=O)": 1770,
1836
+ "(CCOC)": 1771,
1837
+ "c23)": 1772,
1838
+ "C(=O)OC)": 1773,
1839
+ "(C)C)cc2": 1774,
1840
+ "c2nc3": 1775,
1841
+ "OO": 1776,
1842
+ "c2nnnn2": 1777,
1843
+ "3)cc2)": 1778,
1844
+ "=CCCC": 1779,
1845
+ "(Cl)c1Cl": 1780,
1846
+ "N#Cc1": 1781,
1847
+ "c(CN": 1782,
1848
+ "coc(": 1783,
1849
+ "(C(C)(C)": 1784,
1850
+ "OCc1ccc": 1785,
1851
+ "C=CC2": 1786,
1852
+ "%10": 1787,
1853
+ "O=C(NC": 1788,
1854
+ "NC(=O)N(": 1789,
1855
+ "c4ncc": 1790,
1856
+ "=C(S": 1791,
1857
+ "CCOP(=O)": 1792,
1858
+ "Cc1csc(": 1793,
1859
+ "CC(C)O1": 1794,
1860
+ ")cc(C)c1": 1795,
1861
+ "c1ccc(N(": 1796,
1862
+ "CSC1": 1797,
1863
+ "CC2)nc1": 1798,
1864
+ "c4)cc3": 1799,
1865
+ "c1cc(=O)": 1800,
1866
+ "N=[N+]": 1801,
1867
+ "C(c1ccc(": 1802,
1868
+ ")cc5)": 1803,
1869
+ "(C(=O)C3": 1804,
1870
+ "N1CCOCC1": 1805,
1871
+ "Cc2cccc": 1806,
1872
+ "CC2(": 1807,
1873
+ "nn(C)": 1808,
1874
+ "n2cc(C": 1809,
1875
+ "c1nn(": 1810,
1876
+ "CCC1(CC)": 1811,
1877
+ "C(O": 1812,
1878
+ "n3)n": 1813,
1879
+ "o2)C1": 1814,
1880
+ "Cl)C(=O)": 1815,
1881
+ "n3ccc": 1816,
1882
+ "(C)c2)": 1817,
1883
+ "c(Br)cc1": 1818,
1884
+ "n2c(C)": 1819,
1885
+ "Br)s1": 1820,
1886
+ "CC(O)(C": 1821,
1887
+ "C1CC(": 1822,
1888
+ "nc(C)n1": 1823,
1889
+ "c6ccc(": 1824,
1890
+ "C(C)(C": 1825,
1891
+ "F)cc2)C1": 1826,
1892
+ "F)C(=O)": 1827,
1893
+ ")N1CC": 1828,
1894
+ "(Cl)(Cl)": 1829,
1895
+ "c1cccc(O": 1830,
1896
+ "=O)cc2": 1831,
1897
+ "CC)cc1": 1832,
1898
+ "4C)": 1833,
1899
+ "CC2(CC": 1834,
1900
+ "c1co": 1835,
1901
+ "C1CCC2": 1836,
1902
+ "c2c(N)": 1837,
1903
+ "c2ccsc2)": 1838,
1904
+ "(Cl)cc4)": 1839,
1905
+ "C[NH2+]1": 1840,
1906
+ "c[nH]1": 1841,
1907
+ "CCC5": 1842,
1908
+ "c(=O)c3": 1843,
1909
+ "c1cc(OC": 1844,
1910
+ "CCCCCCC1": 1845,
1911
+ "c4c3)": 1846,
1912
+ "CCO2": 1847,
1913
+ "[NH2+]1)": 1848,
1914
+ "C[NH2+]C": 1849,
1915
+ "c1c[nH+]": 1850,
1916
+ "Br)cc1)": 1851,
1917
+ "CCCCC(C)": 1852,
1918
+ "ccc2C)": 1853,
1919
+ "CCn1c(": 1854,
1920
+ "(C(=O)CC": 1855,
1921
+ "CN(S(=O)": 1856,
1922
+ "c(F)c3": 1857,
1923
+ "CC2CCC": 1858,
1924
+ "=CC(": 1859,
1925
+ "N2CC": 1860,
1926
+ "=[NH+]O": 1861,
1927
+ "ccc12": 1862,
1928
+ "C2C1": 1863,
1929
+ "Cc1nc(C)": 1864,
1930
+ "(C(=O)CS": 1865,
1931
+ "OC)n1": 1866,
1932
+ "2)c1=O": 1867,
1933
+ "c%10": 1868,
1934
+ "COCC(C)": 1869,
1935
+ "3)CC2)c1": 1870,
1936
+ "c(NC2": 1871,
1937
+ "c3ccc(O)": 1872,
1938
+ "NC(=O)NC": 1873,
1939
+ "-c2ccc(": 1874,
1940
+ "n2nc(C": 1875,
1941
+ "c2cc(=O)": 1876,
1942
+ "CCC(N)": 1877,
1943
+ "CCn1c(S": 1878,
1944
+ "ncnc3": 1879,
1945
+ "CCCl": 1880,
1946
+ "c1nnc(C": 1881,
1947
+ "(c4ccccc": 1882,
1948
+ "Fc1ccc(": 1883,
1949
+ "c3cc(Br)": 1884,
1950
+ "=Cc1ccc(": 1885,
1951
+ "c2nc(Cl)": 1886,
1952
+ "CCS1": 1887,
1953
+ "COC2": 1888,
1954
+ "SCC(=O)": 1889,
1955
+ "c2[nH+]": 1890,
1956
+ "C(C)(O)": 1891,
1957
+ "COc1cc(N": 1892,
1958
+ "n2ccnc2)": 1893,
1959
+ "(C)C)c(": 1894,
1960
+ "2)c1)": 1895,
1961
+ "(F)c3": 1896,
1962
+ "(F)(F)F": 1897,
1963
+ "CCC4)": 1898,
1964
+ "c(Cl)c2": 1899,
1965
+ "[nH]n1": 1900,
1966
+ "n2c(C": 1901,
1967
+ "(C2CC2)": 1902,
1968
+ "C=CCC1": 1903,
1969
+ "N=C1": 1904,
1970
+ "OC12": 1905,
1971
+ "C4CCCCC": 1906,
1972
+ "(=O)C2": 1907,
1973
+ "CCCC2)C1": 1908,
1974
+ "OC)CC1": 1909,
1975
+ "O=C(NCC": 1910,
1976
+ "nc2c(": 1911,
1977
+ "S1(=O)=O": 1912,
1978
+ "N#CC1": 1913,
1979
+ "Oc3ccc(": 1914,
1980
+ "C(=O)C(C": 1915,
1981
+ "C3O)": 1916,
1982
+ "F)cc1)N": 1917,
1983
+ "CC2(O)": 1918,
1984
+ "c3cc2": 1919,
1985
+ "cnn1C": 1920,
1986
+ "nn2)cc1": 1921,
1987
+ "Cc1cccs1": 1922,
1988
+ "2)no1": 1923,
1989
+ "CC2C1": 1924,
1990
+ ")C2": 1925,
1991
+ "OCC[NH+]": 1926,
1992
+ "SC(": 1927,
1993
+ "3CCN(": 1928,
1994
+ "CCCC4)": 1929,
1995
+ "CCC(C#N)": 1930,
1996
+ "OCC(": 1931,
1997
+ "(C(=O)CO": 1932,
1998
+ "n(C)c1=O": 1933,
1999
+ "[Se]": 1934,
2000
+ "c4ccc(OC": 1935,
2001
+ "F)cc3F)": 1936,
2002
+ "n(CC)": 1937,
2003
+ "[S-])": 1938,
2004
+ "3)C(=O)": 1939,
2005
+ "N#Cc1cc(": 1940,
2006
+ "CCCNC(N)": 1941,
2007
+ "2)nn1": 1942,
2008
+ "c(Cl)cc(": 1943,
2009
+ "Cc1ccnc(": 1944,
2010
+ "C(C)N": 1945,
2011
+ "oc1C": 1946,
2012
+ "cc1OC)": 1947,
2013
+ "4)CC3": 1948,
2014
+ "c3ncccc3": 1949,
2015
+ "cnc3": 1950,
2016
+ "CCC1(C)": 1951,
2017
+ "c1c(O)": 1952,
2018
+ "Cc1nn(C": 1953,
2019
+ "CCC(C1)": 1954,
2020
+ "c3c[nH]": 1955,
2021
+ "(Cl)cc4": 1956,
2022
+ "CCOc1cc": 1957,
2023
+ "CC(Br)": 1958,
2024
+ "CN(CC1": 1959,
2025
+ "c4cc(F)": 1960,
2026
+ "Cc1c(N": 1961,
2027
+ "Cc1cc(F)": 1962,
2028
+ "C(C)CC)": 1963,
2029
+ "c3o": 1964,
2030
+ "c2c[nH+]": 1965,
2031
+ "[O-])N": 1966,
2032
+ "OC)c2)": 1967,
2033
+ "C2CCC1": 1968,
2034
+ "3)nc2": 1969,
2035
+ "Cc1ccc(S": 1970,
2036
+ "=O)ccc1": 1971,
2037
+ ")cccc2": 1972,
2038
+ "CCSCC1": 1973,
2039
+ "N(C)C)": 1974,
2040
+ "c3nccc": 1975,
2041
+ ")cc3)C2": 1976,
2042
+ "OC)cc1)": 1977,
2043
+ "3)n1": 1978,
2044
+ "CC1=N": 1979,
2045
+ "CC(C1": 1980,
2046
+ "n1cc": 1981,
2047
+ "2CCN(": 1982,
2048
+ "CC(CC)": 1983,
2049
+ "(NN)": 1984,
2050
+ "(C)CC2)": 1985,
2051
+ "F)cc1F)": 1986,
2052
+ "Br)cc(C": 1987,
2053
+ "Cn1c(": 1988,
2054
+ "2)cc1F": 1989,
2055
+ "Cc1nc(C2": 1990,
2056
+ "c1ccnc(N": 1991,
2057
+ "OCC(C)C)": 1992,
2058
+ "(C)C)cc(": 1993,
2059
+ "csc1": 1994,
2060
+ "3)c(": 1995,
2061
+ "SS": 1996,
2062
+ "c2c3c(": 1997,
2063
+ "CCCC2)n1": 1998,
2064
+ "C#CCO": 1999,
2065
+ "c1ccoc1": 2000,
2066
+ "C(O)C2": 2001,
2067
+ "4CC[NH+]": 2002,
2068
+ "(C)CC3)": 2003,
2069
+ "CC1CCCC(": 2004,
2070
+ "c1[nH]c(": 2005,
2071
+ "(F)c1F": 2006,
2072
+ ")N(": 2007,
2073
+ "c1nc(=O)": 2008,
2074
+ "c2c(O)": 2009,
2075
+ "c2nn(": 2010,
2076
+ "CCC(C)C1": 2011,
2077
+ "2C(": 2012,
2078
+ "C3)n": 2013,
2079
+ "cnn2": 2014,
2080
+ "2)C(C)": 2015,
2081
+ "CC4)cc3": 2016,
2082
+ "no2)": 2017,
2083
+ "C2(": 2018,
2084
+ "[O-])cn1": 2019,
2085
+ "=C(C)C": 2020,
2086
+ "n6": 2021,
2087
+ "c1noc(": 2022,
2088
+ "c2cnn(": 2023,
2089
+ ")N1CCN": 2024,
2090
+ "N2CCCC2": 2025,
2091
+ ")cc(=O)": 2026,
2092
+ "o2)n1": 2027,
2093
+ "OC)c3)": 2028,
2094
+ "c5cc(": 2029,
2095
+ "c1O": 2030,
2096
+ "Cc1c[nH]": 2031,
2097
+ "OCC(C)C": 2032,
2098
+ "2CC[NH+]": 2033,
2099
+ "O=S1(=O)": 2034,
2100
+ "C(C)(": 2035,
2101
+ "[Si](": 2036,
2102
+ "c2cc1OC": 2037,
2103
+ "c(C)s1": 2038,
2104
+ "c[nH+]1": 2039,
2105
+ ")cc3)n": 2040,
2106
+ ")C": 2041,
2107
+ "c-3": 2042,
2108
+ "CC2(C": 2043,
2109
+ ")ccc1C": 2044,
2110
+ "N3C(=O)": 2045,
2111
+ "Nc1cn": 2046,
2112
+ "CC1CN": 2047,
2113
+ "Br)c1)": 2048,
2114
+ "CC4)cc3)": 2049,
2115
+ "OC)c2": 2050,
2116
+ "3CCN": 2051,
2117
+ "C1(O)": 2052,
2118
+ "nn(": 2053,
2119
+ "=O)cc2)": 2054,
2120
+ "(C)CCO": 2055,
2121
+ "c4nc(": 2056,
2122
+ "c(NS(=O)": 2057,
2123
+ "c2nn[n-]": 2058,
2124
+ "ccc6": 2059,
2125
+ "c3nnc(": 2060,
2126
+ "c2cnc(N": 2061,
2127
+ "co1": 2062,
2128
+ "4)C3)": 2063,
2129
+ "c(C[NH+]": 2064,
2130
+ "CCn1cc": 2065,
2131
+ ")cc1F": 2066,
2132
+ "c2nnc3": 2067,
2133
+ "OCCO)": 2068,
2134
+ "3)n2": 2069,
2135
+ "n2ccnc2": 2070,
2136
+ "c2cn(C)": 2071,
2137
+ "c1ncn2": 2072,
2138
+ "C1CCC1": 2073,
2139
+ "#N)cc2": 2074,
2140
+ "c2ccnn2": 2075,
2141
+ "N1S(=O)": 2076,
2142
+ "[CH]": 2077,
2143
+ "C2CC2)c1": 2078,
2144
+ "CCC12C": 2079,
2145
+ "c6ccc7": 2080,
2146
+ "no1)": 2081,
2147
+ "C2(C)": 2082,
2148
+ "Nc1nc(": 2083,
2149
+ "2CC3": 2084,
2150
+ "c2co": 2085,
2151
+ "c1ccs": 2086,
2152
+ "CC2)cn1": 2087,
2153
+ "c1csc(C": 2088,
2154
+ "C(C)=": 2089,
2155
+ "CCCN(CCC": 2090,
2156
+ "c(N)n": 2091,
2157
+ "Cc1c(Cl)": 2092,
2158
+ "oc2c1": 2093,
2159
+ "F)cc3)n": 2094,
2160
+ "c1)C(": 2095,
2161
+ "CC1CC2": 2096,
2162
+ "C2CCN": 2097,
2163
+ "c(N)n1": 2098,
2164
+ ")C(C)C": 2099,
2165
+ "(CC=C)": 2100,
2166
+ "COCC(": 2101,
2167
+ "C(O)C(": 2102,
2168
+ "c2ccc(I": 2103,
2169
+ "[O-])c(N": 2104,
2170
+ "c1)CCC2": 2105,
2171
+ "c(OC)c3)": 2106,
2172
+ ")C(": 2107,
2173
+ "2)O1": 2108,
2174
+ "c2cn[nH]": 2109,
2175
+ "CCn1cc(": 2110,
2176
+ "C=CCN1": 2111,
2177
+ "c1)OCCO": 2112,
2178
+ "C)CC3)": 2113,
2179
+ "CCC1=O": 2114,
2180
+ "OCCO4)": 2115,
2181
+ "CNc1n": 2116,
2182
+ "(CC3": 2117,
2183
+ "sc(N": 2118,
2184
+ "2)cs1": 2119,
2185
+ "C1(C(=O)": 2120,
2186
+ "CCC1O": 2121,
2187
+ "(C)C)cc3": 2122,
2188
+ "c1ccnc(": 2123,
2189
+ "=N1": 2124,
2190
+ "Cn1nccc1": 2125,
2191
+ "C1=N": 2126,
2192
+ "O=C(OC": 2127,
2193
+ "c1c(Br)": 2128,
2194
+ "c4ccc(N": 2129,
2195
+ "c3ccc(n4": 2130,
2196
+ "c1sc(": 2131,
2197
+ "COc1c(C)": 2132,
2198
+ "CSc1ccc(": 2133,
2199
+ "c1ccc2n": 2134,
2200
+ "3)CC2)n1": 2135,
2201
+ "CC(O)C(": 2136,
2202
+ "c2ccc(N)": 2137,
2203
+ "c4ccncc": 2138,
2204
+ "CCn1cc(C": 2139,
2205
+ "C1=C": 2140,
2206
+ "[O-])s1": 2141,
2207
+ "(=O)CC1": 2142,
2208
+ "(CCN": 2143,
2209
+ "n3C)": 2144,
2210
+ "c2ccn": 2145,
2211
+ "CC(C)Cn1": 2146,
2212
+ "c1ccn": 2147,
2213
+ ")cc(N": 2148,
2214
+ "2CCC": 2149,
2215
+ "ccc2o1": 2150,
2216
+ "COc1cn": 2151,
2217
+ "c(=O)c(": 2152,
2218
+ "Cc1ncc": 2153,
2219
+ "OC5": 2154,
2220
+ "c4ccco": 2155,
2221
+ "Oc2ccc(C": 2156,
2222
+ "(=O)NCC": 2157,
2223
+ "CC3CCC2": 2158,
2224
+ "CCOc1cc(": 2159,
2225
+ "Nc1nc(N": 2160,
2226
+ "oc2": 2161,
2227
+ "CC2)s1": 2162,
2228
+ "nn2)": 2163,
2229
+ "[NH2+]C3": 2164,
2230
+ "cc2c(": 2165,
2231
+ "Cn1nc(": 2166,
2232
+ "cc2F)": 2167,
2233
+ "n2cnc3": 2168,
2234
+ "CCCCN(": 2169,
2235
+ "c1)OCO2)": 2170,
2236
+ "(Cc2cccc": 2171,
2237
+ "C(c2ccc": 2172,
2238
+ "CC2)cc1C": 2173,
2239
+ "c3ccc(C4": 2174,
2240
+ "C2CCCC2": 2175,
2241
+ "CC(OC)": 2176,
2242
+ "c(C=O)": 2177,
2243
+ "C(C#N)=C": 2178,
2244
+ "CCc1c(C)": 2179,
2245
+ "Nc1cccc(": 2180,
2246
+ "-4": 2181,
2247
+ "c2ccc(n3": 2182,
2248
+ "Br)c2": 2183,
2249
+ "CC1CCCN": 2184,
2250
+ "c3ccs": 2185,
2251
+ "Br)CC1": 2186,
2252
+ ")[NH+]1": 2187,
2253
+ "[O-])cn": 2188,
2254
+ "C#N)cc1": 2189,
2255
+ "c(I)": 2190,
2256
+ "F)cc21": 2191,
2257
+ "c(C#N)c1": 2192,
2258
+ "C[NH+](": 2193,
2259
+ "[NH+]2CC": 2194,
2260
+ "c3)cc2": 2195,
2261
+ "c2cnc(": 2196,
2262
+ "cc(F)": 2197,
2263
+ "c2)cn1": 2198,
2264
+ "c(OC)cc2": 2199,
2265
+ "C2(C)C": 2200,
2266
+ "N1CCCCC1": 2201,
2267
+ "CC(C)C(C": 2202,
2268
+ "B(": 2203,
2269
+ "CC(=O)NC": 2204,
2270
+ "=S)N1": 2205,
2271
+ "cccc2": 2206,
2272
+ "O)C1": 2207,
2273
+ "F)cc(F)": 2208,
2274
+ "c34)": 2209,
2275
+ "C=CCC": 2210,
2276
+ "(C)c(C": 2211,
2277
+ "N1C": 2212,
2278
+ "c1ccc(NC": 2213,
2279
+ "(C(=O)C(": 2214,
2280
+ "Cc1cn2": 2215,
2281
+ "c(C)cc2": 2216,
2282
+ "O)ccc1": 2217,
2283
+ "c(C)o1": 2218,
2284
+ ")c1ccc": 2219,
2285
+ "c1cnn2": 2220,
2286
+ "CC3)cc2)": 2221,
2287
+ "C=C=": 2222,
2288
+ ")N(C)": 2223,
2289
+ "OP": 2224,
2290
+ "[NH+](CC": 2225,
2291
+ "C1CO": 2226,
2292
+ "C2C": 2227,
2293
+ "CC#N)": 2228,
2294
+ "[O-])c(C": 2229,
2295
+ "(=S)": 2230,
2296
+ "CCCC(N": 2231,
2297
+ "FC(F)": 2232,
2298
+ "Cc1nccn1": 2233,
2299
+ "CCc2cc(": 2234,
2300
+ "3c(": 2235,
2301
+ "cccnc2": 2236,
2302
+ "CCC1C": 2237,
2303
+ "Sc2n": 2238,
2304
+ "C=C(C#N)": 2239,
2305
+ "COc1n": 2240,
2306
+ "O)CC1": 2241,
2307
+ "[nH]c(C": 2242,
2308
+ ")N(C)C": 2243,
2309
+ "5)ccc4": 2244,
2310
+ "4)C2)C3)": 2245,
2311
+ ")cc1Cl": 2246,
2312
+ "Br)c(": 2247,
2313
+ "(F)F": 2248,
2314
+ "cc1F)": 2249,
2315
+ "4CCCCC4)": 2250,
2316
+ "C=O)": 2251,
2317
+ "5)cc4": 2252,
2318
+ "OC1(C)C": 2253,
2319
+ "5)c4": 2254,
2320
+ "Cc3ccccc": 2255,
2321
+ "CCc2ccc": 2256,
2322
+ "c1ccc(C#": 2257,
2323
+ ")cc1C": 2258,
2324
+ "O=C(CCC": 2259
2325
+ },
2326
+ "merges": [
2327
+ "c c",
2328
+ "C C",
2329
+ "( C",
2330
+ "c 1",
2331
+ "O )",
2332
+ "= O)",
2333
+ "( =O)",
2334
+ "cc c",
2335
+ "(C )",
2336
+ "c 2",
2337
+ "C (=O)",
2338
+ ") cc",
2339
+ "+ ]",
2340
+ "[ N",
2341
+ "CC C",
2342
+ "c1 cc",
2343
+ "[N H",
2344
+ "c1 ccc",
2345
+ "c (",
2346
+ "C (",
2347
+ "c 3",
2348
+ "2 )",
2349
+ "F )",
2350
+ "C 1",
2351
+ "CC CC",
2352
+ "c2 cc",
2353
+ "O C",
2354
+ "c1cc cc",
2355
+ "N C(=O)",
2356
+ ")cc 1",
2357
+ "CC 1",
2358
+ "(=O) N",
2359
+ "(C) C",
2360
+ "- ]",
2361
+ "C O",
2362
+ "c1ccc (",
2363
+ "[ O",
2364
+ "[O -]",
2365
+ "n 1",
2366
+ "[NH +]",
2367
+ "c2 ccc",
2368
+ "3 )",
2369
+ "(C l",
2370
+ "( F)",
2371
+ "c1cccc c1",
2372
+ "cc ccc",
2373
+ "CC O",
2374
+ "C(=O) N",
2375
+ "2 +]",
2376
+ "[NH 2+]",
2377
+ "c2cc ccc",
2378
+ "( CC",
2379
+ "C 2",
2380
+ "[O-] )",
2381
+ "c n",
2382
+ "c1 n",
2383
+ "S (=O)",
2384
+ "[ n",
2385
+ "N )",
2386
+ "O =",
2387
+ "CC N",
2388
+ "(C (=O)",
2389
+ "[n H",
2390
+ "(C (=O)N",
2391
+ "c 4",
2392
+ "(Cl )",
2393
+ "B r",
2394
+ "CC (C)",
2395
+ "C (C)",
2396
+ "[nH ]",
2397
+ "(C)C )",
2398
+ "CC (",
2399
+ "2 )cc1",
2400
+ "c (C",
2401
+ "3 +]",
2402
+ "[NH 3+]",
2403
+ "c3 ccc",
2404
+ "c2ccc (",
2405
+ "C N",
2406
+ "C (C",
2407
+ "c (C)",
2408
+ "c3 ccccc",
2409
+ "C l",
2410
+ "CC CCC",
2411
+ "C =",
2412
+ "cc (",
2413
+ "c2 )",
2414
+ "c2 n",
2415
+ "cc 1",
2416
+ "OC )",
2417
+ "c2ccccc 2",
2418
+ "O= C(",
2419
+ "c1cc (",
2420
+ "F )cc",
2421
+ "c1ccc (C",
2422
+ "CC (=O)N",
2423
+ ") N",
2424
+ "n 2",
2425
+ "CC 2",
2426
+ "[N +]",
2427
+ "2) c1",
2428
+ "C )",
2429
+ "[NH3+] )",
2430
+ "CC [NH+]",
2431
+ "Br )",
2432
+ "4 )",
2433
+ "c( N",
2434
+ "CCC (",
2435
+ "= O",
2436
+ "(Cl )cc",
2437
+ "(F) (F)",
2438
+ "c1 )",
2439
+ "c (=O)",
2440
+ "c3 cc",
2441
+ "[N+] (=O)",
2442
+ "C c1ccc(",
2443
+ "CC (=O)",
2444
+ "c2cc cc",
2445
+ "c1ccc 2",
2446
+ "c1cccc (",
2447
+ "CC 2)",
2448
+ "N 1",
2449
+ "C( F)",
2450
+ "C 3",
2451
+ "s 1",
2452
+ "c3ccccc 3",
2453
+ "C [NH+]",
2454
+ "CCC 1",
2455
+ "ccc 2",
2456
+ "C c1",
2457
+ "n c(",
2458
+ "n c1",
2459
+ "O CC",
2460
+ "C c1cc",
2461
+ "CCCC CCCC",
2462
+ "C( O)",
2463
+ "N 2",
2464
+ "= C",
2465
+ "c3ccc (",
2466
+ "OC (C)",
2467
+ "C c1n",
2468
+ "c3 )",
2469
+ "CO C(=O)",
2470
+ "Cl )",
2471
+ "c (Cl)",
2472
+ "# N)",
2473
+ "C(F) (F)",
2474
+ "c 5",
2475
+ "2) CC1",
2476
+ "(CC )",
2477
+ "O C(=O)",
2478
+ "( O)",
2479
+ "CC [NH2+]",
2480
+ "1 )",
2481
+ "cc 2",
2482
+ "= C(",
2483
+ "C [NH2+]",
2484
+ ")cc c1",
2485
+ "CCN (",
2486
+ "O=C( N",
2487
+ "F )cc1",
2488
+ "(F)(F) F)",
2489
+ "n n",
2490
+ "= N",
2491
+ ")cc 2",
2492
+ "CO c1ccc(",
2493
+ "c4 ccccc",
2494
+ "2) C1",
2495
+ "C S",
2496
+ "CC(C) (C)",
2497
+ "CCCC 1",
2498
+ "c( F)",
2499
+ "c1 cn",
2500
+ "CCO C(=O)",
2501
+ "c2cc (",
2502
+ "CCC N",
2503
+ "CCC (C)",
2504
+ "CC 3)",
2505
+ "n c2",
2506
+ "NC(=O) N",
2507
+ "C (C)C",
2508
+ "= S",
2509
+ "c4 ccc",
2510
+ "CC( O)",
2511
+ "CC 3",
2512
+ "o 1",
2513
+ "c s",
2514
+ "CCC O",
2515
+ "CCC 2",
2516
+ "(C (C)",
2517
+ "(Cl )cc1",
2518
+ "c1ccc2 c(",
2519
+ "c n1",
2520
+ "CC (C",
2521
+ "C(=O)N 1",
2522
+ "( N)",
2523
+ "c2 c(",
2524
+ "[ S",
2525
+ "C n1",
2526
+ "= [NH+]",
2527
+ "C c1ccc",
2528
+ "CCCCC 1",
2529
+ "n 3",
2530
+ "C c1cc(",
2531
+ "O= C(C",
2532
+ "c2 c1",
2533
+ "n cc",
2534
+ "c1cc (C",
2535
+ "2) n1",
2536
+ "c1cccc (C",
2537
+ "CCC (C",
2538
+ "c2ccc 3",
2539
+ "CC )",
2540
+ "c2 cn",
2541
+ "c (C(=O)N",
2542
+ "c1 2",
2543
+ ")N 1",
2544
+ "[nH +]",
2545
+ "[S i",
2546
+ "(CC (=O)N",
2547
+ "c3cc cc",
2548
+ "ccc 3",
2549
+ "C NC(=O)",
2550
+ "[NH+] 1",
2551
+ "CC =",
2552
+ ")cc (",
2553
+ "CC (C)C",
2554
+ "O C1",
2555
+ "n (",
2556
+ "c2cc cc(",
2557
+ "[Si ]",
2558
+ "[NH+] 2",
2559
+ "OC 2",
2560
+ "CC1 )",
2561
+ "c4ccccc 4",
2562
+ "CC n1",
2563
+ "cc cc",
2564
+ "c2ccc (C",
2565
+ "c(C) c1",
2566
+ "(C) cc",
2567
+ "N #",
2568
+ ")cc 2)",
2569
+ "CC NC(=O)",
2570
+ "c1 c(",
2571
+ "CC 2)cc1",
2572
+ "CC S",
2573
+ "3) n",
2574
+ "OC (C",
2575
+ "=O) cc1",
2576
+ "c1cc 2",
2577
+ "c2 )cc1",
2578
+ "n (C)",
2579
+ "5 )",
2580
+ "N S(=O)",
2581
+ "NC(=O) C(",
2582
+ "c1 C",
2583
+ "c [nH]",
2584
+ "N C(",
2585
+ "( [O-])",
2586
+ "c3 n",
2587
+ "(C) C(=O)",
2588
+ "c( OC)",
2589
+ "# N",
2590
+ ")cc 3)",
2591
+ "CCCC 2",
2592
+ "CN 1",
2593
+ "c( N)",
2594
+ "C c1ccc(C",
2595
+ "(C) (=O)",
2596
+ "C (C)C)",
2597
+ "c 6",
2598
+ "O= C1",
2599
+ "n c(N",
2600
+ "C[NH+] 1",
2601
+ ")cc 3",
2602
+ ") C(=O)",
2603
+ "c (C(=O)",
2604
+ "C 2)",
2605
+ "CC 2)c1",
2606
+ "c1cccc 2",
2607
+ "Br )cc1",
2608
+ "N (",
2609
+ "cc (C",
2610
+ "C1 CC1",
2611
+ "S (C)(=O)",
2612
+ "n c(C",
2613
+ "CC C(=O)",
2614
+ "ccc (",
2615
+ "CC C(=O)N",
2616
+ "[O-] )cc1",
2617
+ "c( NC(=O)",
2618
+ "C( N)",
2619
+ "CN (",
2620
+ "CCN 1",
2621
+ "c1ccc( N",
2622
+ "c3 c(",
2623
+ "C 4",
2624
+ "[O-]) c1",
2625
+ "O C(",
2626
+ "c1ccc (C)",
2627
+ "(C(=O)N 2",
2628
+ ")cc 2)cc1",
2629
+ "[nH] 1",
2630
+ "c (Cl)cc",
2631
+ "O CCO",
2632
+ "C1 =O",
2633
+ "C c1cccc(",
2634
+ "= C2",
2635
+ "n (C",
2636
+ "CCC [NH+]",
2637
+ "CCCC (",
2638
+ "=S )",
2639
+ "O 1",
2640
+ "n n1",
2641
+ "CCC 3",
2642
+ "Br) c1",
2643
+ "NC(=O) C1",
2644
+ "[Si] (C)",
2645
+ "(CC (=O)",
2646
+ "cc 3",
2647
+ "OC O",
2648
+ ") C1",
2649
+ "c4ccc (",
2650
+ "N1 C(=O)",
2651
+ "n 2)",
2652
+ "c2) c1",
2653
+ "C(C) (C)C",
2654
+ "n c3",
2655
+ "O CC(=O)N",
2656
+ "c2ccc3 c(",
2657
+ "c4 )",
2658
+ "=S )N",
2659
+ "N c1n",
2660
+ "Cc1 cn",
2661
+ "c5 ccccc",
2662
+ "NC(=O) C2",
2663
+ "(N) =O)",
2664
+ "CC S(=O)",
2665
+ "F)cc 2",
2666
+ "P (=O)",
2667
+ "ccccc 2",
2668
+ "(Cl) c1",
2669
+ "O) cc1",
2670
+ "c1ccc(C 2",
2671
+ "CC c1n",
2672
+ "C(C) (C)",
2673
+ "c(Cl) c1",
2674
+ "c2ccc( N",
2675
+ "C( N",
2676
+ "n cn",
2677
+ "(C 2",
2678
+ "c( S",
2679
+ "c3 cc(",
2680
+ "( CCC",
2681
+ "C #",
2682
+ "c(F) c1",
2683
+ "c2 s",
2684
+ "3) C2",
2685
+ "C S(=O)",
2686
+ "CCO CC1",
2687
+ "CC1 (C)",
2688
+ "OCC )",
2689
+ "CN (C(=O)",
2690
+ "c( O)",
2691
+ "n cc1",
2692
+ "cc c1",
2693
+ "CO c1cc(",
2694
+ "3 CCCC",
2695
+ "Cc1cc (C)",
2696
+ "N2 C(=O)",
2697
+ "CC (CC",
2698
+ "CC[NH+] 1",
2699
+ "c1 =O",
2700
+ "N =",
2701
+ "C 3)",
2702
+ "c s1",
2703
+ "n 3)",
2704
+ "c3ccc 4",
2705
+ "I )",
2706
+ "c2cc 3",
2707
+ "CC (C)C)",
2708
+ "CC 4",
2709
+ "C )cc1",
2710
+ "c2n c(",
2711
+ "s 2)",
2712
+ "C(F)(F) F",
2713
+ "C= C",
2714
+ "C(=O)N C(",
2715
+ "c(C 2",
2716
+ "c2) CC1",
2717
+ "c1n cc",
2718
+ "(C) C1",
2719
+ "(C O)",
2720
+ "CC(=O)N 1",
2721
+ "(C) c1",
2722
+ "CCC( O)",
2723
+ "c4 cc",
2724
+ "C(=O)N 2",
2725
+ "s c1",
2726
+ "( [NH3+])",
2727
+ "CO C1",
2728
+ "[O-]) C1",
2729
+ "OC )cc1",
2730
+ "c1ccc( O",
2731
+ "C(=O)N (",
2732
+ "CO c1ccc",
2733
+ "(=O)N 2",
2734
+ "C c1cccc",
2735
+ "(C)C )cc1",
2736
+ "n1 )",
2737
+ "3 )cc1",
2738
+ "=C( N",
2739
+ "l )",
2740
+ "CCC =",
2741
+ "(F) c1",
2742
+ "c(C) cc",
2743
+ "c2n cc",
2744
+ "(Cl)cc 2",
2745
+ "(C #N)",
2746
+ "OC 3",
2747
+ "n 2)cc1",
2748
+ "ccc2 1",
2749
+ "c1 s",
2750
+ "(C)C (C)",
2751
+ "(C(=O)N C",
2752
+ "CN (C)",
2753
+ "[NH2+] C",
2754
+ "OC) c1",
2755
+ "C(C #N)",
2756
+ "c1n c(",
2757
+ "CC 4)",
2758
+ "CO c1cc",
2759
+ "( N",
2760
+ "CCCCC 2",
2761
+ "C1 =",
2762
+ "F)cc 2)",
2763
+ "C1 )",
2764
+ "s1 )",
2765
+ "n c(C)",
2766
+ "ccccc 3",
2767
+ "=O) c1",
2768
+ "C OC",
2769
+ "o 2)",
2770
+ "CO c1cc(C",
2771
+ "c2cc (Cl)",
2772
+ "CCO CCO",
2773
+ "CCCC O",
2774
+ "c3cc cc(",
2775
+ "CCN (CC)",
2776
+ "c2ccc( OC",
2777
+ "c(C (C)",
2778
+ "N (C",
2779
+ "N (C)",
2780
+ "F)cc c1",
2781
+ "C(C O)",
2782
+ "N (C(=O)",
2783
+ "[NH2+] C1",
2784
+ "c 7",
2785
+ "OC(C) =O)",
2786
+ "c3 cn",
2787
+ "n 4",
2788
+ "CCN (C",
2789
+ "CN (C",
2790
+ "(CC O)",
2791
+ "S C",
2792
+ "c5ccccc 5",
2793
+ "= C1",
2794
+ "c1 cs",
2795
+ "c1ccc( F)",
2796
+ "o c(",
2797
+ "(C)C 2",
2798
+ "C (C(=O)",
2799
+ "c(N 2",
2800
+ "CCO CC2)",
2801
+ "CC c1ccc(",
2802
+ "( CCCC",
2803
+ "c2cc (C",
2804
+ "c2ccc( Br",
2805
+ "c3ccc (C",
2806
+ "[nH] c(",
2807
+ "3) CC2)",
2808
+ "NC(=O) C",
2809
+ "O CCC",
2810
+ "( c2ccccc",
2811
+ "n o1",
2812
+ "(=O)N 1",
2813
+ "c(OC) c1",
2814
+ "c2cc (C)",
2815
+ "ccc 4",
2816
+ "C(O) C(O)",
2817
+ "CCO C1",
2818
+ "O CC1",
2819
+ "c2ccc (C)",
2820
+ "c1cc2 c(",
2821
+ "c2cccc 3",
2822
+ "O) cc",
2823
+ "o 1)",
2824
+ "=O) cc",
2825
+ "c [nH+]",
2826
+ "CC OC",
2827
+ "O= S(=O)",
2828
+ "CCCC (C)",
2829
+ "N =C",
2830
+ "CCC n1",
2831
+ "3 CCO",
2832
+ "c4 cccc",
2833
+ "c2 C)",
2834
+ "c2n cn",
2835
+ "C( =",
2836
+ "c( Br)",
2837
+ "CCC 4",
2838
+ "c2 cs",
2839
+ "c2cccc (C",
2840
+ "c( O",
2841
+ "[n +]",
2842
+ "CCCCC 2)",
2843
+ "(C)C) c1",
2844
+ "C1 CCCCC1",
2845
+ "F)cc 3)",
2846
+ ") C(=O)N",
2847
+ "Cc1cc (C",
2848
+ "(Cl)cc c1",
2849
+ "CCCC 2)",
2850
+ "c1n nc(",
2851
+ "c1 2)",
2852
+ "n c2)",
2853
+ "C( =C",
2854
+ "c1 c(C)",
2855
+ "(Cl)cc 2)",
2856
+ "ccccc 6",
2857
+ "C1 2",
2858
+ "% 1",
2859
+ "C( NC(=O)",
2860
+ "OC(C O)",
2861
+ "(CC 2",
2862
+ "c2cc (F)",
2863
+ "c1ccc( N2",
2864
+ "o 2)cc1",
2865
+ "c1ccc s1",
2866
+ "[O-] )cc",
2867
+ "C2 =O)",
2868
+ "c1ccc nc1",
2869
+ "=C( N)",
2870
+ "C= C1",
2871
+ "c1cc( N",
2872
+ "3 CCCCC",
2873
+ "CCC )",
2874
+ "C( =S)N",
2875
+ "c(C #N)",
2876
+ "c2 1",
2877
+ "[N -]",
2878
+ "CC O)",
2879
+ "n2 cc",
2880
+ "c( S(=O)",
2881
+ "CCCN (",
2882
+ "C (C(=O)N",
2883
+ "c1n cn",
2884
+ "n1 C",
2885
+ "c2ccc (F)",
2886
+ "C[NH+] 2",
2887
+ "NC(=O) CS",
2888
+ "c2n nc(",
2889
+ "( O",
2890
+ "n2 cn",
2891
+ "(C (C)C)",
2892
+ "c3ccccc 2",
2893
+ "n 2)c1",
2894
+ "[NH2+] C2",
2895
+ "Cc1cc 2",
2896
+ "N) =O)",
2897
+ "s 3)",
2898
+ "O C(=O)N",
2899
+ "C1 CCC",
2900
+ "F)cc 3",
2901
+ "CCCCC 1)",
2902
+ "O c1ccc(",
2903
+ "(C)C )cc",
2904
+ "N C(C)",
2905
+ "CN1 C(=O)",
2906
+ "= [NH2+]",
2907
+ "C1 O",
2908
+ "c( OC",
2909
+ "c6 ccccc6",
2910
+ "S 1",
2911
+ "CC1 (",
2912
+ "S CC(=O)N",
2913
+ "c1 [nH]",
2914
+ "c2) C1",
2915
+ "c2 c(C)",
2916
+ "= CC(=O)",
2917
+ "c3ccc4 c(",
2918
+ "O C(F)(F)",
2919
+ "(N) (=O)",
2920
+ "CCC (CC)",
2921
+ "c1 c[nH]",
2922
+ "c o",
2923
+ "CC (C(=O)",
2924
+ "C O)",
2925
+ "n o",
2926
+ "CCN (CC",
2927
+ "s 2)cc1",
2928
+ "O CCCC",
2929
+ "C(=O) OC",
2930
+ "c2 n1",
2931
+ "C2 )cc1",
2932
+ "F) c1",
2933
+ "nc1 2",
2934
+ "Br )cc",
2935
+ "N C1",
2936
+ "CCN C(N",
2937
+ "3) CC1",
2938
+ "c2) n1",
2939
+ "c2 [nH]",
2940
+ "C= C(",
2941
+ "3CCO CC3)",
2942
+ "= C(O)",
2943
+ "n cc2",
2944
+ "C #N)",
2945
+ "c1cc ncc1",
2946
+ "c( Br)c1",
2947
+ "CCCC CC1",
2948
+ ")cc2 1",
2949
+ "- 2",
2950
+ "C 2)c1",
2951
+ "Cc1 cs",
2952
+ "N 3",
2953
+ "O= [N+]",
2954
+ "Br )ccc1",
2955
+ "c2 =O)",
2956
+ "C c1ccc2",
2957
+ "C c2ccc",
2958
+ "NC(=O) CO",
2959
+ "C1 CCCC1",
2960
+ "3) c1",
2961
+ "c(F) c(F)",
2962
+ "C[NH+] (C",
2963
+ "C) c1",
2964
+ "c1cc (C)",
2965
+ "C #N",
2966
+ "NC( =S)N",
2967
+ "F)cc1 )",
2968
+ "n nc1",
2969
+ "CC(C) O",
2970
+ "c5 ccc",
2971
+ "O) c1",
2972
+ ")cc 2)c1",
2973
+ "S (N)(=O)",
2974
+ "CC2) CC1",
2975
+ "C 5",
2976
+ "CC #",
2977
+ "4) CC3)",
2978
+ "O [Si](C)",
2979
+ "CCCC (C",
2980
+ "CCN 2",
2981
+ "CC1 2",
2982
+ "c1c( F)cc",
2983
+ "n n2",
2984
+ "CO c1ccc2",
2985
+ "CC( N",
2986
+ "c2n c(C",
2987
+ "O=C(N C1",
2988
+ "C= C(C)",
2989
+ "cc( N",
2990
+ "n (CC",
2991
+ "3 CC3)",
2992
+ "n 2)CC1",
2993
+ "o c(C",
2994
+ "n c3)",
2995
+ "c4 c(",
2996
+ "CC1 CCC",
2997
+ "(Cl)cc 3",
2998
+ "Cl )cc1",
2999
+ "c2cc( Br)",
3000
+ "O C(F)",
3001
+ "c2cc ncc",
3002
+ "n [nH]",
3003
+ "O CC2",
3004
+ "(Cl)cc 3)",
3005
+ "Cc1n c(",
3006
+ "CN 2",
3007
+ "nc( N)",
3008
+ "C2 =O)cc1",
3009
+ "nc2 c1",
3010
+ "CCCC (=O)",
3011
+ "(F) F)",
3012
+ "C c2ccccc",
3013
+ "(C) CC1",
3014
+ "c3 s",
3015
+ "NC(=O) N2",
3016
+ "CCCC 1)",
3017
+ "O P(=O)",
3018
+ "N c1ccc(",
3019
+ "C(N) =O",
3020
+ ")cc 2)CC1",
3021
+ "6 )",
3022
+ "CC2) n1",
3023
+ "n cn1",
3024
+ "CCC S",
3025
+ "O CC(=O)",
3026
+ "2)C1 =O",
3027
+ "CC2 )ccc1",
3028
+ ")cc 4",
3029
+ ")cc cc1",
3030
+ "C(=O)N (C",
3031
+ "s c2c1",
3032
+ "C(=O) C1",
3033
+ "o 3)",
3034
+ "c1cc (Cl)",
3035
+ "OC) c(OC)",
3036
+ "Cc1ccc( N",
3037
+ "c3ccc( OC",
3038
+ "c2cc 1",
3039
+ "CC1 =",
3040
+ "C( CC)",
3041
+ "CC =C",
3042
+ "c2c( F)cc",
3043
+ "C( OC",
3044
+ "c1) OCO",
3045
+ "c3 ncc",
3046
+ "N (CC",
3047
+ "c1n c(N",
3048
+ "c(=O) n2",
3049
+ "c3ccc( N",
3050
+ "c 8",
3051
+ "c3 C)",
3052
+ "CC1 (C",
3053
+ "CC1 C",
3054
+ "c1) C(=O)",
3055
+ "cc 4",
3056
+ "CN (CC",
3057
+ "ccc2 c1",
3058
+ "ccccc 4",
3059
+ "c4ccc 5",
3060
+ "C(=O)N C1",
3061
+ "(C)C 3",
3062
+ "CC( =",
3063
+ "CC (F)(F)",
3064
+ "cc( Br)",
3065
+ "(C(=O) C2",
3066
+ "CC[NH+] 2",
3067
+ "CC( O",
3068
+ "C(C) O",
3069
+ "3) C1",
3070
+ "CCC (CC",
3071
+ "[O-]) CC1",
3072
+ "CCC= CCC=",
3073
+ "[NH3+] C1",
3074
+ "c(=O) n1",
3075
+ "cn 2",
3076
+ "[NH+] 3",
3077
+ "[NH2+] 1",
3078
+ "CCC 2)",
3079
+ "( OC)",
3080
+ "cc nc1",
3081
+ "c5 ccc(",
3082
+ "CC2) C1",
3083
+ "NC(=O) N1",
3084
+ "c1 N",
3085
+ "(C(C) =O)",
3086
+ "CCC( N",
3087
+ "c3 c2",
3088
+ "CC l)",
3089
+ "C (Cl)",
3090
+ "c2ccc s2)",
3091
+ "Cc1 c(",
3092
+ "CC(C) C1",
3093
+ "3CCCC 3",
3094
+ "C(N) =O)",
3095
+ "[O-] )cc2",
3096
+ "n c4",
3097
+ "( c3ccccc",
3098
+ "[NH2+] C)",
3099
+ "(C)C) C1",
3100
+ "n1 cn",
3101
+ "O 2",
3102
+ "c3 nn",
3103
+ "OC(C) (C)",
3104
+ "c1n c(C",
3105
+ "c3cccc 4",
3106
+ "c2ccc nc2",
3107
+ ") c1ccc(",
3108
+ "CC1 (C)C",
3109
+ "c3cc 4",
3110
+ "n c1)",
3111
+ "CO c1cccc",
3112
+ "N C2",
3113
+ "Cc1 s",
3114
+ "CCCC 3",
3115
+ "CC(=O)N 2",
3116
+ "=N NC(=O)",
3117
+ "= C(C)",
3118
+ "[NH+] (C)",
3119
+ "CC(CC (C",
3120
+ "= C(C",
3121
+ "# C",
3122
+ "F) CC1",
3123
+ "=C (C(=O)",
3124
+ "C(=O) C(",
3125
+ "s c2",
3126
+ "c1ccc o1",
3127
+ "[nH] c1",
3128
+ "(=O) C1",
3129
+ "[NH2+] C(",
3130
+ "c2cc3 c(",
3131
+ "=C( [O-])",
3132
+ "c3cc (Cl)",
3133
+ "c1cccc n1",
3134
+ "o c1",
3135
+ "( NC(=O)",
3136
+ "CC(C) (O)",
3137
+ "c1ncc cc1",
3138
+ "OCC (C",
3139
+ "CCO 1",
3140
+ "3 C)",
3141
+ "Cl) c1",
3142
+ "CCC (C)C",
3143
+ "S c1n",
3144
+ "ccccc 7",
3145
+ "cccc c12",
3146
+ "#N )cc1",
3147
+ "O c2ccc(",
3148
+ "CC OC2",
3149
+ "c3 nc(",
3150
+ "C(=O) C2",
3151
+ "Cc1n o",
3152
+ "c5 )",
3153
+ "n2 ccc",
3154
+ "Cc1 [nH]",
3155
+ "3)n 2)cc1",
3156
+ "(Cl) c2",
3157
+ "C1 C",
3158
+ "CC(C O)",
3159
+ "O CC(O)",
3160
+ "= C(C#N)",
3161
+ "CCCO 1",
3162
+ ")cc 4)",
3163
+ "n1 2",
3164
+ "[nH] c2c1",
3165
+ "cc cc1",
3166
+ "CCC N1",
3167
+ "c1 c(C",
3168
+ "Cc1n n(C)",
3169
+ "c3cc (C)",
3170
+ "N C",
3171
+ "2) nc1",
3172
+ "C3 =O)",
3173
+ "OC (C)C)",
3174
+ "CCO CC1)",
3175
+ "OC (C)C",
3176
+ "(Cl) c2)",
3177
+ "4 CCCC",
3178
+ "c2 c[nH]",
3179
+ "C= CC(=O)",
3180
+ "O C(=O)N1",
3181
+ "C1 CCN",
3182
+ "S )",
3183
+ "c2n c(N",
3184
+ "[NH2+] CC",
3185
+ "C= N",
3186
+ "c %1",
3187
+ "CCN (C)",
3188
+ "=[NH2+] )",
3189
+ "c(C) n1",
3190
+ "C( =[NH+]",
3191
+ "n [nH]1",
3192
+ "(Cl)cc1 )",
3193
+ "= CC=",
3194
+ "c2) OCO",
3195
+ "cc2 1",
3196
+ "c1cc( Br)",
3197
+ "F)cc1 F",
3198
+ "c1cc (F)",
3199
+ "CCCC )",
3200
+ "(F) c2)",
3201
+ "CCC( O",
3202
+ "C(=O) O",
3203
+ ")N (C",
3204
+ "(C(=O) C",
3205
+ "=C2 S",
3206
+ "= C)",
3207
+ "NC(=O) CC",
3208
+ "NC(=O) C3",
3209
+ "c(Cl) c2)",
3210
+ "s 2)c1",
3211
+ "C= CC=",
3212
+ "CCN S(=O)",
3213
+ "CC( N)=O)",
3214
+ "c(Cl) c3)",
3215
+ "OC)c1 OC",
3216
+ "(C) (C)",
3217
+ "OCCO 2",
3218
+ "F C(F)(F)",
3219
+ "c4 cc(",
3220
+ "3CCCC 3)",
3221
+ "Cl) CC1",
3222
+ "ccccc2 c1",
3223
+ "ccc (C",
3224
+ "cn 2)",
3225
+ "CO CCO",
3226
+ "CC1 (O)",
3227
+ "nc(N 2",
3228
+ "[nH] 1)",
3229
+ "CC(C) C(",
3230
+ "o 2)c1",
3231
+ "F) C1",
3232
+ "c7 ccccc7",
3233
+ "cc (C)",
3234
+ "F)cc (",
3235
+ "c3 c(C)",
3236
+ "[nH] 2)",
3237
+ "(Cl) c3)",
3238
+ "c2ccc o2)",
3239
+ "S C1",
3240
+ "n2 c(",
3241
+ "C2 =O",
3242
+ "nn 3",
3243
+ "o n1",
3244
+ "[nH] c2",
3245
+ "CC 5",
3246
+ "4) c3)",
3247
+ "C) CC1",
3248
+ "C2 CCCC",
3249
+ "c3ccc( Br",
3250
+ "=C2 C(=O)",
3251
+ "3CCCCC 3)",
3252
+ "c2ccc( O",
3253
+ "2 )ccc1",
3254
+ "(C(=O)N 3",
3255
+ "c3cc ncc",
3256
+ "CC1 CCCC",
3257
+ "n2 c(=O)",
3258
+ "N) c1",
3259
+ ")N C1",
3260
+ "CN S(=O)",
3261
+ "C2 CC2)",
3262
+ "Cn1 cn",
3263
+ "CC(C) (C",
3264
+ "C c1cccc2",
3265
+ "(C)C) CC1",
3266
+ "cc cc(",
3267
+ "cn c1",
3268
+ "[ C",
3269
+ "4 CCO",
3270
+ "n c2cccc",
3271
+ "[nH] c1=O",
3272
+ "CC c1cc",
3273
+ "C= C(C",
3274
+ "c1) C1",
3275
+ "F)cc (C",
3276
+ "CC c1",
3277
+ "c(C) c(C)",
3278
+ "C 2)n1",
3279
+ "CCC S(=O)",
3280
+ "OC) c(",
3281
+ "N =C(",
3282
+ "(C) (C)C)",
3283
+ "(C) O",
3284
+ "= CC(=O)N",
3285
+ "CCC1 2",
3286
+ "c1cccc( N",
3287
+ "Cc1n c(C",
3288
+ "= C3",
3289
+ "4) c3",
3290
+ "c2cc( N",
3291
+ "C3 CC3)",
3292
+ "NC( =S)",
3293
+ "C= CC1",
3294
+ "CCC O)",
3295
+ "CCCC CC",
3296
+ "Cc1cc( N",
3297
+ "c2cc s",
3298
+ "CC(C) CC(",
3299
+ "S C)",
3300
+ "C(C) =O)",
3301
+ "= [N+]",
3302
+ "[NH+] (C",
3303
+ "CO CC1",
3304
+ "3) c2",
3305
+ "c2 c(C)cc",
3306
+ "s 2)CC1",
3307
+ "CC l",
3308
+ "CCC2 (",
3309
+ ") (",
3310
+ "c2n n",
3311
+ "C c2ccc(",
3312
+ "=C(N) N)",
3313
+ "2) C(=O)",
3314
+ "c1cc(C 2",
3315
+ "(C(=O) OC",
3316
+ "cc1 Cl",
3317
+ "CO c1",
3318
+ "C 4)",
3319
+ "CCC 3)",
3320
+ ")cc n1",
3321
+ "c3cccc (C",
3322
+ "CC(=O)N (",
3323
+ "c1ccc( OC",
3324
+ "CCC c1n",
3325
+ "c3ccc s3)",
3326
+ "CC( N)",
3327
+ "cc n1",
3328
+ "Br )cc(",
3329
+ "[O-]) c(",
3330
+ "c2 o",
3331
+ "C( OC(=O)",
3332
+ ") NC(=O)",
3333
+ "2)cc1 OC",
3334
+ "C= CCO",
3335
+ ")ccc1 OC",
3336
+ "c2ncc cc2",
3337
+ "2) s1",
3338
+ "O=S(=O) (",
3339
+ "c2ccc(N 3",
3340
+ "4 )cc3",
3341
+ "[nH] c3",
3342
+ "(C)C) n1",
3343
+ "CS c1n",
3344
+ "C2 CCC",
3345
+ "C= CC",
3346
+ "c3n cn",
3347
+ "C 2)CC1",
3348
+ "C2 O)",
3349
+ "c3ccc (C)",
3350
+ "c(=O) o",
3351
+ "O=C(C 1",
3352
+ "[nH+] 1",
3353
+ ")cc2 c1",
3354
+ "C( OC)",
3355
+ "c (Cl)cc1",
3356
+ "[n -]",
3357
+ "C 2)C1",
3358
+ "N N",
3359
+ "(C(=O) O",
3360
+ "c1ccc s1)",
3361
+ "[ P",
3362
+ "c1ccc o1)",
3363
+ "O CCCO",
3364
+ "(F) c3)",
3365
+ "Cc1 o",
3366
+ "CC(C) n1",
3367
+ "C= CCn1",
3368
+ "cc1 C",
3369
+ "c4 n",
3370
+ "CC1 CC1",
3371
+ "c2n nc(C",
3372
+ "c(N) c1",
3373
+ "CCCC CCC",
3374
+ ")N1 CCN(",
3375
+ "c(N 3",
3376
+ "c(O) c1",
3377
+ "c3cc (F)",
3378
+ "(C) CC",
3379
+ "C) C1",
3380
+ "2)cc1 )",
3381
+ ")cc (C",
3382
+ "C3 CCCCC",
3383
+ "c3ccc (F)",
3384
+ "c(=O) c1",
3385
+ "c1)OCO 2",
3386
+ "c4 cn",
3387
+ "c2ccc( O)",
3388
+ "n 2)C1",
3389
+ "cc (Cl)",
3390
+ "c(F) c2)",
3391
+ "C1 (",
3392
+ "O S(=O)",
3393
+ "NC(C) =O)",
3394
+ "C( CC(=O)",
3395
+ "Cn1 cc(",
3396
+ "(C)cc 2)",
3397
+ "S1 (=O)",
3398
+ "c2ccc (CC",
3399
+ "(CC) CC",
3400
+ "2)c1 C",
3401
+ "c1n cc(",
3402
+ "(C)cc 3)",
3403
+ "CC(=O) O",
3404
+ "(F) c2",
3405
+ "4) ccc3",
3406
+ "ccc 5",
3407
+ "c1ccc(C N",
3408
+ "c(=O) c2",
3409
+ "c1cn cc(",
3410
+ "c2n c(C)",
3411
+ "[P +]",
3412
+ "c3 [nH]",
3413
+ "[n+] 1",
3414
+ "CCO c1ccc",
3415
+ "C [NH3+]",
3416
+ "4 CCCCC",
3417
+ "4 )cc3)",
3418
+ "C(=O)N C",
3419
+ "n cc(",
3420
+ "3) C1)",
3421
+ "CC (F)",
3422
+ "C(C) C1",
3423
+ "N c1cc",
3424
+ "[O-]) c2)",
3425
+ "C1 =O)",
3426
+ "(C) C(",
3427
+ "O 2)",
3428
+ "=N N",
3429
+ "CCCC 3)",
3430
+ "c2 c(C",
3431
+ "c2n ccc",
3432
+ "nc(C 2",
3433
+ "C1 (C",
3434
+ "c(N C",
3435
+ "nc2 n1",
3436
+ "c2s ccc2",
3437
+ "n 5",
3438
+ "= [N-]",
3439
+ "[S -]",
3440
+ "CC OC(C",
3441
+ "CO P(=O)",
3442
+ "c1ccc( n2",
3443
+ "( Br)",
3444
+ "c3cc (C",
3445
+ "c1 o",
3446
+ "=[NH+] O)",
3447
+ "CC1 CCN",
3448
+ "CC2 CCC1",
3449
+ ")ccc1 Cl",
3450
+ "C(=O)N C2",
3451
+ "I) c1",
3452
+ "c3ccc o3)",
3453
+ "nn n1",
3454
+ "CCC(C O)",
3455
+ "n2 c1",
3456
+ "nc2 1",
3457
+ "n3 cn",
3458
+ "cn n1",
3459
+ "n(C 2",
3460
+ "c4ccc (C",
3461
+ ") S(=O)",
3462
+ "cc1 2",
3463
+ "N c1cc(",
3464
+ "I )cc1",
3465
+ "nc1 C",
3466
+ "NC( N)",
3467
+ "cn c2",
3468
+ "(CC C(=O)",
3469
+ "n 4)",
3470
+ "c2cc( OC)",
3471
+ "2) cn1",
3472
+ "CC c1cccc",
3473
+ "Cn1 c(=O)",
3474
+ ")cc 2)n1",
3475
+ "C1 C2",
3476
+ "(CC) CC)",
3477
+ "(CC (C)C)",
3478
+ "C1 CCN(",
3479
+ "C( =N",
3480
+ "P (",
3481
+ "O=C(C O",
3482
+ "(F) c1)",
3483
+ "c2n cc(",
3484
+ "C= C(C)C",
3485
+ "NC(=O) CN",
3486
+ "CC c1cn",
3487
+ "n 2)n1",
3488
+ "c( [O-])",
3489
+ "CC1 O",
3490
+ "(C)cc (C)",
3491
+ "C(=O) OCC",
3492
+ "(C)cc c1",
3493
+ "Cc1 c(C",
3494
+ "S C2",
3495
+ ")ccc1 F",
3496
+ "cn 2)cc1",
3497
+ "c1n c2c(",
3498
+ "CO c1cc2",
3499
+ "CC2 1",
3500
+ "CC c1ccc",
3501
+ "C(F) F",
3502
+ "CC1 CN(",
3503
+ "c2n nn",
3504
+ "(Cl) c1)",
3505
+ "c4cc cc(",
3506
+ "CCO C(",
3507
+ "c(F)c1 F",
3508
+ "Cl) C1",
3509
+ "c2 C)cc1",
3510
+ "(Cl) c(",
3511
+ "CCCC (O)",
3512
+ "OC 4",
3513
+ "c1cs c(",
3514
+ "CC 5)",
3515
+ "4CCO CC4)",
3516
+ "CC (Cl)",
3517
+ "C(O) C1",
3518
+ "3 )cc2",
3519
+ "CCCC n1",
3520
+ "C(C S",
3521
+ "OC) C1",
3522
+ "OC(C) =O",
3523
+ "CC =C(",
3524
+ "NC(=O) OC",
3525
+ "CC[NH+] (",
3526
+ "c4ccccc 3",
3527
+ "3 CC4",
3528
+ "C= O",
3529
+ "C1= C(C)",
3530
+ "F)cc 2)c1",
3531
+ "CC [NH3+]",
3532
+ "CC c1cc(",
3533
+ "c 9",
3534
+ "O)cc 2)",
3535
+ ")cc 2)C1",
3536
+ "c1cc sc1",
3537
+ "ncc c1",
3538
+ "O [Si]",
3539
+ "c1n c(C)",
3540
+ "=[NH+] C",
3541
+ "c2ccc(C 3",
3542
+ "CCCC 2)c1",
3543
+ "c1 )N",
3544
+ "CCCC N",
3545
+ "(=O) C",
3546
+ "n c(Cl)",
3547
+ "- 3",
3548
+ "c1 c(C)cc",
3549
+ "Cc1cc2 c(",
3550
+ "C( OC2",
3551
+ "c4ccc5 c(",
3552
+ "2) c(C)c1",
3553
+ "[O-] )cc(",
3554
+ "C( CCCC",
3555
+ "CC1 CC1)",
3556
+ "cc (C(=O)",
3557
+ "cc2 c1",
3558
+ "CC1 CO",
3559
+ "CC2)c1 C",
3560
+ "N )cc1",
3561
+ ")ccc1 O",
3562
+ "C) n1",
3563
+ "O=C(C S",
3564
+ "(C) O)",
3565
+ "(Cl) c(C",
3566
+ "c1 c(N",
3567
+ "3) c2)",
3568
+ "C1 CCCC",
3569
+ "( OCC)",
3570
+ "CC1 CCN(",
3571
+ "c( =S)",
3572
+ "c(C 3",
3573
+ "c2n oc(C",
3574
+ "F)cc 4)",
3575
+ "C(C (C)C)",
3576
+ "(CCC )",
3577
+ "OC )cc(",
3578
+ "O)cc1 )",
3579
+ "cn1 )",
3580
+ "c2n [nH]",
3581
+ "(C) CCC",
3582
+ "1 )N",
3583
+ "c(Cl) c1)",
3584
+ "[O-]) n1",
3585
+ "C(C) =O",
3586
+ "c(=O) n(",
3587
+ "O=C(C n1",
3588
+ "OC )cc",
3589
+ "2) o1",
3590
+ "cc2 Cl)",
3591
+ "c3ccc nc3",
3592
+ "C( CC",
3593
+ "c(F) c3)",
3594
+ "CC c2c(",
3595
+ "CC(C) =",
3596
+ "c2n c3c(",
3597
+ "n3 cc",
3598
+ "cn 3",
3599
+ "O c2ccc",
3600
+ "o 2)CC1",
3601
+ "2) c(",
3602
+ "c(S C",
3603
+ "3) CC2",
3604
+ "C 6",
3605
+ "=O) C1",
3606
+ "ccc3 C)",
3607
+ "Cc1n n(",
3608
+ "nc( S",
3609
+ "O= c1",
3610
+ "=O) N",
3611
+ "s c3",
3612
+ "CCCO C1",
3613
+ "nn1 C",
3614
+ "CCC(C) C(",
3615
+ "n(C) c1",
3616
+ "O=C1 N",
3617
+ "c1ccc( S",
3618
+ "ncn c1",
3619
+ "2) N1",
3620
+ "F) n1",
3621
+ "CC= CC=",
3622
+ "c[nH] 1)",
3623
+ "CCC3 (CC",
3624
+ "(N) =S)",
3625
+ "[NH3+] )N",
3626
+ "e ]",
3627
+ "(=O) [nH]",
3628
+ "c1n n",
3629
+ "ncc 3",
3630
+ "(C c2ccc",
3631
+ "OCC 3",
3632
+ "s c(",
3633
+ "CCC1 )",
3634
+ "7 )",
3635
+ "cccc 3",
3636
+ "c5ccc 6",
3637
+ "CCC N2",
3638
+ "3) ccc2",
3639
+ "c2 =O)cc1",
3640
+ "C= CC(",
3641
+ "(C)C (C",
3642
+ "n2 cc(",
3643
+ "2) C2",
3644
+ "n2 nn",
3645
+ "C= CCN",
3646
+ ")N1 CCC(",
3647
+ "ccc n1",
3648
+ "CC1 CC(",
3649
+ "3CCCCC 3",
3650
+ "C [Si](C)",
3651
+ "Cn1 cc(C",
3652
+ "n2 n",
3653
+ "nn c2",
3654
+ "3)C2 )cc1",
3655
+ "nc1 N",
3656
+ "CC1 CCC(",
3657
+ "(C(=O)N 1",
3658
+ "B (O)",
3659
+ "CO C(=O)N",
3660
+ "CC3 )cc2",
3661
+ "N =N",
3662
+ "c5 cccc",
3663
+ "c1cc( N2",
3664
+ "ccccc 5",
3665
+ "(CC (O)",
3666
+ "4) C3",
3667
+ "cn 2)CC1",
3668
+ "3 CCC",
3669
+ "CO C(",
3670
+ "c1c( N)",
3671
+ "H ]",
3672
+ "(=O) o",
3673
+ "c2cn cc",
3674
+ "c -2",
3675
+ "C [NH3+])",
3676
+ "ccccc 8",
3677
+ "3) C2)",
3678
+ "CCC2 C3",
3679
+ "(=O) [O-]",
3680
+ "F)cc1 Cl",
3681
+ "CCCCC O",
3682
+ ")cc 5",
3683
+ "CC1 CCCC1",
3684
+ "CO CCC",
3685
+ "3 C(=O)",
3686
+ "N c1ccc",
3687
+ "OCO 4)",
3688
+ "CCC l)",
3689
+ "C(C c1cn",
3690
+ "c3ccc s",
3691
+ "c2s c(",
3692
+ "(CC (C)",
3693
+ "CCO CC2",
3694
+ "O= c1[nH]",
3695
+ "O c2ccccc",
3696
+ "CCCN (C",
3697
+ "c1ccc(C (",
3698
+ "C(F) F)",
3699
+ "c1) OCCO2",
3700
+ "c(Cl)cc 2",
3701
+ "s c1C",
3702
+ ")cc1 2",
3703
+ "c6 ccc",
3704
+ "c2cs c(",
3705
+ "c2cc( N)",
3706
+ "c4 c3",
3707
+ "c3 cs",
3708
+ "(CC l)",
3709
+ "c(C) cc1",
3710
+ "N# C",
3711
+ "C(O) C1O",
3712
+ "c2cn n(C)",
3713
+ "c2) OCCO",
3714
+ "c2)OCO 3)",
3715
+ "C(C 2",
3716
+ "F)cc 4",
3717
+ "4 CC4)",
3718
+ "c5 cc",
3719
+ "c3c( c2",
3720
+ "Cn1 cc",
3721
+ "CC2 CCCO",
3722
+ "C(C)C) c1",
3723
+ "[C -]",
3724
+ "Cc1 c(C)",
3725
+ "(=O) c1cc",
3726
+ "c1ccc( O)",
3727
+ "c2cc o",
3728
+ "[O-]) c2",
3729
+ ")cc1 )",
3730
+ "CC1 CCC(C",
3731
+ "c(C) c3)",
3732
+ "cc 5",
3733
+ "(C) c2",
3734
+ "ccc2 n1",
3735
+ "OC(F) F",
3736
+ "CCCC (C)C",
3737
+ "C OC(C",
3738
+ "[O-]) c3)",
3739
+ "[N+] 1",
3740
+ "n2 C",
3741
+ "Cc1cc( N2",
3742
+ "C NC(=O)N",
3743
+ "C(C)C 2",
3744
+ "CCCO 1)",
3745
+ "Cc1ccc( O",
3746
+ "CC(C) c1n",
3747
+ "n(C) n1",
3748
+ "CO CCn1",
3749
+ "C(C) CC",
3750
+ "Br )cc2",
3751
+ "Cl )N",
3752
+ "c1s ccc1",
3753
+ "(C)cc 3",
3754
+ "CC( =N",
3755
+ "C2 CCCCC",
3756
+ "c4cccc 5",
3757
+ "C) C(=O)",
3758
+ "S C(=C",
3759
+ "(C 3",
3760
+ "c(=O) n3",
3761
+ "CC(C #N)",
3762
+ "c(F) c2",
3763
+ "C1 CCC(",
3764
+ "(C) CC2",
3765
+ "C= C2",
3766
+ "2) N",
3767
+ "cc1 Cl)",
3768
+ "c1cn c(",
3769
+ "c1C #N",
3770
+ "N1 CCCC1",
3771
+ "=O) CC1",
3772
+ "CC(C) (",
3773
+ "CCCCC =",
3774
+ "2)cc1 C",
3775
+ "( c2ccc(",
3776
+ "(CCCC )",
3777
+ "C(C 1",
3778
+ "4) C2)",
3779
+ "c1 c(Cl)",
3780
+ "c(C (C)C)",
3781
+ "n2 C)",
3782
+ "CC c2ccc(",
3783
+ "CO CCN",
3784
+ "OC c1ccc(",
3785
+ "c(=O) n(C",
3786
+ "N (C)C",
3787
+ "3)C1) C2",
3788
+ "C( Br)",
3789
+ "c1n c(N)",
3790
+ "Br) c2)",
3791
+ "c(C) c(",
3792
+ "ccc2 Cl)",
3793
+ "nc2 s",
3794
+ "[nH] 3)",
3795
+ "CS CCC(",
3796
+ "c8 ccccc8",
3797
+ "CCC (C)C)",
3798
+ "CCCCC )",
3799
+ "(C) c(",
3800
+ "CO 1",
3801
+ "= NC(=O)",
3802
+ "OC) C(=O)",
3803
+ "CCCC CC)",
3804
+ "O) C(O)",
3805
+ "c2c( c1)",
3806
+ "n2 nc(",
3807
+ "c( F)cc2",
3808
+ "C2 CC3",
3809
+ "3)n 2)",
3810
+ "# [N+]",
3811
+ "c(C)c1 C",
3812
+ "Cc1cc sc1",
3813
+ "O 1)",
3814
+ "C2 CCCCC2",
3815
+ "s 2)C1",
3816
+ "F)cc cc3",
3817
+ "c(=O) n",
3818
+ "C1 CC",
3819
+ "[NH3+] C(",
3820
+ "C2 =O)c1",
3821
+ "(Cl) s1",
3822
+ "[n+] 2",
3823
+ "[O-] )cc3",
3824
+ "CC( S",
3825
+ "Cc1ccc o1",
3826
+ ")N1 CCCC1",
3827
+ "cc2 C)",
3828
+ "C(C) O)",
3829
+ "C1 (C)",
3830
+ "(C NC(=O)",
3831
+ "C1 CC1)",
3832
+ "O 3)",
3833
+ "CO c1c(",
3834
+ "Cc1n c(N",
3835
+ "( c2ccc",
3836
+ "N1 CCN",
3837
+ "(CC [NH+]",
3838
+ "(C) (C)C",
3839
+ "c1n c(Cl)",
3840
+ "c2cc (O)",
3841
+ "cs 2)cc1",
3842
+ "c2n oc(",
3843
+ "c1cc( O)",
3844
+ "nc2 )cc1",
3845
+ "ccccc2 3)",
3846
+ "2)CC1 )",
3847
+ "N1 CCN(",
3848
+ "(=O)N C",
3849
+ "O=C(C =C",
3850
+ "OC) c(C",
3851
+ "OC(F) F)",
3852
+ "n c4)",
3853
+ "c1cn n(",
3854
+ "(C) cc1",
3855
+ "S 2",
3856
+ "c1n c(C2",
3857
+ "c(C) c2)",
3858
+ "C1 =C(",
3859
+ "C OC(C)",
3860
+ "c3ccc( O",
3861
+ "CCC(C 2",
3862
+ ")ccc1 2",
3863
+ "ncn 2",
3864
+ "c1cn cc",
3865
+ "c3) OCO4)",
3866
+ "N2 CCN",
3867
+ "CC1 CC",
3868
+ "CC(C) N",
3869
+ "s 2)n1",
3870
+ "(=O)N (C)",
3871
+ "Nc1n cn",
3872
+ "c2cn 3",
3873
+ "(Cl) c3",
3874
+ "CCC1 CCCC",
3875
+ "CC( =C",
3876
+ "O c1ccc(C",
3877
+ "CC(O) C1",
3878
+ "= CN",
3879
+ "(=O)N C2",
3880
+ "[O-]) C(",
3881
+ "CCC OC",
3882
+ "(C)C) C2",
3883
+ "2)cc1 Cl",
3884
+ ")N c1ccc(",
3885
+ "c1n [nH]",
3886
+ ")ccc1 N",
3887
+ "cc( S(=O)",
3888
+ "CCO CC",
3889
+ "cn 2)c1",
3890
+ "c4cc 5",
3891
+ "3CCO CC3",
3892
+ "(C) =O)",
3893
+ "n nc(",
3894
+ "c2c( c1",
3895
+ "CN2 C(=O)",
3896
+ "C1 CC2",
3897
+ "2) C(=O)N",
3898
+ "NC( N",
3899
+ "c1ncc s1",
3900
+ "c2 C1",
3901
+ "cccc c12)",
3902
+ "CCc1n c(",
3903
+ "nn nc1",
3904
+ "c2c1 C",
3905
+ "3)n 2)c1",
3906
+ "c5 c(",
3907
+ "=C(N) N",
3908
+ "2) C(",
3909
+ "cn 3)",
3910
+ "CCC #N)",
3911
+ "c1ccc( N)",
3912
+ "c1n cc(C",
3913
+ "c3ccc o",
3914
+ "C2 C3",
3915
+ "c2n o",
3916
+ "c2C) CC1",
3917
+ "c1n o",
3918
+ "c2 c(Cl)",
3919
+ "Br) C1",
3920
+ "= CC2",
3921
+ "(C) n1",
3922
+ "= CC",
3923
+ "CCn1 cn",
3924
+ "CCCCC (",
3925
+ "OCC1 OC(",
3926
+ "cs1 )",
3927
+ "[O-]) C2",
3928
+ "=C (Cl)",
3929
+ "= CC1",
3930
+ "o c(=O)",
3931
+ "O c1ccc",
3932
+ "C( =S)",
3933
+ "C= CCCC",
3934
+ "cc1 )",
3935
+ "c3 =O)",
3936
+ "[nH] 2)c1",
3937
+ "(C 1",
3938
+ "(C) c3)",
3939
+ "c(F) c1)",
3940
+ "C= CC(C",
3941
+ "ccc( N",
3942
+ "C(=O) OC1",
3943
+ "CC) c1",
3944
+ "C12 CC3",
3945
+ "CCC (C(C)",
3946
+ "Cc1n nc(",
3947
+ "[O-]) c1)",
3948
+ "O=C(N CC1",
3949
+ "CCS C1",
3950
+ "c2)cc1 OC",
3951
+ "s 4)",
3952
+ "c2cc nc(N",
3953
+ "C(=O) C3",
3954
+ "c6 )",
3955
+ "(C (N)=O)",
3956
+ "[NH3+]) C",
3957
+ "= N)",
3958
+ "(CC CCC",
3959
+ "3)C2 =O)",
3960
+ "2) n",
3961
+ "CC OC(C)",
3962
+ "C(N) =S",
3963
+ "ccc3 Cl)",
3964
+ "ccccc3 4)",
3965
+ "CCCC CCO",
3966
+ "CC(=O) OC",
3967
+ "Cn1 ccnc1",
3968
+ "3 CC[NH+]",
3969
+ "(=O) O",
3970
+ "C2 1",
3971
+ "O=C( CC",
3972
+ "CCCCC 3",
3973
+ "F)cc 2)n1",
3974
+ "c2cc ncc2",
3975
+ "C1 (C)C",
3976
+ "C(= C)",
3977
+ "c2 c(N",
3978
+ "n2 n1",
3979
+ "c1cn c(N",
3980
+ "OCC) c1",
3981
+ "c(C O",
3982
+ "c1cn 2",
3983
+ "CC1 CCCO1",
3984
+ "n c2)c1",
3985
+ "c1 [nH+]",
3986
+ "c( F)cc1",
3987
+ "CC(O) CO",
3988
+ "ccc2 F)",
3989
+ "c3cc4 c(",
3990
+ "CC OC3",
3991
+ "(C c2ccc(",
3992
+ "c(C O)",
3993
+ "c2n c(=O)",
3994
+ "cc1 F",
3995
+ "C(=O)N CC",
3996
+ "n2 nc(C)",
3997
+ ")ccc1 Br",
3998
+ "N1 CCC(",
3999
+ "c4) CC3)",
4000
+ "c2n c(N)",
4001
+ "CO CCN1",
4002
+ "F)ccc1 F",
4003
+ "=[N-] )",
4004
+ "C3 )cc1",
4005
+ "c1ccc(C O",
4006
+ "c(C) c2",
4007
+ "cc( O)",
4008
+ "Cl) n1",
4009
+ "3) c2)cc1",
4010
+ "n c5",
4011
+ "c2C) c1",
4012
+ "O=C( CC1",
4013
+ "C(C) S",
4014
+ "( CCO",
4015
+ "N C1=O",
4016
+ "c2n cc(C",
4017
+ "#N) c1",
4018
+ "C1 CCCN",
4019
+ "c2n (",
4020
+ "N1 CC",
4021
+ "C(O) =C(",
4022
+ "Cc1n n",
4023
+ "N c1",
4024
+ "S C(C)",
4025
+ "CCC1 (C",
4026
+ "OCO 2",
4027
+ "CCCC CC=",
4028
+ "CC# CC#",
4029
+ "c1cc( N)",
4030
+ "F)cc2 F)",
4031
+ "2 )cc(",
4032
+ "(C)C (C)C",
4033
+ "cs 2)",
4034
+ "c3cc( OC)",
4035
+ "CCC2 1",
4036
+ "c1 =O)",
4037
+ "(CC OC)",
4038
+ "c2 3)",
4039
+ "C(=O) OC)",
4040
+ "(C)C )cc2",
4041
+ "c2n c3",
4042
+ "O O",
4043
+ "c2nnn n2",
4044
+ "3 )cc2)",
4045
+ "= CCCC",
4046
+ "(Cl)c1 Cl",
4047
+ "N# Cc1",
4048
+ "c(C N",
4049
+ "c oc(",
4050
+ "(C(C) (C)",
4051
+ "OC c1ccc",
4052
+ "C= CC2",
4053
+ "%1 0",
4054
+ "O=C(N C",
4055
+ "NC(=O)N (",
4056
+ "c4 ncc",
4057
+ "=C( S",
4058
+ "CCO P(=O)",
4059
+ "Cc1cs c(",
4060
+ "CC(C) O1",
4061
+ ")cc (C)c1",
4062
+ "c1ccc( N(",
4063
+ "CS C1",
4064
+ "CC2) nc1",
4065
+ "c4 )cc3",
4066
+ "c1cc (=O)",
4067
+ "N= [N+]",
4068
+ "C( c1ccc(",
4069
+ ")cc 5)",
4070
+ "(C(=O) C3",
4071
+ "N1 CCOCC1",
4072
+ "C c2cccc",
4073
+ "CC2 (",
4074
+ "nn (C)",
4075
+ "n2 cc(C",
4076
+ "c1n n(",
4077
+ "CCC1 (CC)",
4078
+ "C( O",
4079
+ "n 3)n",
4080
+ "o 2)C1",
4081
+ "Cl) C(=O)",
4082
+ "n3 ccc",
4083
+ "(C) c2)",
4084
+ "c( Br)cc1",
4085
+ "n2 c(C)",
4086
+ "Br) s1",
4087
+ "CC(O) (C",
4088
+ "C1 CC(",
4089
+ "nc(C) n1",
4090
+ "c6 ccc(",
4091
+ "C(C) (C",
4092
+ "F)cc 2)C1",
4093
+ "F) C(=O)",
4094
+ ")N1 CC",
4095
+ "(Cl) (Cl)",
4096
+ "c1cccc( O",
4097
+ "=O) cc2",
4098
+ "CC )cc1",
4099
+ "4 C)",
4100
+ "CC2 (CC",
4101
+ "c1 co",
4102
+ "C1 CCC2",
4103
+ "c2c( N)",
4104
+ "c2ccs c2)",
4105
+ "(Cl)cc 4)",
4106
+ "C[NH2+] 1",
4107
+ "c[nH] 1",
4108
+ "CCC 5",
4109
+ "c(=O) c3",
4110
+ "c1cc( OC",
4111
+ "CCCC CCC1",
4112
+ "c4 c3)",
4113
+ "CCO 2",
4114
+ "[NH2+] 1)",
4115
+ "C[NH2+] C",
4116
+ "c1 c[nH+]",
4117
+ "Br)cc1 )",
4118
+ "CCCCC (C)",
4119
+ "ccc2 C)",
4120
+ "CCn1 c(",
4121
+ "(C(=O) CC",
4122
+ "CN( S(=O)",
4123
+ "c(F) c3",
4124
+ "CC2 CCC",
4125
+ "= CC(",
4126
+ "N2 CC",
4127
+ "=[NH+] O",
4128
+ "cc c12",
4129
+ "C2 C1",
4130
+ "Cc1n c(C)",
4131
+ "(C(=O) CS",
4132
+ "OC) n1",
4133
+ "2)c1 =O",
4134
+ "c%1 0",
4135
+ "CO CC(C)",
4136
+ "3) CC2)c1",
4137
+ "c(N C2",
4138
+ "c3ccc( O)",
4139
+ "NC(=O)N C",
4140
+ "- c2ccc(",
4141
+ "n2 nc(C",
4142
+ "c2cc (=O)",
4143
+ "CCC( N)",
4144
+ "CCn1 c(S",
4145
+ "ncn c3",
4146
+ "CCC l",
4147
+ "c1n nc(C",
4148
+ "( c4ccccc",
4149
+ "F c1ccc(",
4150
+ "c3cc( Br)",
4151
+ "= Cc1ccc(",
4152
+ "c2n c(Cl)",
4153
+ "CCS 1",
4154
+ "C OC2",
4155
+ "S CC(=O)",
4156
+ "c2 [nH+]",
4157
+ "C(C) (O)",
4158
+ "COc1cc( N",
4159
+ "n2cc nc2)",
4160
+ "(C)C) c(",
4161
+ "2)c1 )",
4162
+ "(F) c3",
4163
+ "(F)(F) F",
4164
+ "CCC 4)",
4165
+ "c(Cl) c2",
4166
+ "[nH] n1",
4167
+ "n2 c(C",
4168
+ "(C2 CC2)",
4169
+ "C= CCC1",
4170
+ "N= C1",
4171
+ "OC1 2",
4172
+ "C4 CCCCC",
4173
+ "(=O) C2",
4174
+ "CCCC 2)C1",
4175
+ "OC) CC1",
4176
+ "O=C(N CC",
4177
+ "nc2 c(",
4178
+ "S1(=O) =O",
4179
+ "N# CC1",
4180
+ "O c3ccc(",
4181
+ "C(=O) C(C",
4182
+ "C3 O)",
4183
+ "F)cc1 )N",
4184
+ "CC2 (O)",
4185
+ "c3cc 2",
4186
+ "cn n1C",
4187
+ "nn 2)cc1",
4188
+ "Cc1ccc s1",
4189
+ "2) no1",
4190
+ "CC2 C1",
4191
+ ") C2",
4192
+ "O CC[NH+]",
4193
+ "S C(",
4194
+ "3 CCN(",
4195
+ "CCCC 4)",
4196
+ "CCC(C #N)",
4197
+ "O CC(",
4198
+ "(C(=O) CO",
4199
+ "n(C) c1=O",
4200
+ "[S e]",
4201
+ "c4ccc( OC",
4202
+ "F)cc3 F)",
4203
+ "n (CC)",
4204
+ "[S-] )",
4205
+ "3) C(=O)",
4206
+ "N# Cc1cc(",
4207
+ "CCCN C(N)",
4208
+ "2) nn1",
4209
+ "c(Cl)cc (",
4210
+ "Cc1cc nc(",
4211
+ "C(C) N",
4212
+ "o c1C",
4213
+ "cc1 OC)",
4214
+ "4) CC3",
4215
+ "c3ncc cc3",
4216
+ "cn c3",
4217
+ "CCC1 (C)",
4218
+ "c1c( O)",
4219
+ "Cc1n n(C",
4220
+ "CCC(C 1)",
4221
+ "c3 c[nH]",
4222
+ "(Cl)cc 4",
4223
+ "CCO c1cc",
4224
+ "CC( Br)",
4225
+ "CN( CC1",
4226
+ "c4cc (F)",
4227
+ "Cc1 c(N",
4228
+ "Cc1cc (F)",
4229
+ "C(C) CC)",
4230
+ "c3 o",
4231
+ "c2 c[nH+]",
4232
+ "[O-]) N",
4233
+ "OC) c2)",
4234
+ "C2 CCC1",
4235
+ "3) nc2",
4236
+ "Cc1ccc( S",
4237
+ "=O) ccc1",
4238
+ ")cc cc2",
4239
+ "CCS CC1",
4240
+ "N (C)C)",
4241
+ "c3n ccc",
4242
+ ")cc3) C2",
4243
+ "OC)cc1 )",
4244
+ "3) n1",
4245
+ "CC1 =N",
4246
+ "CC(C 1",
4247
+ "n1 cc",
4248
+ "2 CCN(",
4249
+ "CC (CC)",
4250
+ "(N N)",
4251
+ "(C) CC2)",
4252
+ "F)cc1 F)",
4253
+ "Br)cc (C",
4254
+ "Cn1 c(",
4255
+ "2)cc1 F",
4256
+ "Cc1n c(C2",
4257
+ "c1cc nc(N",
4258
+ "OCC (C)C)",
4259
+ "(C)C )cc(",
4260
+ "cs c1",
4261
+ "3) c(",
4262
+ "S S",
4263
+ "c2 c3c(",
4264
+ "CCCC 2)n1",
4265
+ "C# CCO",
4266
+ "c1cc oc1",
4267
+ "C(O) C2",
4268
+ "4 CC[NH+]",
4269
+ "(C) CC3)",
4270
+ "CC1 CCCC(",
4271
+ "c1 [nH]c(",
4272
+ "(F)c1 F",
4273
+ ")N (",
4274
+ "c1n c(=O)",
4275
+ "c2c( O)",
4276
+ "c2n n(",
4277
+ "CCC(C) C1",
4278
+ "2 C(",
4279
+ "C 3)n",
4280
+ "cn n2",
4281
+ "2) C(C)",
4282
+ "CC4 )cc3",
4283
+ "n o2)",
4284
+ "C2 (",
4285
+ "[O-]) cn1",
4286
+ "=C (C)C",
4287
+ "n 6",
4288
+ "c1n oc(",
4289
+ "c2cn n(",
4290
+ ")N1 CCN",
4291
+ "N2 CCCC2",
4292
+ ")cc (=O)",
4293
+ "o 2)n1",
4294
+ "OC) c3)",
4295
+ "c5 cc(",
4296
+ "c1 O",
4297
+ "Cc1 c[nH]",
4298
+ "OCC (C)C",
4299
+ "2 CC[NH+]",
4300
+ "O= S1(=O)",
4301
+ "C(C) (",
4302
+ "[Si] (",
4303
+ "c2cc1 OC",
4304
+ "c(C) s1",
4305
+ "c[nH+] 1",
4306
+ ")cc 3)n",
4307
+ ") C",
4308
+ "c -3",
4309
+ "CC2 (C",
4310
+ ")ccc1 C",
4311
+ "N3 C(=O)",
4312
+ "N c1cn",
4313
+ "CC1 CN",
4314
+ "Br) c1)",
4315
+ "CC4 )cc3)",
4316
+ "OC) c2",
4317
+ "3 CCN",
4318
+ "C1 (O)",
4319
+ "nn (",
4320
+ "=O)cc 2)",
4321
+ "(C) CCO",
4322
+ "c4 nc(",
4323
+ "c(N S(=O)",
4324
+ "c2nn [n-]",
4325
+ "ccc 6",
4326
+ "c3n nc(",
4327
+ "c2cn c(N",
4328
+ "c o1",
4329
+ "4) C3)",
4330
+ "c(C [NH+]",
4331
+ "CCn1 cc",
4332
+ ")cc1 F",
4333
+ "c2n nc3",
4334
+ "OCC O)",
4335
+ "3) n2",
4336
+ "n2cc nc2",
4337
+ "c2cn (C)",
4338
+ "c1ncn 2",
4339
+ "C1 CCC1",
4340
+ "#N )cc2",
4341
+ "c2cc nn2",
4342
+ "N1 S(=O)",
4343
+ "[C H]",
4344
+ "C2 CC2)c1",
4345
+ "CCC12 C",
4346
+ "c6ccc 7",
4347
+ "n o1)",
4348
+ "C2 (C)",
4349
+ "Nc1n c(",
4350
+ "2 CC3",
4351
+ "c2 co",
4352
+ "c1cc s",
4353
+ "CC2) cn1",
4354
+ "c1cs c(C",
4355
+ "C(C) =",
4356
+ "CCCN (CCC",
4357
+ "c(N) n",
4358
+ "Cc1 c(Cl)",
4359
+ "o c2c1",
4360
+ "F)cc 3)n",
4361
+ "c1) C(",
4362
+ "CC1 CC2",
4363
+ "C2 CCN",
4364
+ "c(N) n1",
4365
+ ") C(C)C",
4366
+ "(CC =C)",
4367
+ "CO CC(",
4368
+ "C(O) C(",
4369
+ "c2ccc( I",
4370
+ "[O-]) c(N",
4371
+ "c1) CCC2",
4372
+ "c(OC) c3)",
4373
+ ") C(",
4374
+ "2) O1",
4375
+ "c2cn [nH]",
4376
+ "CCn1 cc(",
4377
+ "C= CCN1",
4378
+ "c1) OCCO",
4379
+ "C) CC3)",
4380
+ "CCC1 =O",
4381
+ "OCCO 4)",
4382
+ "CN c1n",
4383
+ "(CC 3",
4384
+ "s c(N",
4385
+ "2) cs1",
4386
+ "C1 (C(=O)",
4387
+ "CCC1 O",
4388
+ "(C)C )cc3",
4389
+ "c1cc nc(",
4390
+ "= N1",
4391
+ "Cn1 nccc1",
4392
+ "C1 =N",
4393
+ "O=C( OC",
4394
+ "c1c( Br)",
4395
+ "c4ccc( N",
4396
+ "c3ccc( n4",
4397
+ "c1s c(",
4398
+ "CO c1c(C)",
4399
+ "CS c1ccc(",
4400
+ "c1ccc2 n",
4401
+ "3)CC2) n1",
4402
+ "CC(O) C(",
4403
+ "c2ccc( N)",
4404
+ "c4cc ncc",
4405
+ "CCn1 cc(C",
4406
+ "C1 =C",
4407
+ "[O-]) s1",
4408
+ "(=O) CC1",
4409
+ "(CC N",
4410
+ "n3 C)",
4411
+ "c2cc n",
4412
+ "CC(C)C n1",
4413
+ "c1cc n",
4414
+ ")cc( N",
4415
+ "2 CCC",
4416
+ "ccc2 o1",
4417
+ "CO c1cn",
4418
+ "c(=O) c(",
4419
+ "Cc1n cc",
4420
+ "OC 5",
4421
+ "c4ccc o",
4422
+ "O c2ccc(C",
4423
+ "(=O)N CC",
4424
+ "CC3 CCC2",
4425
+ "CCO c1cc(",
4426
+ "Nc1n c(N",
4427
+ "o c2",
4428
+ "CC2) s1",
4429
+ "nn 2)",
4430
+ "[NH2+] C3",
4431
+ "cc2 c(",
4432
+ "Cn1 nc(",
4433
+ "cc2 F)",
4434
+ "n2cn c3",
4435
+ "CCCC N(",
4436
+ "c1)OCO 2)",
4437
+ "(C c2cccc",
4438
+ "C( c2ccc",
4439
+ "CC2)cc1 C",
4440
+ "c3ccc(C 4",
4441
+ "C2 CCCC2",
4442
+ "CC( OC)",
4443
+ "c(C =O)",
4444
+ "C(C#N) =C",
4445
+ "CC c1c(C)",
4446
+ "N c1cccc(",
4447
+ "- 4",
4448
+ "c2ccc( n3",
4449
+ "Br) c2",
4450
+ "CC1 CCCN",
4451
+ "c3cc s",
4452
+ "Br) CC1",
4453
+ ") [NH+]1",
4454
+ "[O-]) cn",
4455
+ "C#N )cc1",
4456
+ "c( I)",
4457
+ "F)cc2 1",
4458
+ "c(C#N) c1",
4459
+ "C[NH+] (",
4460
+ "[NH+]2 CC",
4461
+ "c3 )cc2",
4462
+ "c2cn c(",
4463
+ "cc (F)",
4464
+ "c2) cn1",
4465
+ "c(OC )cc2",
4466
+ "C2 (C)C",
4467
+ "N1 CCCCC1",
4468
+ "CC(C)C (C",
4469
+ "B (",
4470
+ "CC(=O)N C",
4471
+ "=S )N1",
4472
+ "cc cc2",
4473
+ "O) C1",
4474
+ "F)cc (F)",
4475
+ "c3 4)",
4476
+ "C= CCC",
4477
+ "(C) c(C",
4478
+ "N1 C",
4479
+ "c1ccc(N C",
4480
+ "(C(=O) C(",
4481
+ "Cc1cn 2",
4482
+ "c(C) cc2",
4483
+ "O) ccc1",
4484
+ "c(C) o1",
4485
+ ") c1ccc",
4486
+ "c1cn n2",
4487
+ "CC3 )cc2)",
4488
+ "C= C=",
4489
+ ")N (C)",
4490
+ "O P",
4491
+ "[NH+] (CC",
4492
+ "C1 CO",
4493
+ "C2 C",
4494
+ "CC #N)",
4495
+ "[O-]) c(C",
4496
+ "( =S)",
4497
+ "CCCC( N",
4498
+ "F C(F)",
4499
+ "Cc1n ccn1",
4500
+ "CC c2cc(",
4501
+ "3 c(",
4502
+ "ccc nc2",
4503
+ "CCC1 C",
4504
+ "S c2n",
4505
+ "C= C(C#N)",
4506
+ "CO c1n",
4507
+ "O) CC1",
4508
+ "[nH] c(C",
4509
+ ")N (C)C",
4510
+ "5) ccc4",
4511
+ "4)C2) C3)",
4512
+ ")cc1 Cl",
4513
+ "Br) c(",
4514
+ "(F) F",
4515
+ "cc1 F)",
4516
+ "4CCCCC 4)",
4517
+ "C =O)",
4518
+ "5 )cc4",
4519
+ "OC1 (C)C",
4520
+ "5) c4",
4521
+ "C c3ccccc",
4522
+ "CC c2ccc",
4523
+ "c1ccc(C #",
4524
+ ")cc1 C",
4525
+ "O=C( CCC"
4526
+ ]
4527
+ }
4528
+ }
tokenizer_config.json ADDED
@@ -0,0 +1,52 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "<pad>",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "1": {
12
+ "content": "<s>",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "2": {
20
+ "content": "<unk>",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "3": {
28
+ "content": "<mask>",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "2260": {
36
+ "content": "</s>",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "bos_token": "<s>",
45
+ "clean_up_tokenization_spaces": true,
46
+ "eos_token": "</s>",
47
+ "mask_token": "<mask>",
48
+ "model_max_length": 512,
49
+ "pad_token": "<pad>",
50
+ "tokenizer_class": "PreTrainedTokenizerFast",
51
+ "unk_token": "<unk>"
52
+ }