Shivanikumar committed on
Commit 57ece4f · verified · 1 Parent(s): 540e541

Upload tokenizer

Files changed (3)
  1. special_tokens_map.json +24 -0
  2. tokenizer.json +2961 -0
  3. tokenizer_config.json +56 -0
special_tokens_map.json ADDED
@@ -0,0 +1,24 @@
+ {
+   "bos_token": {
+     "content": "<|startoftext|>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "eos_token": {
+     "content": "<|endoftext|>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "pad_token": "<|endoftext|>",
+   "unk_token": {
+     "content": "<unk>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   }
+ }
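
The special_tokens_map.json above mixes two entry forms: `bos_token`, `eos_token`, and `unk_token` are objects with a `content` field, while `pad_token` is a bare string. The sketch below shows one way a reader might resolve each entry to its literal token text using only the standard library. The helper names `token_text` and `resolve_special_tokens` are hypothetical and are not part of any library; the JSON is copied from the file in this commit with the object entries collapsed onto single lines.

```python
import json

# Contents of special_tokens_map.json as added in this commit
# (object entries collapsed onto one line each for brevity).
SPECIAL_TOKENS_MAP = """
{
  "bos_token": {"content": "<|startoftext|>", "lstrip": false,
                "normalized": false, "rstrip": false, "single_word": false},
  "eos_token": {"content": "<|endoftext|>", "lstrip": false,
                "normalized": false, "rstrip": false, "single_word": false},
  "pad_token": "<|endoftext|>",
  "unk_token": {"content": "<unk>", "lstrip": false,
                "normalized": false, "rstrip": false, "single_word": false}
}
"""


def token_text(entry):
    """Hypothetical helper: return the token string for either entry form."""
    if isinstance(entry, str):
        # Bare-string form, as used by "pad_token" in this file.
        return entry
    # Object form: the literal token is stored under "content".
    return entry["content"]


def resolve_special_tokens(raw_json):
    """Hypothetical helper: map each special-token role to its literal text."""
    return {role: token_text(entry) for role, entry in json.loads(raw_json).items()}


resolved = resolve_special_tokens(SPECIAL_TOKENS_MAP)
for role, text in resolved.items():
    print(f"{role}: {text}")

# In this file the padding role reuses the end-of-text string.
print("pad reuses eos:", resolved["pad_token"] == resolved["eos_token"])
```

Read this way, the padding role points at `<|endoftext|>` rather than at the separate `<pad>` entry that tokenizer.json lists among its added tokens, which may matter when building attention masks or loss masks for padded batches.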
tokenizer.json ADDED
@@ -0,0 +1,2961 @@
+ {
+   "version": "1.0",
+   "truncation": {
+     "direction": "Right",
+     "max_length": 256,
+     "strategy": "LongestFirst",
+     "stride": 0
+   },
+   "padding": null,
+   "added_tokens": [
+     {
+       "id": 0,
+       "content": "<|startoftext|>",
+       "single_word": false,
+       "lstrip": false,
+       "rstrip": false,
+       "normalized": false,
+       "special": true
+     },
+     {
+       "id": 1,
+       "content": "<pad>",
+       "single_word": false,
+       "lstrip": false,
+       "rstrip": false,
+       "normalized": false,
+       "special": true
+     },
+     {
+       "id": 2,
+       "content": "<|endoftext|>",
+       "single_word": false,
+       "lstrip": false,
+       "rstrip": false,
+       "normalized": false,
+       "special": true
+     },
+     {
+       "id": 3,
+       "content": "<unk>",
+       "single_word": false,
+       "lstrip": false,
+       "rstrip": false,
+       "normalized": false,
+       "special": true
+     },
+     {
+       "id": 4,
+       "content": "<mask>",
+       "single_word": false,
+       "lstrip": false,
+       "rstrip": false,
+       "normalized": false,
+       "special": true
+     }
+   ],
+   "normalizer": null,
+   "pre_tokenizer": {
+     "type": "ByteLevel",
+     "add_prefix_space": false,
+     "trim_offsets": true,
+     "use_regex": true
+   },
+   "post_processor": null,
+   "decoder": null,
+   "model": {
+     "type": "BPE",
+     "dropout": null,
+     "unk_token": "<unk>",
+     "continuing_subword_prefix": null,
+     "end_of_word_suffix": null,
+     "fuse_unk": false,
+     "byte_fallback": false,
+     "ignore_merges": false,
+     "vocab": {
76
+ "<|startoftext|>": 0,
77
+ "<pad>": 1,
78
+ "<|endoftext|>": 2,
79
+ "<unk>": 3,
80
+ "<mask>": 4,
81
+ "#": 5,
82
+ "(": 6,
83
+ ")": 7,
84
+ "-": 8,
85
+ "1": 9,
86
+ "2": 10,
87
+ "3": 11,
88
+ "4": 12,
89
+ "5": 13,
90
+ "6": 14,
91
+ "=": 15,
92
+ "B": 16,
93
+ "C": 17,
94
+ "F": 18,
95
+ "H": 19,
96
+ "N": 20,
97
+ "O": 21,
98
+ "S": 22,
99
+ "[": 23,
100
+ "]": 24,
101
+ "c": 25,
102
+ "l": 26,
103
+ "n": 27,
104
+ "o": 28,
105
+ "r": 29,
106
+ "s": 30,
107
+ "cc": 31,
108
+ "CC": 32,
109
+ "(=": 33,
110
+ "ccc": 34,
111
+ "NC": 35,
112
+ "nc": 36,
113
+ "Cc": 37,
114
+ "ccccc": 38,
115
+ "CO": 39,
116
+ "CCC": 40,
117
+ "cccc": 41,
118
+ "Nc": 42,
119
+ "(-": 43,
120
+ "CCO": 44,
121
+ "COc": 45,
122
+ "CCN": 46,
123
+ "nn": 47,
124
+ "Cl": 48,
125
+ "nH": 49,
126
+ "CN": 50,
127
+ "OC": 51,
128
+ "CCCC": 52,
129
+ ")(=": 53,
130
+ ")(": 54,
131
+ "CCc": 55,
132
+ "ncc": 56,
133
+ "NCc": 57,
134
+ "sc": 58,
135
+ "nnc": 59,
136
+ ")=": 60,
137
+ "CNC": 61,
138
+ "NCC": 62,
139
+ "Cn": 63,
140
+ "oc": 64,
141
+ "COC": 65,
142
+ "cn": 66,
143
+ "Sc": 67,
144
+ "cnc": 68,
145
+ "12": 69,
146
+ "21": 70,
147
+ "CCOC": 71,
148
+ "CCCCC": 72,
149
+ "cccnc": 73,
150
+ "CCOCC": 74,
151
+ "OCC": 75,
152
+ "Br": 76,
153
+ "cnn": 77,
154
+ "CCCN": 78,
155
+ "CCNC": 79,
156
+ "CCOc": 80,
157
+ ")[": 81,
158
+ "ccccn": 82,
159
+ "noc": 83,
160
+ "CSc": 84,
161
+ "CCn": 85,
162
+ "Oc": 86,
163
+ "cccs": 87,
164
+ "ccco": 88,
165
+ "no": 89,
166
+ "ccncc": 90,
167
+ "csc": 91,
168
+ "cccn": 92,
169
+ "ccnc": 93,
170
+ "NCCc": 94,
171
+ "CNc": 95,
172
+ "cs": 96,
173
+ "23": 97,
174
+ "CCCO": 98,
175
+ "32": 99,
176
+ "ccsc": 100,
177
+ "ccn": 101,
178
+ "OCc": 102,
179
+ "NS": 103,
180
+ "ncnc": 104,
181
+ "CCCc": 105,
182
+ "CCCNC": 106,
183
+ "ncccc": 107,
184
+ "SCC": 108,
185
+ "nccn": 109,
186
+ "COCC": 110,
187
+ "on": 111,
188
+ "CS": 112,
189
+ "OCO": 113,
190
+ "OCCO": 114,
191
+ "CCCn": 115,
192
+ "ncn": 116,
193
+ "cncn": 117,
194
+ "NCCC": 118,
195
+ "ccoc": 119,
196
+ "nccc": 120,
197
+ "nccs": 121,
198
+ "COCc": 122,
199
+ "ncccn": 123,
200
+ "CCCCCC": 124,
201
+ "sccc": 125,
202
+ "CCCCN": 126,
203
+ "cnccn": 127,
204
+ "nnnn": 128,
205
+ "SC": 129,
206
+ "COCCN": 130,
207
+ "CNS": 131,
208
+ "ccnn": 132,
209
+ "cnnc": 133,
210
+ "cnnn": 134,
211
+ "CCSc": 135,
212
+ "SCc": 136,
213
+ "NCCNC": 137,
214
+ "nnn": 138,
215
+ "CCS": 139,
216
+ "NCCN": 140,
217
+ "nncn": 141,
218
+ "cncc": 142,
219
+ "CCCOc": 143,
220
+ "CCNc": 144,
221
+ "nnnc": 145,
222
+ "onc": 146,
223
+ "Fc": 147,
224
+ "coc": 148,
225
+ "COCCNC": 149,
226
+ "occc": 150,
227
+ "CCCCNC": 151,
228
+ "cscn": 152,
229
+ "CCNS": 153,
230
+ "CCCS": 154,
231
+ "NCCOc": 155,
232
+ "OCCN": 156,
233
+ "scnc": 157,
234
+ "OCCC": 158,
235
+ "NCCO": 159,
236
+ "34": 160,
237
+ "CCCOC": 161,
238
+ "nccnc": 162,
239
+ "NCCCn": 163,
240
+ "nncs": 164,
241
+ "ncnn": 165,
242
+ "NCCCN": 166,
243
+ "OCCCO": 167,
244
+ "NCCn": 168,
245
+ "COCCn": 169,
246
+ "cnccc": 170,
247
+ "43": 171,
248
+ "nonc": 172,
249
+ "CCCCn": 173,
250
+ "NCCCc": 174,
251
+ "CNCc": 175,
252
+ "COCCOc": 176,
253
+ "ncsc": 177,
254
+ "COCCCNC": 178,
255
+ "Nn": 179,
256
+ "FC": 180,
257
+ "Clc": 181,
258
+ "OCCc": 182,
259
+ "NCCCO": 183,
260
+ "COCCC": 184,
261
+ "CCCCO": 185,
262
+ "NCCNc": 186,
263
+ "CCCCc": 187,
264
+ "nsc": 188,
265
+ "ccon": 189,
266
+ "OCCn": 190,
267
+ "cncnc": 191,
268
+ "cccnn": 192,
269
+ "OCCNC": 193,
270
+ "CCCNc": 194,
271
+ "([": 195,
272
+ "CNCC": 196,
273
+ "scc": 197,
274
+ "nnsc": 198,
275
+ "csnn": 199,
276
+ "NCCCC": 200,
277
+ "OCCCC": 201,
278
+ "CCNCC": 202,
279
+ "SCCC": 203,
280
+ "CCCCS": 204,
281
+ "OS": 205,
282
+ "sccn": 206,
283
+ "Brc": 207,
284
+ "CCOCc": 208,
285
+ "13": 209,
286
+ "CCCSc": 210,
287
+ "NCCS": 211,
288
+ "NCCOC": 212,
289
+ "CCCCOc": 213,
290
+ "ccno": 214,
291
+ "nncc": 215,
292
+ "CCCNS": 216,
293
+ "NCCCNC": 217,
294
+ "co": 218,
295
+ "NCCCOC": 219,
296
+ "snc": 220,
297
+ "COCCO": 221,
298
+ "COCCc": 222,
299
+ "CCCCCNC": 223,
300
+ "CCOCCCNC": 224,
301
+ "ncoc": 225,
302
+ "COCCNc": 226,
303
+ "NCCSc": 227,
304
+ "ns": 228,
305
+ "nnco": 229,
306
+ "CNCCc": 230,
307
+ "CCOCCNC": 231,
308
+ "ncco": 232,
309
+ "snnc": 233,
310
+ "COCCOC": 234,
311
+ "ccnnc": 235,
312
+ "COCCCN": 236,
313
+ "COCCCC": 237,
314
+ "CCCCCN": 238,
315
+ "cnns": 239,
316
+ "OCCNc": 240,
317
+ "OCCCNC": 241,
318
+ "ocnc": 242,
319
+ "CCNCc": 243,
320
+ "NCCNS": 244,
321
+ "CCOCCC": 245,
322
+ "ccncn": 246,
323
+ "cncs": 247,
324
+ "OCCCn": 248,
325
+ "OCCOc": 249,
326
+ "OCCOC": 250,
327
+ "ncncc": 251,
328
+ "CCCCOC": 252,
329
+ "OCCSc": 253,
330
+ "OCCCc": 254,
331
+ "CCCOCC": 255,
332
+ "COCCCn": 256,
333
+ "NCCCOCC": 257,
334
+ "COCCOCC": 258,
335
+ ")-": 259,
336
+ "31": 260,
337
+ "NCCCOc": 261,
338
+ "NCCOCC": 262,
339
+ "OCOC": 263,
340
+ "occ": 264,
341
+ "NCCCCC": 265,
342
+ "oncc": 266,
343
+ "cnco": 267,
344
+ "cnoc": 268,
345
+ "CCCCNc": 269,
346
+ "NCCCSc": 270,
347
+ "NCCCNc": 271,
348
+ "SCCOc": 272,
349
+ "SCCn": 273,
350
+ "OCCCN": 274,
351
+ "COCCNS": 275,
352
+ "ncon": 276,
353
+ "CCCCSc": 277,
354
+ "COCCCNc": 278,
355
+ "NCCCCN": 279,
356
+ "COCCS": 280,
357
+ "SCCCC": 281,
358
+ "SCCNC": 282,
359
+ "SCCc": 283,
360
+ "nscc": 284,
361
+ "CCCCCCNC": 285,
362
+ "CCOCCCC": 286,
363
+ "SCCN": 287,
364
+ "OCn": 288,
365
+ "COn": 289,
366
+ "CCCCCn": 290,
367
+ "CCOCCn": 291,
368
+ "SCCO": 292,
369
+ "24": 293,
370
+ "NCCOCCO": 294,
371
+ "CCCCNS": 295,
372
+ "CCOCCOc": 296,
373
+ "CCOCCOC": 297,
374
+ "CCCCCc": 298,
375
+ "COCCSc": 299,
376
+ "NCCCS": 300,
377
+ "CCOCCN": 301,
378
+ "cscc": 302,
379
+ "NCCCNS": 303,
380
+ "OCCCOc": 304,
381
+ "sn": 305,
382
+ "OCCNS": 306,
383
+ "CCCOCc": 307,
384
+ "SCCCO": 308,
385
+ "OCCCNc": 309,
386
+ "COCCCNS": 310,
387
+ "COCCCOc": 311,
388
+ "CNn": 312,
389
+ "SCCCc": 313,
390
+ "nocc": 314,
391
+ "CCOCCSc": 315,
392
+ "CNCCN": 316,
393
+ "cnsc": 317,
394
+ "OCCCSc": 318,
395
+ "COCCOCCNC": 319,
396
+ "SCCOC": 320,
397
+ "cocn": 321,
398
+ "CCCCCS": 322,
399
+ "COCCCOC": 323,
400
+ "OCCS": 324,
401
+ "CCOCCOCC": 325,
402
+ "NCCCCCC": 326,
403
+ "CCCCCOc": 327,
404
+ "COCn": 328,
405
+ "NCN": 329,
406
+ "NCCOCc": 330,
407
+ "OCCOCC": 331,
408
+ "COCCCCNC": 332,
409
+ "CCOCCCNc": 333,
410
+ "NCCCOCc": 334,
411
+ "COCCCCC": 335,
412
+ "NCCCCc": 336,
413
+ "NCCCCn": 337,
414
+ "SCCS": 338,
415
+ "OCCCCC": 339,
416
+ "CCOCCCn": 340,
417
+ "45": 341,
418
+ "SCCCOc": 342,
419
+ "csnc": 343,
420
+ "conc": 344,
421
+ "CCCCOCC": 345,
422
+ "CCOCCCNS": 346,
423
+ "SCCNc": 347,
424
+ "COCCCCCNC": 348,
425
+ "NCCCCO": 349,
426
+ "SCCCNC": 350,
427
+ "CCCCCNS": 351,
428
+ "CCOCCS": 352,
429
+ "sncc": 353,
430
+ "CCCCCCC": 354,
431
+ "NCCCCCO": 355,
432
+ "SCCNS": 356,
433
+ "OCCCNS": 357,
434
+ "CCCCCCn": 358,
435
+ "OCCCCn": 359,
436
+ "SCCCN": 360,
437
+ "CCOCCO": 361,
438
+ "54": 362,
439
+ "CCOCCc": 363,
440
+ "COCO": 364,
441
+ "COCCCS": 365,
442
+ "cnncc": 366,
443
+ "NCCCCOc": 367,
444
+ "OCN": 368,
445
+ "CCCCOCCNC": 369,
446
+ "42": 370,
447
+ "SCn": 371,
448
+ "SCCCCC": 372,
449
+ "CCOCCNS": 373,
450
+ "CCCCCOC": 374,
451
+ "CCCCCSc": 375,
452
+ "OCCCCN": 376,
453
+ "COCCCSc": 377,
454
+ "COCCCCNc": 378,
455
+ "CCOCO": 379,
456
+ "CCOCS": 380,
457
+ "SCCCn": 381,
458
+ "COCN": 382,
459
+ "cccccc": 383,
460
+ "COCCNCc": 384,
461
+ "NCCCCNC": 385,
462
+ "CCCCCNc": 386,
463
+ "CCCCCCN": 387,
464
+ "COCCCc": 388,
465
+ "COCCOCc": 389,
466
+ "COCCCCSc": 390,
467
+ "CCCCCO": 391,
468
+ "SCCCNS": 392,
469
+ "CNCCC": 393,
470
+ "COCCCCn": 394,
471
+ "OCCCCNC": 395,
472
+ "SCCOCC": 396,
473
+ "CCOCCCN": 397,
474
+ "CCCOCCC": 398,
475
+ "OCCOCc": 399,
476
+ "COCCOCCC": 400,
477
+ "OCCOCCNc": 401,
478
+ "SCCCCCO": 402,
479
+ "On": 403,
480
+ "COCCCCS": 404,
481
+ "CCOCCCCNC": 405,
482
+ "OCCCCOc": 406,
483
+ "CCCOCCNC": 407,
484
+ "SCCSc": 408,
485
+ "CCOCCNc": 409,
486
+ "OCCCS": 410,
487
+ "OCCOCCN": 411,
488
+ "COCCCOCC": 412,
489
+ "CCCCOCCN": 413,
490
+ "ClC": 414,
491
+ "scnn": 415,
492
+ "OCCNCc": 416,
493
+ "CCCOCCc": 417,
494
+ "COCCCCN": 418,
495
+ "(#": 419,
496
+ "OCCCOCC": 420,
497
+ "COCCOCCCC": 421,
498
+ "SCN": 422,
499
+ "CCCCCOCC": 423,
500
+ "NN": 424,
501
+ "CCOn": 425,
502
+ "CCOCCOCc": 426,
503
+ "CCCCCCOc": 427,
504
+ "CCCCCCSc": 428,
505
+ "NCCCCCc": 429,
506
+ "OCCCOC": 430,
507
+ "ncno": 431,
508
+ "OCCCCSc": 432,
509
+ "OCOc": 433,
510
+ "SCCCOCC": 434,
511
+ "CCCCCCNS": 435,
512
+ "nccnn": 436,
513
+ "NCCOCCC": 437,
514
+ "CCCNCc": 438,
515
+ "CCCCCCNc": 439,
516
+ "OCCOCCn": 440,
517
+ "SCCCS": 441,
518
+ "nnccc": 442,
519
+ "COCCCCNS": 443,
520
+ "NCCCCNc": 444,
521
+ "SCCCSc": 445,
522
+ "SCCOCCO": 446,
523
+ "OCCCCCNc": 447,
524
+ "OCCCCCSc": 448,
525
+ "CCCCOCCn": 449,
526
+ "NCCCCOC": 450,
527
+ "ccns": 451,
528
+ "COCCCCOc": 452,
529
+ "CCCCOCc": 453,
530
+ "35": 454,
531
+ "][": 455,
532
+ "CNCCCN": 456,
533
+ "CNCCOc": 457,
534
+ "CCCCCCO": 458,
535
+ "SCCCCO": 459,
536
+ "OCCOn": 460,
537
+ "COCCNCC": 461,
538
+ "NCCNCC": 462,
539
+ "CCCCOCCS": 463,
540
+ "SCSc": 464,
541
+ "SCCCNc": 465,
542
+ "COCSc": 466,
543
+ "CCCOCCO": 467,
544
+ "OCCNCC": 468,
545
+ "COS": 469,
546
+ "NCCNCc": 470,
547
+ "OCCCCO": 471,
548
+ "CCCOCCOC": 472,
549
+ "OCCCCNc": 473,
550
+ "OCCCCc": 474,
551
+ "COCCCCc": 475,
552
+ "COCCCOCc": 476,
553
+ "CCCCCCc": 477,
554
+ "cnon": 478,
555
+ "ncncn": 479,
556
+ "nnns": 480,
557
+ "COCNC": 481,
558
+ "COCCCCCC": 482,
559
+ "CNCCNC": 483,
560
+ "CNCCSc": 484,
561
+ "CCCOCCN": 485,
562
+ "NNC": 486,
563
+ "OCCOCCSc": 487,
564
+ "SCNC": 488,
565
+ "ccs": 489,
566
+ "CCCCOCCSc": 490,
567
+ "COCCCCCn": 491,
568
+ "CCOCn": 492,
569
+ "CCOCCCOC": 493,
570
+ "BrC": 494,
571
+ "CCCOn": 495,
572
+ "CCCOCCOc": 496,
573
+ "OCCOCCO": 497,
574
+ "OCCOCn": 498,
575
+ "CCCCOCCCNC": 499,
576
+ "OCCCCCO": 500,
577
+ "cocc": 501,
578
+ "14": 502,
579
+ "OCCCCCn": 503,
580
+ "SN": 504,
581
+ "SCOC": 505,
582
+ "SCCCCSc": 506,
583
+ "NCS": 507,
584
+ "CNCCn": 508,
585
+ "CNCCCn": 509,
586
+ "CNCCS": 510,
587
+ "CCCCCOCc": 511,
588
+ "NCCCCCN": 512,
589
+ "cncnn": 513,
590
+ "COCCCCOC": 514,
591
+ "CBr": 515,
592
+ "NCO": 516,
593
+ "NCn": 517,
594
+ "COCOC": 518,
595
+ "CNCCO": 519,
596
+ "OCNC": 520,
597
+ "OCSc": 521,
598
+ "CCCCCCOC": 522,
599
+ "CCCOCCCC": 523,
600
+ "SCCNCc": 524,
601
+ "COCCOCn": 525,
602
+ "NSc": 526,
603
+ "nsnc": 527,
604
+ "NCSc": 528,
605
+ "NCOc": 529,
606
+ "NCNS": 530,
607
+ "ncnnc": 531,
608
+ "COCCCCCS": 532,
609
+ "COCS": 533,
610
+ "CCCOCCn": 534,
611
+ "CCCOCCNc": 535,
612
+ "SCCCCN": 536,
613
+ "SCCCOC": 537,
614
+ "SCCCCOc": 538,
615
+ "NCCCOCCc": 539,
616
+ "CCCCOCCC": 540,
617
+ "CCCCOCCOc": 541,
618
+ "COCCOCCCNC": 542,
619
+ "COCCOCCNc": 543,
620
+ "CCCCOCCNc": 544,
621
+ "41": 545,
622
+ "53": 546,
623
+ "OCl": 547,
624
+ "SCCOCCC": 548,
625
+ "CCl": 549,
626
+ "NCOC": 550,
627
+ "COCCCCCN": 551,
628
+ "CCOS": 552,
629
+ "CCNCCN": 553,
630
+ "CNCCOC": 554,
631
+ "CNCCCOc": 555,
632
+ "CNCCOCCO": 556,
633
+ "nncnc": 557,
634
+ "CCOCCNCc": 558,
635
+ "OCCOCCOc": 559,
636
+ "OCCCOn": 560,
637
+ "NCCCCSc": 561,
638
+ "NCCCCCCNC": 562,
639
+ "OCCCCS": 563,
640
+ "COCCOn": 564,
641
+ "COCCOCCOc": 565,
642
+ "COCCOCCn": 566,
643
+ "NCCOCCn": 567,
644
+ "65": 568,
645
+ "OCCCCCCSc": 569,
646
+ "OCCCCCCNc": 570,
647
+ "Sn": 571,
648
+ "SNC": 572,
649
+ "SCO": 573,
650
+ "SCS": 574,
651
+ "SCCNCC": 575,
652
+ "NCNc": 576,
653
+ "CCNCCSc": 577,
654
+ "CNCCCOC": 578,
655
+ "OCS": 579,
656
+ "NCCCCOCc": 580,
657
+ "COCNc": 581,
658
+ "COCNCc": 582,
659
+ "CCOCCCc": 583,
660
+ "CCOCCCOc": 584,
661
+ "OCCCOCc": 585,
662
+ "OCCCCNS": 586,
663
+ "CCCOCCCn": 587,
664
+ "NSNC": 588,
665
+ "OCCOCCc": 589,
666
+ "CCCCCCS": 590,
667
+ "NCCOCCCC": 591,
668
+ "NCCOCCOC": 592,
669
+ "CCCCOCCCN": 593,
670
+ "NCCCCCOC": 594,
671
+ "OCCCCCc": 595,
672
+ "COCCOCCN": 596,
673
+ "COCCCCCc": 597,
674
+ "COCCOCCS": 598,
675
+ "COCCOCCSc": 599,
676
+ "SCCOCCN": 600
677
+ },
678
+ "merges": [
679
+ [
680
+ "c",
681
+ "c"
682
+ ],
683
+ [
684
+ "C",
685
+ "C"
686
+ ],
687
+ [
688
+ "(",
689
+ "="
690
+ ],
691
+ [
692
+ "cc",
693
+ "c"
694
+ ],
695
+ [
696
+ "N",
697
+ "C"
698
+ ],
699
+ [
700
+ "n",
701
+ "c"
702
+ ],
703
+ [
704
+ "C",
705
+ "c"
706
+ ],
707
+ [
708
+ "cc",
709
+ "ccc"
710
+ ],
711
+ [
712
+ "C",
713
+ "O"
714
+ ],
715
+ [
716
+ "CC",
717
+ "C"
718
+ ],
719
+ [
720
+ "cc",
721
+ "cc"
722
+ ],
723
+ [
724
+ "N",
725
+ "c"
726
+ ],
727
+ [
728
+ "(",
729
+ "-"
730
+ ],
731
+ [
732
+ "CC",
733
+ "O"
734
+ ],
735
+ [
736
+ "CO",
737
+ "c"
738
+ ],
739
+ [
740
+ "CC",
741
+ "N"
742
+ ],
743
+ [
744
+ "n",
745
+ "n"
746
+ ],
747
+ [
748
+ "C",
749
+ "l"
750
+ ],
751
+ [
752
+ "n",
753
+ "H"
754
+ ],
755
+ [
756
+ "C",
757
+ "N"
758
+ ],
759
+ [
760
+ "O",
761
+ "C"
762
+ ],
763
+ [
764
+ "CC",
765
+ "CC"
766
+ ],
767
+ [
768
+ ")",
769
+ "(="
770
+ ],
771
+ [
772
+ ")",
773
+ "("
774
+ ],
775
+ [
776
+ "CC",
777
+ "c"
778
+ ],
779
+ [
780
+ "n",
781
+ "cc"
782
+ ],
783
+ [
784
+ "NC",
785
+ "c"
786
+ ],
787
+ [
788
+ "s",
789
+ "c"
790
+ ],
791
+ [
792
+ "n",
793
+ "nc"
794
+ ],
795
+ [
796
+ ")",
797
+ "="
798
+ ],
799
+ [
800
+ "C",
801
+ "NC"
802
+ ],
803
+ [
804
+ "N",
805
+ "CC"
806
+ ],
807
+ [
808
+ "C",
809
+ "n"
810
+ ],
811
+ [
812
+ "o",
813
+ "c"
814
+ ],
815
+ [
816
+ "CO",
817
+ "C"
818
+ ],
819
+ [
820
+ "c",
821
+ "n"
822
+ ],
823
+ [
824
+ "S",
825
+ "c"
826
+ ],
827
+ [
828
+ "c",
829
+ "nc"
830
+ ],
831
+ [
832
+ "1",
833
+ "2"
834
+ ],
835
+ [
836
+ "2",
837
+ "1"
838
+ ],
839
+ [
840
+ "CCO",
841
+ "C"
842
+ ],
843
+ [
844
+ "CC",
845
+ "CCC"
846
+ ],
847
+ [
848
+ "ccc",
849
+ "nc"
850
+ ],
851
+ [
852
+ "CCO",
853
+ "CC"
854
+ ],
855
+ [
856
+ "O",
857
+ "CC"
858
+ ],
859
+ [
860
+ "B",
861
+ "r"
862
+ ],
863
+ [
864
+ "c",
865
+ "nn"
866
+ ],
867
+ [
868
+ "CCC",
869
+ "N"
870
+ ],
871
+ [
872
+ "CC",
873
+ "NC"
874
+ ],
875
+ [
876
+ "CCO",
877
+ "c"
878
+ ],
879
+ [
880
+ ")",
881
+ "["
882
+ ],
883
+ [
884
+ "cccc",
885
+ "n"
886
+ ],
887
+ [
888
+ "n",
889
+ "oc"
890
+ ],
891
+ [
892
+ "C",
893
+ "Sc"
894
+ ],
895
+ [
896
+ "CC",
897
+ "n"
898
+ ],
899
+ [
900
+ "O",
901
+ "c"
902
+ ],
903
+ [
904
+ "ccc",
905
+ "s"
906
+ ],
907
+ [
908
+ "ccc",
909
+ "o"
910
+ ],
911
+ [
912
+ "n",
913
+ "o"
914
+ ],
915
+ [
916
+ "cc",
917
+ "ncc"
918
+ ],
919
+ [
920
+ "c",
921
+ "sc"
922
+ ],
923
+ [
924
+ "ccc",
925
+ "n"
926
+ ],
927
+ [
928
+ "cc",
929
+ "nc"
930
+ ],
931
+ [
932
+ "N",
933
+ "CCc"
934
+ ],
935
+ [
936
+ "C",
937
+ "Nc"
938
+ ],
939
+ [
940
+ "c",
941
+ "s"
942
+ ],
943
+ [
944
+ "2",
945
+ "3"
946
+ ],
947
+ [
948
+ "CC",
949
+ "CO"
950
+ ],
951
+ [
952
+ "3",
953
+ "2"
954
+ ],
955
+ [
956
+ "cc",
957
+ "sc"
958
+ ],
959
+ [
960
+ "cc",
961
+ "n"
962
+ ],
963
+ [
964
+ "O",
965
+ "Cc"
966
+ ],
967
+ [
968
+ "N",
969
+ "S"
970
+ ],
971
+ [
972
+ "nc",
973
+ "nc"
974
+ ],
975
+ [
976
+ "CC",
977
+ "Cc"
978
+ ],
979
+ [
980
+ "CCC",
981
+ "NC"
982
+ ],
983
+ [
984
+ "n",
985
+ "cccc"
986
+ ],
987
+ [
988
+ "S",
989
+ "CC"
990
+ ],
991
+ [
992
+ "ncc",
993
+ "n"
994
+ ],
995
+ [
996
+ "CO",
997
+ "CC"
998
+ ],
999
+ [
1000
+ "o",
1001
+ "n"
1002
+ ],
1003
+ [
1004
+ "C",
1005
+ "S"
1006
+ ],
1007
+ [
1008
+ "O",
1009
+ "CO"
1010
+ ],
1011
+ [
1012
+ "O",
1013
+ "CCO"
1014
+ ],
1015
+ [
1016
+ "CCC",
1017
+ "n"
1018
+ ],
1019
+ [
1020
+ "nc",
1021
+ "n"
1022
+ ],
1023
+ [
1024
+ "cnc",
1025
+ "n"
1026
+ ],
1027
+ [
1028
+ "N",
1029
+ "CCC"
1030
+ ],
1031
+ [
1032
+ "cc",
1033
+ "oc"
1034
+ ],
1035
+ [
1036
+ "n",
1037
+ "ccc"
1038
+ ],
1039
+ [
1040
+ "ncc",
1041
+ "s"
1042
+ ],
1043
+ [
1044
+ "CO",
1045
+ "Cc"
1046
+ ],
1047
+ [
1048
+ "n",
1049
+ "cccn"
1050
+ ],
1051
+ [
1052
+ "CCCC",
1053
+ "CC"
1054
+ ],
1055
+ [
1056
+ "s",
1057
+ "ccc"
1058
+ ],
1059
+ [
1060
+ "CC",
1061
+ "CCN"
1062
+ ],
1063
+ [
1064
+ "c",
1065
+ "nccn"
1066
+ ],
1067
+ [
1068
+ "nn",
1069
+ "nn"
1070
+ ],
1071
+ [
1072
+ "S",
1073
+ "C"
1074
+ ],
1075
+ [
1076
+ "CO",
1077
+ "CCN"
1078
+ ],
1079
+ [
1080
+ "CN",
1081
+ "S"
1082
+ ],
1083
+ [
1084
+ "cc",
1085
+ "nn"
1086
+ ],
1087
+ [
1088
+ "c",
1089
+ "nnc"
1090
+ ],
1091
+ [
1092
+ "cnn",
1093
+ "n"
1094
+ ],
1095
+ [
1096
+ "CC",
1097
+ "Sc"
1098
+ ],
1099
+ [
1100
+ "S",
1101
+ "Cc"
1102
+ ],
1103
+ [
1104
+ "NCC",
1105
+ "NC"
1106
+ ],
1107
+ [
1108
+ "nn",
1109
+ "n"
1110
+ ],
1111
+ [
1112
+ "CC",
1113
+ "S"
1114
+ ],
1115
+ [
1116
+ "N",
1117
+ "CCN"
1118
+ ],
1119
+ [
1120
+ "nnc",
1121
+ "n"
1122
+ ],
1123
+ [
1124
+ "c",
1125
+ "ncc"
1126
+ ],
1127
+ [
1128
+ "CC",
1129
+ "COc"
1130
+ ],
1131
+ [
1132
+ "CC",
1133
+ "Nc"
1134
+ ],
1135
+ [
1136
+ "nn",
1137
+ "nc"
1138
+ ],
1139
+ [
1140
+ "o",
1141
+ "nc"
1142
+ ],
1143
+ [
1144
+ "F",
1145
+ "c"
1146
+ ],
1147
+ [
1148
+ "c",
1149
+ "oc"
1150
+ ],
1151
+ [
1152
+ "CO",
1153
+ "CCNC"
1154
+ ],
1155
+ [
1156
+ "o",
1157
+ "ccc"
1158
+ ],
1159
+ [
1160
+ "CCCC",
1161
+ "NC"
1162
+ ],
1163
+ [
1164
+ "csc",
1165
+ "n"
1166
+ ],
1167
+ [
1168
+ "CCN",
1169
+ "S"
1170
+ ],
1171
+ [
1172
+ "CCC",
1173
+ "S"
1174
+ ],
1175
+ [
1176
+ "N",
1177
+ "CCOc"
1178
+ ],
1179
+ [
1180
+ "O",
1181
+ "CCN"
1182
+ ],
1183
+ [
1184
+ "sc",
1185
+ "nc"
1186
+ ],
1187
+ [
1188
+ "O",
1189
+ "CCC"
1190
+ ],
1191
+ [
1192
+ "N",
1193
+ "CCO"
1194
+ ],
1195
+ [
1196
+ "3",
1197
+ "4"
1198
+ ],
1199
+ [
1200
+ "CC",
1201
+ "COC"
1202
+ ],
1203
+ [
1204
+ "ncc",
1205
+ "nc"
1206
+ ],
1207
+ [
1208
+ "N",
1209
+ "CCCn"
1210
+ ],
1211
+ [
1212
+ "nnc",
1213
+ "s"
1214
+ ],
1215
+ [
1216
+ "nc",
1217
+ "nn"
1218
+ ],
1219
+ [
1220
+ "N",
1221
+ "CCCN"
1222
+ ],
1223
+ [
1224
+ "OCC",
1225
+ "CO"
1226
+ ],
1227
+ [
1228
+ "NCC",
1229
+ "n"
1230
+ ],
1231
+ [
1232
+ "CO",
1233
+ "CCn"
1234
+ ],
1235
+ [
1236
+ "cn",
1237
+ "ccc"
1238
+ ],
1239
+ [
1240
+ "4",
1241
+ "3"
1242
+ ],
1243
+ [
1244
+ "no",
1245
+ "nc"
1246
+ ],
1247
+ [
1248
+ "CCCC",
1249
+ "n"
1250
+ ],
1251
+ [
1252
+ "NCC",
1253
+ "Cc"
1254
+ ],
1255
+ [
1256
+ "C",
1257
+ "NCc"
1258
+ ],
1259
+ [
1260
+ "CO",
1261
+ "CCOc"
1262
+ ],
1263
+ [
1264
+ "nc",
1265
+ "sc"
1266
+ ],
1267
+ [
1268
+ "CO",
1269
+ "CCCNC"
1270
+ ],
1271
+ [
1272
+ "N",
1273
+ "n"
1274
+ ],
1275
+ [
1276
+ "F",
1277
+ "C"
1278
+ ],
1279
+ [
1280
+ "Cl",
1281
+ "c"
1282
+ ],
1283
+ [
1284
+ "O",
1285
+ "CCc"
1286
+ ],
1287
+ [
1288
+ "NCC",
1289
+ "CO"
1290
+ ],
1291
+ [
1292
+ "CO",
1293
+ "CCC"
1294
+ ],
1295
+ [
1296
+ "CC",
1297
+ "CCO"
1298
+ ],
1299
+ [
1300
+ "NCC",
1301
+ "Nc"
1302
+ ],
1303
+ [
1304
+ "CCCC",
1305
+ "c"
1306
+ ],
1307
+ [
1308
+ "n",
1309
+ "sc"
1310
+ ],
1311
+ [
1312
+ "cc",
1313
+ "on"
1314
+ ],
1315
+ [
1316
+ "OCC",
1317
+ "n"
1318
+ ],
1319
+ [
1320
+ "cnc",
1321
+ "nc"
1322
+ ],
1323
+ [
1324
+ "ccc",
1325
+ "nn"
1326
+ ],
1327
+ [
1328
+ "OCC",
1329
+ "NC"
1330
+ ],
1331
+ [
1332
+ "CCC",
1333
+ "Nc"
1334
+ ],
1335
+ [
1336
+ "(",
1337
+ "["
1338
+ ],
1339
+ [
1340
+ "CN",
1341
+ "CC"
1342
+ ],
1343
+ [
1344
+ "s",
1345
+ "cc"
1346
+ ],
1347
+ [
1348
+ "nn",
1349
+ "sc"
1350
+ ],
1351
+ [
1352
+ "cs",
1353
+ "nn"
1354
+ ],
1355
+ [
1356
+ "N",
1357
+ "CCCC"
1358
+ ],
1359
+ [
1360
+ "O",
1361
+ "CCCC"
1362
+ ],
1363
+ [
1364
+ "CCN",
1365
+ "CC"
1366
+ ],
1367
+ [
1368
+ "S",
1369
+ "CCC"
1370
+ ],
1371
+ [
1372
+ "CCCC",
1373
+ "S"
1374
+ ],
1375
+ [
1376
+ "O",
1377
+ "S"
1378
+ ],
1379
+ [
1380
+ "s",
1381
+ "ccn"
1382
+ ],
1383
+ [
1384
+ "Br",
1385
+ "c"
1386
+ ],
1387
+ [
1388
+ "CCO",
1389
+ "Cc"
1390
+ ],
1391
+ [
1392
+ "1",
1393
+ "3"
1394
+ ],
1395
+ [
1396
+ "CCC",
1397
+ "Sc"
1398
+ ],
1399
+ [
1400
+ "NCC",
1401
+ "S"
1402
+ ],
1403
+ [
1404
+ "N",
1405
+ "CCOC"
1406
+ ],
1407
+ [
1408
+ "CC",
1409
+ "CCOc"
1410
+ ],
1411
+ [
1412
+ "cc",
1413
+ "no"
1414
+ ],
1415
+ [
1416
+ "nn",
1417
+ "cc"
1418
+ ],
1419
+ [
1420
+ "CCCN",
1421
+ "S"
1422
+ ],
1423
+ [
1424
+ "N",
1425
+ "CCCNC"
1426
+ ],
1427
+ [
1428
+ "c",
1429
+ "o"
1430
+ ],
1431
+ [
1432
+ "NCC",
1433
+ "COC"
1434
+ ],
1435
+ [
1436
+ "s",
1437
+ "nc"
1438
+ ],
1439
+ [
1440
+ "CO",
1441
+ "CCO"
1442
+ ],
1443
+ [
1444
+ "CO",
1445
+ "CCc"
1446
+ ],
1447
+ [
1448
+ "CCCCC",
1449
+ "NC"
1450
+ ],
1451
+ [
1452
+ "CCO",
1453
+ "CCCNC"
1454
+ ],
1455
+ [
1456
+ "nc",
1457
+ "oc"
1458
+ ],
1459
+ [
1460
+ "COCC",
1461
+ "Nc"
1462
+ ],
1463
+ [
1464
+ "NCC",
1465
+ "Sc"
1466
+ ],
1467
+ [
1468
+ "n",
1469
+ "s"
1470
+ ],
1471
+ [
1472
+ "nnc",
1473
+ "o"
1474
+ ],
1475
+ [
1476
+ "CN",
1477
+ "CCc"
1478
+ ],
1479
+ [
1480
+ "CCOCC",
1481
+ "NC"
1482
+ ],
1483
+ [
1484
+ "ncc",
1485
+ "o"
1486
+ ],
1487
+ [
1488
+ "s",
1489
+ "nnc"
1490
+ ],
1491
+ [
1492
+ "CO",
1493
+ "CCOC"
1494
+ ],
1495
+ [
1496
+ "cc",
1497
+ "nnc"
1498
+ ],
1499
+ [
1500
+ "CO",
1501
+ "CCCN"
1502
+ ],
1503
+ [
1504
+ "CO",
1505
+ "CCCC"
1506
+ ],
1507
+ [
1508
+ "CCCCC",
1509
+ "N"
1510
+ ],
1511
+ [
1512
+ "cnn",
1513
+ "s"
1514
+ ],
1515
+ [
1516
+ "OCC",
1517
+ "Nc"
1518
+ ],
1519
+ [
1520
+ "O",
1521
+ "CCCNC"
1522
+ ],
1523
+ [
1524
+ "oc",
1525
+ "nc"
1526
+ ],
1527
+ [
1528
+ "CC",
1529
+ "NCc"
1530
+ ],
1531
+ [
1532
+ "NCCN",
1533
+ "S"
1534
+ ],
1535
+ [
1536
+ "CCO",
1537
+ "CCC"
1538
+ ],
1539
+ [
1540
+ "ccnc",
1541
+ "n"
1542
+ ],
1543
+ [
1544
+ "cnc",
1545
+ "s"
1546
+ ],
1547
+ [
1548
+ "O",
1549
+ "CCCn"
1550
+ ],
1551
+ [
1552
+ "O",
1553
+ "CCOc"
1554
+ ],
1555
+ [
1556
+ "O",
1557
+ "CCOC"
1558
+ ],
1559
+ [
1560
+ "nc",
1561
+ "ncc"
1562
+ ],
1563
+ [
1564
+ "CC",
1565
+ "CCOC"
1566
+ ],
1567
+ [
1568
+ "OCC",
1569
+ "Sc"
1570
+ ],
1571
+ [
1572
+ "OCC",
1573
+ "Cc"
1574
+ ],
1575
+ [
1576
+ "CCCO",
1577
+ "CC"
1578
+ ],
1579
+ [
1580
+ "CO",
1581
+ "CCCn"
1582
+ ],
1583
+ [
1584
+ "NCC",
1585
+ "COCC"
1586
+ ],
1587
+ [
1588
+ "CO",
1589
+ "CCOCC"
1590
+ ],
1591
+ [
1592
+ ")",
1593
+ "-"
1594
+ ],
1595
+ [
1596
+ "3",
1597
+ "1"
1598
+ ],
1599
+ [
1600
+ "NCC",
1601
+ "COc"
1602
+ ],
1603
+ [
1604
+ "N",
1605
+ "CCOCC"
1606
+ ],
1607
+ [
1608
+ "O",
1609
+ "COC"
1610
+ ],
1611
+ [
1612
+ "o",
1613
+ "cc"
1614
+ ],
1615
+ [
1616
+ "NCC",
1617
+ "CCC"
1618
+ ],
1619
+ [
1620
+ "o",
1621
+ "ncc"
1622
+ ],
1623
+ [
1624
+ "cnc",
1625
+ "o"
1626
+ ],
1627
+ [
1628
+ "cn",
1629
+ "oc"
1630
+ ],
1631
+ [
1632
+ "CCCC",
1633
+ "Nc"
1634
+ ],
1635
+ [
1636
+ "NCCC",
1637
+ "Sc"
1638
+ ],
1639
+ [
1640
+ "NCCC",
1641
+ "Nc"
1642
+ ],
1643
+ [
1644
+ "S",
1645
+ "CCOc"
1646
+ ],
1647
+ [
1648
+ "S",
1649
+ "CCn"
1650
+ ],
1651
+ [
1652
+ "O",
1653
+ "CCCN"
1654
+ ],
1655
+ [
1656
+ "COCCN",
1657
+ "S"
1658
+ ],
1659
+ [
1660
+ "nc",
1661
+ "on"
1662
+ ],
1663
+ [
1664
+ "CCCC",
1665
+ "Sc"
1666
+ ],
1667
+ [
1668
+ "COCCC",
1669
+ "Nc"
1670
+ ],
1671
+ [
1672
+ "NCC",
1673
+ "CCN"
1674
+ ],
1675
+ [
1676
+ "COCC",
1677
+ "S"
1678
+ ],
1679
+ [
1680
+ "S",
1681
+ "CCCC"
1682
+ ],
1683
+ [
1684
+ "S",
1685
+ "CCNC"
1686
+ ],
1687
+ [
1688
+ "S",
1689
+ "CCc"
1690
+ ],
1691
+ [
1692
+ "n",
1693
+ "scc"
1694
+ ],
1695
+ [
1696
+ "CCCC",
1697
+ "CCNC"
1698
+ ],
1699
+ [
1700
+ "CCO",
1701
+ "CCCC"
1702
+ ],
1703
+ [
1704
+ "S",
1705
+ "CCN"
1706
+ ],
1707
+ [
1708
+ "OC",
1709
+ "n"
1710
+ ],
1711
+ [
1712
+ "CO",
1713
+ "n"
1714
+ ],
1715
+ [
1716
+ "CCCCC",
1717
+ "n"
1718
+ ],
1719
+ [
1720
+ "CCOCC",
1721
+ "n"
1722
+ ],
1723
+ [
1724
+ "S",
1725
+ "CCO"
1726
+ ],
1727
+ [
1728
+ "2",
1729
+ "4"
1730
+ ],
1731
+ [
1732
+ "NCCO",
1733
+ "CCO"
1734
+ ],
1735
+ [
1736
+ "CCCCN",
1737
+ "S"
1738
+ ],
1739
+ [
1740
+ "CCO",
1741
+ "CCOc"
1742
+ ],
1743
+ [
1744
+ "CCO",
1745
+ "CCOC"
1746
+ ],
1747
+ [
1748
+ "CCCC",
1749
+ "Cc"
1750
+ ],
1751
+ [
1752
+ "COCC",
1753
+ "Sc"
1754
+ ],
1755
+ [
1756
+ "NCCC",
1757
+ "S"
1758
+ ],
1759
+ [
1760
+ "CCO",
1761
+ "CCN"
1762
+ ],
1763
+ [
1764
+ "cs",
1765
+ "cc"
1766
+ ],
1767
+ [
1768
+ "NCCCN",
1769
+ "S"
1770
+ ],
1771
+ [
1772
+ "OCC",
1773
+ "COc"
1774
+ ],
1775
+ [
1776
+ "s",
1777
+ "n"
1778
+ ],
1779
+ [
1780
+ "O",
1781
+ "CCNS"
1782
+ ],
1783
+ [
1784
+ "CCCO",
1785
+ "Cc"
1786
+ ],
1787
+ [
1788
+ "S",
1789
+ "CCCO"
1790
+ ],
1791
+ [
1792
+ "OCCC",
1793
+ "Nc"
1794
+ ],
1795
+ [
1796
+ "CO",
1797
+ "CCCNS"
1798
+ ],
1799
+ [
1800
+ "COCC",
1801
+ "COc"
1802
+ ],
1803
+ [
1804
+ "CN",
1805
+ "n"
1806
+ ],
1807
+ [
1808
+ "S",
1809
+ "CCCc"
1810
+ ],
1811
+ [
1812
+ "no",
1813
+ "cc"
1814
+ ],
1815
+ [
1816
+ "CCOCC",
1817
+ "Sc"
1818
+ ],
1819
+ [
1820
+ "CN",
1821
+ "CCN"
1822
+ ],
1823
+ [
1824
+ "cn",
1825
+ "sc"
1826
+ ],
1827
+ [
1828
+ "OCCC",
1829
+ "Sc"
1830
+ ],
1831
+ [
1832
+ "CO",
1833
+ "CCOCCNC"
1834
+ ],
1835
+ [
1836
+ "S",
1837
+ "CCOC"
1838
+ ],
1839
+ [
1840
+ "coc",
1841
+ "n"
1842
+ ],
1843
+ [
1844
+ "CCCCC",
1845
+ "S"
1846
+ ],
1847
+ [
1848
+ "COCC",
1849
+ "COC"
1850
+ ],
1851
+ [
1852
+ "OCC",
1853
+ "S"
1854
+ ],
1855
+ [
1856
+ "CCO",
1857
+ "CCOCC"
1858
+ ],
1859
+ [
1860
+ "N",
1861
+ "CCCCCC"
1862
+ ],
1863
+ [
1864
+ "CCCC",
1865
+ "COc"
1866
+ ],
1867
+ [
1868
+ "CO",
1869
+ "Cn"
1870
+ ],
1871
+ [
1872
+ "NC",
1873
+ "N"
1874
+ ],
1875
+ [
1876
+ "NCCO",
1877
+ "Cc"
1878
+ ],
1879
+ [
1880
+ "O",
1881
+ "CCOCC"
1882
+ ],
1883
+ [
1884
+ "CO",
1885
+ "CCCCNC"
1886
+ ],
1887
+ [
1888
+ "CCO",
1889
+ "CCCNc"
1890
+ ],
1891
+ [
1892
+ "NCC",
1893
+ "COCc"
1894
+ ],
1895
+ [
1896
+ "CO",
1897
+ "CCCCC"
1898
+ ],
1899
+ [
1900
+ "N",
1901
+ "CCCCc"
1902
+ ],
1903
+ [
1904
+ "N",
1905
+ "CCCCn"
1906
+ ],
1907
+ [
1908
+ "SCC",
1909
+ "S"
1910
+ ],
1911
+ [
1912
+ "O",
1913
+ "CCCCC"
1914
+ ],
1915
+ [
1916
+ "CCO",
1917
+ "CCCn"
1918
+ ],
1919
+ [
1920
+ "4",
1921
+ "5"
1922
+ ],
1923
+ [
1924
+ "SCC",
1925
+ "COc"
1926
+ ],
1927
+ [
1928
+ "cs",
1929
+ "nc"
1930
+ ],
1931
+ [
1932
+ "c",
1933
+ "onc"
1934
+ ],
1935
+ [
1936
+ "CC",
1937
+ "CCOCC"
1938
+ ],
1939
+ [
1940
+ "CCO",
1941
+ "CCCNS"
1942
+ ],
1943
+ [
1944
+ "SCC",
1945
+ "Nc"
1946
+ ],
1947
+ [
1948
+ "CO",
1949
+ "CCCCCNC"
1950
+ ],
1951
+ [
1952
+ "NCC",
1953
+ "CCO"
1954
+ ],
1955
+ [
1956
+ "S",
1957
+ "CCCNC"
1958
+ ],
1959
+ [
1960
+ "CCCCC",
1961
+ "NS"
1962
+ ],
1963
+ [
1964
+ "CCOCC",
1965
+ "S"
1966
+ ],
1967
+ [
1968
+ "s",
1969
+ "ncc"
1970
+ ],
1971
+ [
1972
+ "CCCC",
1973
+ "CCC"
1974
+ ],
1975
+ [
1976
+ "NCCCC",
1977
+ "CO"
1978
+ ],
1979
+ [
1980
+ "S",
1981
+ "CCNS"
1982
+ ],
1983
+ [
1984
+ "O",
1985
+ "CCCNS"
1986
+ ],
1987
+ [
1988
+ "CCCC",
1989
+ "CCn"
1990
+ ],
1991
+ [
1992
+ "O",
1993
+ "CCCCn"
1994
+ ],
1995
+ [
1996
+ "S",
1997
+ "CCCN"
1998
+ ],
1999
+ [
2000
+ "CCO",
2001
+ "CCO"
2002
+ ],
2003
+ [
2004
+ "5",
2005
+ "4"
2006
+ ],
2007
+ [
2008
+ "CCO",
2009
+ "CCc"
2010
+ ],
2011
+ [
2012
+ "CO",
2013
+ "CO"
2014
+ ],
2015
+ [
2016
+ "CO",
2017
+ "CCCS"
2018
+ ],
2019
+ [
2020
+ "cnn",
2021
+ "cc"
2022
+ ],
2023
+ [
2024
+ "NCC",
2025
+ "CCOc"
2026
+ ],
2027
+ [
2028
+ "O",
2029
+ "CN"
2030
+ ],
2031
+ [
2032
+ "CC",
2033
+ "CCOCCNC"
2034
+ ],
2035
+ [
2036
+ "4",
2037
+ "2"
2038
+ ],
2039
+ [
2040
+ "S",
2041
+ "Cn"
2042
+ ],
2043
+ [
2044
+ "S",
2045
+ "CCCCC"
2046
+ ],
2047
+ [
2048
+ "CCO",
2049
+ "CCNS"
2050
+ ],
2051
+ [
2052
+ "CCCC",
2053
+ "COC"
2054
+ ],
2055
+ [
2056
+ "CCCCC",
2057
+ "Sc"
2058
+ ],
2059
+ [
2060
+ "OCC",
2061
+ "CCN"
2062
+ ],
2063
+ [
2064
+ "COCCC",
2065
+ "Sc"
2066
+ ],
2067
+ [
2068
+ "COCCCC",
2069
+ "Nc"
2070
+ ],
2071
+ [
2072
+ "CCO",
2073
+ "CO"
2074
+ ],
2075
+ [
2076
+ "CCOC",
2077
+ "S"
2078
+ ],
2079
+ [
2080
+ "S",
2081
+ "CCCn"
2082
+ ],
2083
+ [
2084
+ "CO",
2085
+ "CN"
2086
+ ],
2087
+ [
2088
+ "cccc",
2089
+ "cc"
2090
+ ],
2091
+ [
2092
+ "COCC",
2093
+ "NCc"
2094
+ ],
2095
+ [
2096
+ "N",
2097
+ "CCCCNC"
2098
+ ],
2099
+ [
2100
+ "CCCCC",
2101
+ "Nc"
2102
+ ],
2103
+ [
2104
+ "CCCC",
2105
+ "CCN"
2106
+ ],
2107
+ [
2108
+ "CO",
2109
+ "CCCc"
2110
+ ],
2111
+ [
2112
+ "CO",
2113
+ "CCOCc"
2114
+ ],
2115
+ [
2116
+ "COCCCC",
2117
+ "Sc"
2118
+ ],
2119
+ [
2120
+ "CCCC",
2121
+ "CO"
2122
+ ],
2123
+ [
2124
+ "S",
2125
+ "CCCNS"
2126
+ ],
2127
+ [
2128
+ "CN",
2129
+ "CCC"
2130
+ ],
2131
+ [
2132
+ "CO",
2133
+ "CCCCn"
2134
+ ],
2135
+ [
2136
+ "O",
2137
+ "CCCCNC"
2138
+ ],
2139
+ [
2140
+ "S",
2141
+ "CCOCC"
2142
+ ],
2143
+ [
2144
+ "CCO",
2145
+ "CCCN"
2146
+ ],
2147
+ [
2148
+ "CCCO",
2149
+ "CCC"
2150
+ ],
2151
+ [
2152
+ "OCCO",
2153
+ "Cc"
2154
+ ],
2155
+ [
2156
+ "COCCO",
2157
+ "CCC"
2158
+ ],
2159
+ [
2160
+ "OCCOCC",
2161
+ "Nc"
2162
+ ],
2163
+ [
2164
+ "SCCCC",
2165
+ "CO"
2166
+ ],
2167
+ [
2168
+ "O",
2169
+ "n"
2170
+ ],
2171
+ [
2172
+ "CO",
2173
+ "CCCCS"
2174
+ ],
2175
+ [
2176
+ "CCO",
2177
+ "CCCCNC"
2178
+ ],
2179
+ [
2180
+ "OCC",
2181
+ "CCOc"
2182
+ ],
2183
+ [
2184
+ "CCCO",
2185
+ "CCNC"
2186
+ ],
2187
+ [
2188
+ "SCC",
2189
+ "Sc"
2190
+ ],
2191
+ [
2192
+ "CCOCC",
2193
+ "Nc"
2194
+ ],
2195
+ [
2196
+ "O",
2197
+ "CCCS"
2198
+ ],
2199
+ [
2200
+ "OCCO",
2201
+ "CCN"
2202
+ ],
2203
+ [
2204
+ "CO",
2205
+ "CCCOCC"
2206
+ ],
2207
+ [
2208
+ "CCCCO",
2209
+ "CCN"
2210
+ ],
2211
+ [
2212
+ "Cl",
2213
+ "C"
2214
+ ],
2215
+ [
2216
+ "sc",
2217
+ "nn"
2218
+ ],
2219
+ [
2220
+ "OCC",
2221
+ "NCc"
2222
+ ],
2223
+ [
2224
+ "CCCO",
2225
+ "CCc"
2226
+ ],
2227
+ [
2228
+ "COCC",
2229
+ "CCN"
2230
+ ],
2231
+ [
2232
+ "(",
2233
+ "#"
2234
+ ],
2235
+ [
2236
+ "OCC",
2237
+ "COCC"
2238
+ ],
2239
+ [
2240
+ "COCCO",
2241
+ "CCCC"
2242
+ ],
2243
+ [
2244
+ "S",
2245
+ "CN"
2246
+ ],
2247
+ [
2248
+ "CCCC",
2249
+ "COCC"
2250
+ ],
2251
+ [
2252
+ "N",
2253
+ "N"
2254
+ ],
2255
+ [
2256
+ "CCO",
2257
+ "n"
2258
+ ],
2259
+ [
2260
+ "CCO",
2261
+ "CCOCc"
2262
+ ],
2263
+ [
2264
+ "CCCC",
2265
+ "CCOc"
2266
+ ],
2267
+ [
2268
+ "CCCCCC",
2269
+ "Sc"
2270
+ ],
2271
+ [
2272
+ "NCCCC",
2273
+ "Cc"
2274
+ ],
2275
+ [
2276
+ "OCC",
2277
+ "COC"
2278
+ ],
2279
+ [
2280
+ "nc",
2281
+ "no"
2282
+ ],
2283
+ [
2284
+ "OCCCC",
2285
+ "Sc"
2286
+ ],
2287
+ [
2288
+ "O",
2289
+ "COc"
2290
+ ],
2291
+ [
2292
+ "S",
2293
+ "CCCOCC"
2294
+ ],
2295
+ [
2296
+ "CCCC",
2297
+ "CCNS"
2298
+ ],
2299
+ [
2300
+ "ncc",
2301
+ "nn"
2302
+ ],
2303
+ [
2304
+ "NCCO",
2305
+ "CCC"
2306
+ ],
2307
+ [
2308
+ "CCC",
2309
+ "NCc"
2310
+ ],
2311
+ [
2312
+ "CCCCCC",
2313
+ "Nc"
2314
+ ],
2315
+ [
2316
+ "O",
2317
+ "CCOCCn"
2318
+ ],
2319
+ [
2320
+ "S",
2321
+ "CCCS"
2322
+ ],
2323
+ [
2324
+ "nn",
2325
+ "ccc"
2326
+ ],
2327
+ [
2328
+ "COCC",
2329
+ "CCNS"
2330
+ ],
2331
+ [
2332
+ "NCCCC",
2333
+ "Nc"
2334
+ ],
2335
+ [
2336
+ "SCCC",
2337
+ "Sc"
2338
+ ],
2339
+ [
2340
+ "SCCO",
2341
+ "CCO"
2342
+ ],
2343
+ [
2344
+ "OCCCCC",
2345
+ "Nc"
2346
+ ],
2347
+ [
2348
+ "OCCCCC",
2349
+ "Sc"
2350
+ ],
2351
+ [
2352
+ "CC",
2353
+ "CCOCCn"
2354
+ ],
2355
+ [
2356
+ "NCC",
2357
+ "CCOC"
2358
+ ],
2359
+ [
2360
+ "ccn",
2361
+ "s"
2362
+ ],
2363
+ [
2364
+ "COCC",
2365
+ "CCOc"
2366
+ ],
2367
+ [
2368
+ "CCCCO",
2369
+ "Cc"
2370
+ ],
2371
+ [
2372
+ "3",
2373
+ "5"
2374
+ ],
2375
+ [
2376
+ "]",
2377
+ "["
2378
+ ],
2379
+ [
2380
+ "CN",
2381
+ "CCCN"
2382
+ ],
2383
+ [
2384
+ "CN",
2385
+ "CCOc"
2386
+ ],
2387
+ [
2388
+ "CCCC",
2389
+ "CCO"
2390
+ ],
2391
+ [
2392
+ "SCC",
2393
+ "CCO"
2394
+ ],
2395
+ [
2396
+ "OCCO",
2397
+ "n"
2398
+ ],
2399
+ [
2400
+ "COCCN",
2401
+ "CC"
2402
+ ],
2403
+ [
2404
+ "NCCN",
2405
+ "CC"
2406
+ ],
2407
+ [
2408
+ "CCCCOCC",
2409
+ "S"
2410
+ ],
2411
+ [
2412
+ "S",
2413
+ "CSc"
2414
+ ],
2415
+ [
2416
+ "S",
2417
+ "CCCNc"
2418
+ ],
2419
+ [
2420
+ "COC",
2421
+ "Sc"
2422
+ ],
2423
+ [
2424
+ "CCCO",
2425
+ "CCO"
2426
+ ],
2427
+ [
2428
+ "OCCN",
2429
+ "CC"
2430
+ ],
2431
+ [
2432
+ "CO",
2433
+ "S"
2434
+ ],
2435
+ [
2436
+ "NCC",
2437
+ "NCc"
2438
+ ],
2439
+ [
2440
+ "OCC",
2441
+ "CCO"
2442
+ ],
2443
+ [
2444
+ "CCCO",
2445
+ "CCOC"
2446
+ ],
2447
+ [
2448
+ "OCCCC",
2449
+ "Nc"
2450
+ ],
2451
+ [
2452
+ "O",
2453
+ "CCCCc"
2454
+ ],
2455
+ [
2456
+ "CO",
2457
+ "CCCCc"
2458
+ ],
2459
+ [
2460
+ "CO",
2461
+ "CCCOCc"
2462
+ ],
2463
+ [
2464
+ "CCCC",
2465
+ "CCc"
2466
+ ],
2467
+ [
2468
+ "cn",
2469
+ "on"
2470
+ ],
2471
+ [
2472
+ "ncnc",
2473
+ "n"
2474
+ ],
2475
+ [
2476
+ "nnn",
2477
+ "s"
2478
+ ],
2479
+ [
2480
+ "CO",
2481
+ "CNC"
2482
+ ],
2483
+ [
2484
+ "CO",
2485
+ "CCCCCC"
2486
+ ],
2487
+ [
2488
+ "CN",
2489
+ "CCNC"
2490
+ ],
2491
+ [
2492
+ "CN",
2493
+ "CCSc"
2494
+ ],
2495
+ [
2496
+ "CCCO",
2497
+ "CCN"
2498
+ ],
2499
+ [
2500
+ "N",
2501
+ "NC"
2502
+ ],
2503
+ [
2504
+ "O",
2505
+ "CCOCCSc"
2506
+ ],
2507
+ [
2508
+ "S",
2509
+ "CNC"
2510
+ ],
2511
+ [
2512
+ "cc",
2513
+ "s"
2514
+ ],
2515
+ [
2516
+ "CC",
2517
+ "CCOCCSc"
2518
+ ],
2519
+ [
2520
+ "CO",
2521
+ "CCCCCn"
2522
+ ],
2523
+ [
2524
+ "CCO",
2525
+ "Cn"
2526
+ ],
2527
+ [
2528
+ "CCOCC",
2529
+ "COC"
2530
+ ],
2531
+ [
2532
+ "Br",
2533
+ "C"
2534
+ ],
2535
+ [
2536
+ "CCCO",
2537
+ "n"
2538
+ ],
2539
+ [
2540
+ "CCCO",
2541
+ "CCOc"
2542
+ ],
2543
+ [
2544
+ "OCCO",
2545
+ "CCO"
2546
+ ],
2547
+ [
2548
+ "OCCO",
2549
+ "Cn"
2550
+ ],
2551
+ [
2552
+ "CCCCO",
2553
+ "CCCNC"
2554
+ ],
2555
+ [
2556
+ "OCCCC",
2557
+ "CO"
2558
+ ],
2559
+ [
2560
+ "co",
2561
+ "cc"
2562
+ ],
2563
+ [
2564
+ "1",
2565
+ "4"
2566
+ ],
2567
+ [
2568
+ "O",
2569
+ "CCCCCn"
2570
+ ],
2571
+ [
2572
+ "S",
2573
+ "N"
2574
+ ],
2575
+ [
2576
+ "S",
2577
+ "COC"
2578
+ ],
2579
+ [
2580
+ "S",
2581
+ "CCCCSc"
2582
+ ],
2583
+ [
2584
+ "NC",
2585
+ "S"
2586
+ ],
2587
+ [
2588
+ "CN",
2589
+ "CCn"
2590
+ ],
2591
+ [
2592
+ "CN",
2593
+ "CCCn"
2594
+ ],
2595
+ [
2596
+ "CN",
2597
+ "CCS"
2598
+ ],
2599
+ [
2600
+ "CCCC",
2601
+ "COCc"
2602
+ ],
2603
+ [
2604
+ "NCC",
2605
+ "CCCN"
2606
+ ],
2607
+ [
2608
+ "cnc",
2609
+ "nn"
2610
+ ],
2611
+ [
2612
+ "COCC",
2613
+ "CCOC"
2614
+ ],
2615
+ [
2616
+ "C",
2617
+ "Br"
2618
+ ],
2619
+ [
2620
+ "NC",
2621
+ "O"
2622
+ ],
2623
+ [
2624
+ "NC",
2625
+ "n"
2626
+ ],
2627
+ [
2628
+ "CO",
2629
+ "COC"
2630
+ ],
2631
+ [
2632
+ "CN",
2633
+ "CCO"
2634
+ ],
2635
+ [
2636
+ "OC",
2637
+ "NC"
2638
+ ],
2639
+ [
2640
+ "OC",
2641
+ "Sc"
2642
+ ],
2643
+ [
2644
+ "CCCC",
2645
+ "CCOC"
2646
+ ],
2647
+ [
2648
+ "CCCO",
2649
+ "CCCC"
2650
+ ],
2651
+ [
2652
+ "SCC",
2653
+ "NCc"
2654
+ ],
2655
+ [
2656
+ "COCCO",
2657
+ "Cn"
2658
+ ],
2659
+ [
2660
+ "N",
2661
+ "Sc"
2662
+ ],
2663
+ [
2664
+ "n",
2665
+ "snc"
2666
+ ],
2667
+ [
2668
+ "NC",
2669
+ "Sc"
2670
+ ],
2671
+ [
2672
+ "NC",
2673
+ "Oc"
2674
+ ],
2675
+ [
2676
+ "NC",
2677
+ "NS"
2678
+ ],
2679
+ [
2680
+ "nc",
2681
+ "nnc"
2682
+ ],
2683
+ [
2684
+ "CO",
2685
+ "CCCCCS"
2686
+ ],
2687
+ [
2688
+ "COC",
2689
+ "S"
2690
+ ],
2691
+ [
2692
+ "CCCO",
2693
+ "CCn"
2694
+ ],
2695
+ [
2696
+ "CCCO",
2697
+ "CCNc"
2698
+ ],
2699
+ [
2700
+ "SCC",
2701
+ "CCN"
2702
+ ],
2703
+ [
2704
+ "SCC",
2705
+ "COC"
2706
+ ],
2707
+ [
2708
+ "SCC",
2709
+ "CCOc"
2710
+ ],
2711
+ [
2712
+ "NCCCO",
2713
+ "CCc"
2714
+ ],
2715
+ [
2716
+ "CCCCO",
2717
+ "CCC"
2718
+ ],
2719
+ [
2720
+ "CCCCO",
2721
+ "CCOc"
2722
+ ],
2723
+ [
2724
+ "COCCO",
2725
+ "CCCNC"
2726
+ ],
2727
+ [
2728
+ "COCCOCC",
2729
+ "Nc"
2730
+ ],
2731
+ [
2732
+ "CCCCOCC",
2733
+ "Nc"
2734
+ ],
2735
+ [
2736
+ "4",
2737
+ "1"
2738
+ ],
2739
+ [
2740
+ "5",
2741
+ "3"
2742
+ ],
2743
+ [
2744
+ "O",
2745
+ "Cl"
2746
+ ],
2747
+ [
2748
+ "S",
2749
+ "CCOCCC"
2750
+ ],
2751
+ [
2752
+ "CC",
2753
+ "l"
2754
+ ],
2755
+ [
2756
+ "NC",
2757
+ "OC"
2758
+ ],
2759
+ [
2760
+ "CO",
2761
+ "CCCCCN"
2762
+ ],
2763
+ [
2764
+ "CCO",
2765
+ "S"
2766
+ ],
2767
+ [
2768
+ "CCN",
2769
+ "CCN"
2770
+ ],
2771
+ [
2772
+ "CN",
2773
+ "CCOC"
2774
+ ],
2775
+ [
2776
+ "CN",
2777
+ "CCCOc"
2778
+ ],
2779
+ [
2780
+ "CN",
2781
+ "CCOCCO"
2782
+ ],
2783
+ [
2784
+ "nnc",
2785
+ "nc"
2786
+ ],
2787
+ [
2788
+ "CCOCC",
2789
+ "NCc"
2790
+ ],
2791
+ [
2792
+ "OCCO",
2793
+ "CCOc"
2794
+ ],
2795
+ [
2796
+ "OCCCO",
2797
+ "n"
2798
+ ],
2799
+ [
2800
+ "NCCCC",
2801
+ "Sc"
2802
+ ],
2803
+ [
2804
+ "NCCCC",
2805
+ "CCNC"
2806
+ ],
2807
+ [
2808
+ "OCCCC",
2809
+ "S"
2810
+ ],
2811
+ [
2812
+ "COCCO",
2813
+ "n"
2814
+ ],
2815
+ [
2816
+ "COCCO",
2817
+ "CCOc"
2818
+ ],
2819
+ [
2820
+ "COCCOCC",
2821
+ "n"
2822
+ ],
2823
+ [
2824
+ "NCCOCC",
2825
+ "n"
2826
+ ],
2827
+ [
2828
+ "6",
2829
+ "5"
2830
+ ],
2831
+ [
2832
+ "O",
2833
+ "CCCCCCSc"
2834
+ ],
2835
+ [
2836
+ "O",
2837
+ "CCCCCCNc"
2838
+ ],
2839
+ [
2840
+ "S",
2841
+ "n"
2842
+ ],
2843
+ [
2844
+ "S",
2845
+ "NC"
2846
+ ],
2847
+ [
2848
+ "S",
2849
+ "CO"
2850
+ ],
2851
+ [
2852
+ "S",
2853
+ "CS"
2854
+ ],
2855
+ [
2856
+ "S",
2857
+ "CCNCC"
2858
+ ],
2859
+ [
2860
+ "NC",
2861
+ "Nc"
2862
+ ],
2863
+ [
2864
+ "CCN",
2865
+ "CCSc"
2866
+ ],
2867
+ [
2868
+ "CN",
2869
+ "CCCOC"
2870
+ ],
2871
+ [
2872
+ "OC",
2873
+ "S"
2874
+ ],
2875
+ [
2876
+ "NCC",
2877
+ "CCOCc"
2878
+ ],
2879
+ [
2880
+ "COC",
2881
+ "Nc"
2882
+ ],
2883
+ [
2884
+ "COC",
2885
+ "NCc"
2886
+ ],
2887
+ [
2888
+ "CCOCC",
2889
+ "Cc"
2890
+ ],
2891
+ [
2892
+ "CCOCC",
2893
+ "COc"
2894
+ ],
2895
+ [
2896
+ "OCC",
2897
+ "COCc"
2898
+ ],
2899
+ [
2900
+ "OCC",
2901
+ "CCNS"
2902
+ ],
2903
+ [
2904
+ "CCCO",
2905
+ "CCCn"
2906
+ ],
2907
+ [
2908
+ "NS",
2909
+ "NC"
2910
+ ],
2911
+ [
2912
+ "OCCO",
2913
+ "CCc"
2914
+ ],
2915
+ [
2916
+ "CCCCCC",
2917
+ "S"
2918
+ ],
2919
+ [
2920
+ "NCCO",
2921
+ "CCCC"
2922
+ ],
2923
+ [
2924
+ "NCCO",
2925
+ "CCOC"
2926
+ ],
2927
+ [
2928
+ "CCCCO",
2929
+ "CCCN"
2930
+ ],
2931
+ [
2932
+ "NCCCC",
2933
+ "COC"
2934
+ ],
2935
+ [
2936
+ "OCCCC",
2937
+ "Cc"
2938
+ ],
2939
+ [
2940
+ "COCCO",
2941
+ "CCN"
2942
+ ],
2943
+ [
2944
+ "COCCCC",
2945
+ "Cc"
2946
+ ],
2947
+ [
2948
+ "COCCOCC",
2949
+ "S"
2950
+ ],
2951
+ [
2952
+ "COCCOCC",
2953
+ "Sc"
2954
+ ],
2955
+ [
2956
+ "SCCO",
2957
+ "CCN"
2958
+ ]
2959
+ ]
2960
+ }
2961
+ }
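Note on the merge list above: assuming these pairs form the `merges` array of a BPE model (the enclosing keys fall outside this part of the diff), each entry joins two adjacent symbols into one, and entries that appear earlier take priority. The sketch below is a simplified illustration of that rule, not the `tokenizers` library implementation. The function name `apply_merges` and the starting symbols are hypothetical; in practice the starting symbols would themselves come from earlier merges not shown here.

```python
from typing import List, Tuple


def apply_merges(symbols: List[str], merges: List[Tuple[str, str]]) -> List[str]:
    """Greedy BPE step: repeatedly merge the adjacent pair with the lowest rank."""
    # Rank = position in the merge list; a lower rank is applied first.
    ranks = {pair: rank for rank, pair in enumerate(merges)}
    symbols = list(symbols)
    while len(symbols) > 1:
        # Collect every adjacent pair that has a merge rule.
        candidates = [
            (ranks[(left, right)], index)
            for index, (left, right) in enumerate(zip(symbols, symbols[1:]))
            if (left, right) in ranks
        ]
        if not candidates:
            break
        _, index = min(candidates)
        # Replace the two symbols with their concatenation.
        symbols[index:index + 2] = [symbols[index] + symbols[index + 1]]
    return symbols


# Four pairs copied from the list above, kept in their original relative order.
merges = [("S", "CCN"), ("CO", "n"), ("Cl", "C"), ("Br", "C")]

# Hypothetical starting symbols (assumed to be the output of earlier merges).
print(apply_merges(["S", "CCN", "Cl", "C"], merges))
```

Because rank is just list position, reordering the entries in `tokenizer.json` would change how a string is segmented, even if the set of pairs stayed the same.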
tokenizer_config.json ADDED
@@ -0,0 +1,56 @@
+ {
+   "added_tokens_decoder": {
+     "0": {
+       "content": "<|startoftext|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "1": {
+       "content": "<pad>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "2": {
+       "content": "<|endoftext|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "3": {
+       "content": "<unk>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "4": {
+       "content": "<mask>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     }
+   },
+   "bos_token": "<|startoftext|>",
+   "clean_up_tokenization_spaces": false,
+   "eos_token": "<|endoftext|>",
+   "extra_special_tokens": {},
+   "max_length": 256,
+   "model_max_length": 1000000000000000019884624838656,
+   "pad_token": "<|endoftext|>",
+   "stride": 0,
+   "tokenizer_class": "PreTrainedTokenizerFast",
+   "truncation_side": "right",
+   "truncation_strategy": "longest_first",
+   "unk_token": "<unk>"
+ }
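Note on the config above: `pad_token` is set to `<|endoftext|>`, so padding shares id 2 with the end-of-sequence token, while `<pad>` (id 1) and `<mask>` (id 4) are registered in `added_tokens_decoder` but not assigned to any role in this file. Whether that is intentional is not stated in the commit. The sketch below uses only the standard library to make the mapping explicit. The JSON is abridged from the diff, and the variable names (`content_to_id`, `role_ids`, `unassigned`) are illustrative.

```python
import json

# Abridged copy of the fields shown in the tokenizer_config.json diff.
config = json.loads("""
{
  "added_tokens_decoder": {
    "0": {"content": "<|startoftext|>", "special": true},
    "1": {"content": "<pad>", "special": true},
    "2": {"content": "<|endoftext|>", "special": true},
    "3": {"content": "<unk>", "special": true},
    "4": {"content": "<mask>", "special": true}
  },
  "bos_token": "<|startoftext|>",
  "eos_token": "<|endoftext|>",
  "pad_token": "<|endoftext|>",
  "unk_token": "<unk>"
}
""")

# Map each registered token string to its integer id.
content_to_id = {
    entry["content"]: int(token_id)
    for token_id, entry in config["added_tokens_decoder"].items()
}

# Resolve each named role to the id of the token it points at.
roles = ("bos_token", "eos_token", "pad_token", "unk_token")
role_ids = {role: content_to_id[config[role]] for role in roles}

# Registered tokens that no role refers to.
unassigned = sorted(set(content_to_id) - {config[role] for role in roles})

print(role_ids)
print(unassigned)
```

If padding and end-of-sequence need to be distinguishable downstream, one option would be to point `pad_token` at the already registered `<pad>` entry. That is a suggestion for the uploader to consider and not something this commit does.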