mikemayuare commited on
Commit
9e0ecf3
1 Parent(s): 1471d2d

Upload tokenizer

Browse files
Files changed (3) hide show
  1. special_tokens_map.json +37 -0
  2. tokenizer.json +4529 -0
  3. tokenizer_config.json +52 -0
special_tokens_map.json ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token": {
3
+ "content": "<s>",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "eos_token": {
10
+ "content": "</s>",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "mask_token": {
17
+ "content": "<mask>",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "pad_token": {
24
+ "content": "<pad>",
25
+ "lstrip": false,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "unk_token": {
31
+ "content": "<unk>",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ }
37
+ }
tokenizer.json ADDED
@@ -0,0 +1,4529 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "version": "1.0",
3
+ "truncation": null,
4
+ "padding": null,
5
+ "added_tokens": [
6
+ {
7
+ "id": 0,
8
+ "content": "<pad>",
9
+ "single_word": false,
10
+ "lstrip": false,
11
+ "rstrip": false,
12
+ "normalized": false,
13
+ "special": true
14
+ },
15
+ {
16
+ "id": 1,
17
+ "content": "<s>",
18
+ "single_word": false,
19
+ "lstrip": false,
20
+ "rstrip": false,
21
+ "normalized": false,
22
+ "special": true
23
+ },
24
+ {
25
+ "id": 2,
26
+ "content": "<unk>",
27
+ "single_word": false,
28
+ "lstrip": false,
29
+ "rstrip": false,
30
+ "normalized": false,
31
+ "special": true
32
+ },
33
+ {
34
+ "id": 3,
35
+ "content": "<mask>",
36
+ "single_word": false,
37
+ "lstrip": false,
38
+ "rstrip": false,
39
+ "normalized": false,
40
+ "special": true
41
+ },
42
+ {
43
+ "id": 2260,
44
+ "content": "</s>",
45
+ "single_word": false,
46
+ "lstrip": false,
47
+ "rstrip": false,
48
+ "normalized": false,
49
+ "special": true
50
+ }
51
+ ],
52
+ "normalizer": null,
53
+ "pre_tokenizer": null,
54
+ "post_processor": null,
55
+ "decoder": null,
56
+ "model": {
57
+ "type": "BPE",
58
+ "dropout": null,
59
+ "unk_token": "<unk>",
60
+ "continuing_subword_prefix": null,
61
+ "end_of_word_suffix": null,
62
+ "fuse_unk": false,
63
+ "byte_fallback": false,
64
+ "ignore_merges": false,
65
+ "vocab": {
66
+ "<pad>": 0,
67
+ "<s>": 1,
68
+ "<unk>": 2,
69
+ "<mask>": 3,
70
+ "#": 4,
71
+ "%": 5,
72
+ "(": 6,
73
+ ")": 7,
74
+ "+": 8,
75
+ "-": 9,
76
+ "0": 10,
77
+ "1": 11,
78
+ "2": 12,
79
+ "3": 13,
80
+ "4": 14,
81
+ "5": 15,
82
+ "6": 16,
83
+ "7": 17,
84
+ "8": 18,
85
+ "9": 19,
86
+ "=": 20,
87
+ "A": 21,
88
+ "B": 22,
89
+ "C": 23,
90
+ "E": 24,
91
+ "F": 25,
92
+ "G": 26,
93
+ "H": 27,
94
+ "I": 28,
95
+ "M": 29,
96
+ "N": 30,
97
+ "O": 31,
98
+ "P": 32,
99
+ "R": 33,
100
+ "S": 34,
101
+ "T": 35,
102
+ "U": 36,
103
+ "V": 37,
104
+ "W": 38,
105
+ "X": 39,
106
+ "Y": 40,
107
+ "Z": 41,
108
+ "[": 42,
109
+ "]": 43,
110
+ "a": 44,
111
+ "b": 45,
112
+ "c": 46,
113
+ "d": 47,
114
+ "e": 48,
115
+ "g": 49,
116
+ "h": 50,
117
+ "i": 51,
118
+ "l": 52,
119
+ "m": 53,
120
+ "n": 54,
121
+ "o": 55,
122
+ "p": 56,
123
+ "r": 57,
124
+ "s": 58,
125
+ "t": 59,
126
+ "u": 60,
127
+ "cc": 61,
128
+ "CC": 62,
129
+ "(C": 63,
130
+ "c1": 64,
131
+ "O)": 65,
132
+ "=O)": 66,
133
+ "(=O)": 67,
134
+ "ccc": 68,
135
+ "(C)": 69,
136
+ "c2": 70,
137
+ "C(=O)": 71,
138
+ ")cc": 72,
139
+ "+]": 73,
140
+ "[N": 74,
141
+ "CCC": 75,
142
+ "c1cc": 76,
143
+ "[NH": 77,
144
+ "c1ccc": 78,
145
+ "c(": 79,
146
+ "C(": 80,
147
+ "c3": 81,
148
+ "2)": 82,
149
+ "F)": 83,
150
+ "C1": 84,
151
+ "CCCC": 85,
152
+ "c2cc": 86,
153
+ "OC": 87,
154
+ "c1cccc": 88,
155
+ "NC(=O)": 89,
156
+ ")cc1": 90,
157
+ "CC1": 91,
158
+ "(=O)N": 92,
159
+ "(C)C": 93,
160
+ "-]": 94,
161
+ "CO": 95,
162
+ "c1ccc(": 96,
163
+ "[O": 97,
164
+ "[O-]": 98,
165
+ "n1": 99,
166
+ "[NH+]": 100,
167
+ "c2ccc": 101,
168
+ "3)": 102,
169
+ "(Cl": 103,
170
+ "(F)": 104,
171
+ "c1ccccc1": 105,
172
+ "ccccc": 106,
173
+ "CCO": 107,
174
+ "C(=O)N": 108,
175
+ "2+]": 109,
176
+ "[NH2+]": 110,
177
+ "c2ccccc": 111,
178
+ "(CC": 112,
179
+ "C2": 113,
180
+ "[O-])": 114,
181
+ "cn": 115,
182
+ "c1n": 116,
183
+ "S(=O)": 117,
184
+ "[n": 118,
185
+ "N)": 119,
186
+ "O=": 120,
187
+ "CCN": 121,
188
+ "(C(=O)": 122,
189
+ "[nH": 123,
190
+ "(C(=O)N": 124,
191
+ "c4": 125,
192
+ "(Cl)": 126,
193
+ "Br": 127,
194
+ "CC(C)": 128,
195
+ "C(C)": 129,
196
+ "[nH]": 130,
197
+ "(C)C)": 131,
198
+ "CC(": 132,
199
+ "2)cc1": 133,
200
+ "c(C": 134,
201
+ "3+]": 135,
202
+ "[NH3+]": 136,
203
+ "c3ccc": 137,
204
+ "c2ccc(": 138,
205
+ "CN": 139,
206
+ "C(C": 140,
207
+ "c(C)": 141,
208
+ "c3ccccc": 142,
209
+ "Cl": 143,
210
+ "CCCCC": 144,
211
+ "C=": 145,
212
+ "cc(": 146,
213
+ "c2)": 147,
214
+ "c2n": 148,
215
+ "cc1": 149,
216
+ "OC)": 150,
217
+ "c2ccccc2": 151,
218
+ "O=C(": 152,
219
+ "c1cc(": 153,
220
+ "F)cc": 154,
221
+ "c1ccc(C": 155,
222
+ "CC(=O)N": 156,
223
+ ")N": 157,
224
+ "n2": 158,
225
+ "CC2": 159,
226
+ "[N+]": 160,
227
+ "2)c1": 161,
228
+ "C)": 162,
229
+ "[NH3+])": 163,
230
+ "CC[NH+]": 164,
231
+ "Br)": 165,
232
+ "4)": 166,
233
+ "c(N": 167,
234
+ "CCC(": 168,
235
+ "=O": 169,
236
+ "(Cl)cc": 170,
237
+ "(F)(F)": 171,
238
+ "c1)": 172,
239
+ "c(=O)": 173,
240
+ "c3cc": 174,
241
+ "[N+](=O)": 175,
242
+ "Cc1ccc(": 176,
243
+ "CC(=O)": 177,
244
+ "c2cccc": 178,
245
+ "c1ccc2": 179,
246
+ "c1cccc(": 180,
247
+ "CC2)": 181,
248
+ "N1": 182,
249
+ "C(F)": 183,
250
+ "C3": 184,
251
+ "s1": 185,
252
+ "c3ccccc3": 186,
253
+ "C[NH+]": 187,
254
+ "CCC1": 188,
255
+ "ccc2": 189,
256
+ "Cc1": 190,
257
+ "nc(": 191,
258
+ "nc1": 192,
259
+ "OCC": 193,
260
+ "Cc1cc": 194,
261
+ "CCCCCCCC": 195,
262
+ "C(O)": 196,
263
+ "N2": 197,
264
+ "=C": 198,
265
+ "c3ccc(": 199,
266
+ "OC(C)": 200,
267
+ "Cc1n": 201,
268
+ "c3)": 202,
269
+ "COC(=O)": 203,
270
+ "Cl)": 204,
271
+ "c(Cl)": 205,
272
+ "#N)": 206,
273
+ "C(F)(F)": 207,
274
+ "c5": 208,
275
+ "2)CC1": 209,
276
+ "(CC)": 210,
277
+ "OC(=O)": 211,
278
+ "(O)": 212,
279
+ "CC[NH2+]": 213,
280
+ "1)": 214,
281
+ "cc2": 215,
282
+ "=C(": 216,
283
+ "C[NH2+]": 217,
284
+ ")ccc1": 218,
285
+ "CCN(": 219,
286
+ "O=C(N": 220,
287
+ "F)cc1": 221,
288
+ "(F)(F)F)": 222,
289
+ "nn": 223,
290
+ "=N": 224,
291
+ ")cc2": 225,
292
+ "COc1ccc(": 226,
293
+ "c4ccccc": 227,
294
+ "2)C1": 228,
295
+ "CS": 229,
296
+ "CC(C)(C)": 230,
297
+ "CCCC1": 231,
298
+ "c(F)": 232,
299
+ "c1cn": 233,
300
+ "CCOC(=O)": 234,
301
+ "c2cc(": 235,
302
+ "CCCN": 236,
303
+ "CCC(C)": 237,
304
+ "CC3)": 238,
305
+ "nc2": 239,
306
+ "NC(=O)N": 240,
307
+ "C(C)C": 241,
308
+ "=S": 242,
309
+ "c4ccc": 243,
310
+ "CC(O)": 244,
311
+ "CC3": 245,
312
+ "o1": 246,
313
+ "cs": 247,
314
+ "CCCO": 248,
315
+ "CCC2": 249,
316
+ "(C(C)": 250,
317
+ "(Cl)cc1": 251,
318
+ "c1ccc2c(": 252,
319
+ "cn1": 253,
320
+ "CC(C": 254,
321
+ "C(=O)N1": 255,
322
+ "(N)": 256,
323
+ "c2c(": 257,
324
+ "[S": 258,
325
+ "Cn1": 259,
326
+ "=[NH+]": 260,
327
+ "Cc1ccc": 261,
328
+ "CCCCC1": 262,
329
+ "n3": 263,
330
+ "Cc1cc(": 264,
331
+ "O=C(C": 265,
332
+ "c2c1": 266,
333
+ "ncc": 267,
334
+ "c1cc(C": 268,
335
+ "2)n1": 269,
336
+ "c1cccc(C": 270,
337
+ "CCC(C": 271,
338
+ "c2ccc3": 272,
339
+ "CC)": 273,
340
+ "c2cn": 274,
341
+ "c(C(=O)N": 275,
342
+ "c12": 276,
343
+ ")N1": 277,
344
+ "[nH+]": 278,
345
+ "[Si": 279,
346
+ "(CC(=O)N": 280,
347
+ "c3cccc": 281,
348
+ "ccc3": 282,
349
+ "CNC(=O)": 283,
350
+ "[NH+]1": 284,
351
+ "CC=": 285,
352
+ ")cc(": 286,
353
+ "CC(C)C": 287,
354
+ "OC1": 288,
355
+ "n(": 289,
356
+ "c2cccc(": 290,
357
+ "[Si]": 291,
358
+ "[NH+]2": 292,
359
+ "OC2": 293,
360
+ "CC1)": 294,
361
+ "c4ccccc4": 295,
362
+ "CCn1": 296,
363
+ "cccc": 297,
364
+ "c2ccc(C": 298,
365
+ "c(C)c1": 299,
366
+ "(C)cc": 300,
367
+ "N#": 301,
368
+ ")cc2)": 302,
369
+ "CCNC(=O)": 303,
370
+ "c1c(": 304,
371
+ "CC2)cc1": 305,
372
+ "CCS": 306,
373
+ "3)n": 307,
374
+ "OC(C": 308,
375
+ "=O)cc1": 309,
376
+ "c1cc2": 310,
377
+ "c2)cc1": 311,
378
+ "n(C)": 312,
379
+ "5)": 313,
380
+ "NS(=O)": 314,
381
+ "NC(=O)C(": 315,
382
+ "c1C": 316,
383
+ "c[nH]": 317,
384
+ "NC(": 318,
385
+ "([O-])": 319,
386
+ "c3n": 320,
387
+ "(C)C(=O)": 321,
388
+ "c(OC)": 322,
389
+ "#N": 323,
390
+ ")cc3)": 324,
391
+ "CCCC2": 325,
392
+ "CN1": 326,
393
+ "c(N)": 327,
394
+ "Cc1ccc(C": 328,
395
+ "(C)(=O)": 329,
396
+ "C(C)C)": 330,
397
+ "c6": 331,
398
+ "O=C1": 332,
399
+ "nc(N": 333,
400
+ "C[NH+]1": 334,
401
+ ")cc3": 335,
402
+ ")C(=O)": 336,
403
+ "c(C(=O)": 337,
404
+ "C2)": 338,
405
+ "CC2)c1": 339,
406
+ "c1cccc2": 340,
407
+ "Br)cc1": 341,
408
+ "N(": 342,
409
+ "cc(C": 343,
410
+ "C1CC1": 344,
411
+ "S(C)(=O)": 345,
412
+ "nc(C": 346,
413
+ "CCC(=O)": 347,
414
+ "ccc(": 348,
415
+ "CCC(=O)N": 349,
416
+ "[O-])cc1": 350,
417
+ "c(NC(=O)": 351,
418
+ "C(N)": 352,
419
+ "CN(": 353,
420
+ "CCN1": 354,
421
+ "c1ccc(N": 355,
422
+ "c3c(": 356,
423
+ "C4": 357,
424
+ "[O-])c1": 358,
425
+ "OC(": 359,
426
+ "c1ccc(C)": 360,
427
+ "(C(=O)N2": 361,
428
+ ")cc2)cc1": 362,
429
+ "[nH]1": 363,
430
+ "c(Cl)cc": 364,
431
+ "OCCO": 365,
432
+ "C1=O": 366,
433
+ "Cc1cccc(": 367,
434
+ "=C2": 368,
435
+ "n(C": 369,
436
+ "CCC[NH+]": 370,
437
+ "CCCC(": 371,
438
+ "=S)": 372,
439
+ "O1": 373,
440
+ "nn1": 374,
441
+ "CCC3": 375,
442
+ "Br)c1": 376,
443
+ "NC(=O)C1": 377,
444
+ "[Si](C)": 378,
445
+ "(CC(=O)": 379,
446
+ "cc3": 380,
447
+ "OCO": 381,
448
+ ")C1": 382,
449
+ "c4ccc(": 383,
450
+ "N1C(=O)": 384,
451
+ "n2)": 385,
452
+ "c2)c1": 386,
453
+ "C(C)(C)C": 387,
454
+ "nc3": 388,
455
+ "OCC(=O)N": 389,
456
+ "c2ccc3c(": 390,
457
+ "c4)": 391,
458
+ "=S)N": 392,
459
+ "Nc1n": 393,
460
+ "Cc1cn": 394,
461
+ "c5ccccc": 395,
462
+ "NC(=O)C2": 396,
463
+ "(N)=O)": 397,
464
+ "CCS(=O)": 398,
465
+ "F)cc2": 399,
466
+ "P(=O)": 400,
467
+ "ccccc2": 401,
468
+ "(Cl)c1": 402,
469
+ "O)cc1": 403,
470
+ "c1ccc(C2": 404,
471
+ "CCc1n": 405,
472
+ "C(C)(C)": 406,
473
+ "c(Cl)c1": 407,
474
+ "c2ccc(N": 408,
475
+ "C(N": 409,
476
+ "ncn": 410,
477
+ "(C2": 411,
478
+ "c(S": 412,
479
+ "c3cc(": 413,
480
+ "(CCC": 414,
481
+ "C#": 415,
482
+ "c(F)c1": 416,
483
+ "c2s": 417,
484
+ "3)C2": 418,
485
+ "CS(=O)": 419,
486
+ "CCOCC1": 420,
487
+ "CC1(C)": 421,
488
+ "OCC)": 422,
489
+ "CN(C(=O)": 423,
490
+ "c(O)": 424,
491
+ "ncc1": 425,
492
+ "ccc1": 426,
493
+ "COc1cc(": 427,
494
+ "3CCCC": 428,
495
+ "Cc1cc(C)": 429,
496
+ "N2C(=O)": 430,
497
+ "CC(CC": 431,
498
+ "CC[NH+]1": 432,
499
+ "c1=O": 433,
500
+ "N=": 434,
501
+ "C3)": 435,
502
+ "cs1": 436,
503
+ "n3)": 437,
504
+ "c3ccc4": 438,
505
+ "I)": 439,
506
+ "c2cc3": 440,
507
+ "CC(C)C)": 441,
508
+ "CC4": 442,
509
+ "C)cc1": 443,
510
+ "c2nc(": 444,
511
+ "s2)": 445,
512
+ "C(F)(F)F": 446,
513
+ "C=C": 447,
514
+ "C(=O)NC(": 448,
515
+ "c(C2": 449,
516
+ "c2)CC1": 450,
517
+ "c1ncc": 451,
518
+ "(C)C1": 452,
519
+ "(CO)": 453,
520
+ "CC(=O)N1": 454,
521
+ "(C)c1": 455,
522
+ "CCC(O)": 456,
523
+ "c4cc": 457,
524
+ "C(=O)N2": 458,
525
+ "sc1": 459,
526
+ "([NH3+])": 460,
527
+ "COC1": 461,
528
+ "[O-])C1": 462,
529
+ "OC)cc1": 463,
530
+ "c1ccc(O": 464,
531
+ "C(=O)N(": 465,
532
+ "COc1ccc": 466,
533
+ "(=O)N2": 467,
534
+ "Cc1cccc": 468,
535
+ "(C)C)cc1": 469,
536
+ "n1)": 470,
537
+ "3)cc1": 471,
538
+ "=C(N": 472,
539
+ "l)": 473,
540
+ "CCC=": 474,
541
+ "(F)c1": 475,
542
+ "c(C)cc": 476,
543
+ "c2ncc": 477,
544
+ "(Cl)cc2": 478,
545
+ "(C#N)": 479,
546
+ "OC3": 480,
547
+ "n2)cc1": 481,
548
+ "ccc21": 482,
549
+ "c1s": 483,
550
+ "(C)C(C)": 484,
551
+ "(C(=O)NC": 485,
552
+ "CN(C)": 486,
553
+ "[NH2+]C": 487,
554
+ "OC)c1": 488,
555
+ "C(C#N)": 489,
556
+ "c1nc(": 490,
557
+ "CC4)": 491,
558
+ "COc1cc": 492,
559
+ "(N": 493,
560
+ "CCCCC2": 494,
561
+ "C1=": 495,
562
+ "F)cc2)": 496,
563
+ "C1)": 497,
564
+ "s1)": 498,
565
+ "nc(C)": 499,
566
+ "ccccc3": 500,
567
+ "=O)c1": 501,
568
+ "COC": 502,
569
+ "o2)": 503,
570
+ "COc1cc(C": 504,
571
+ "c2cc(Cl)": 505,
572
+ "CCOCCO": 506,
573
+ "CCCCO": 507,
574
+ "c3cccc(": 508,
575
+ "CCN(CC)": 509,
576
+ "c2ccc(OC": 510,
577
+ "c(C(C)": 511,
578
+ "N(C": 512,
579
+ "N(C)": 513,
580
+ "F)ccc1": 514,
581
+ "C(CO)": 515,
582
+ "N(C(=O)": 516,
583
+ "[NH2+]C1": 517,
584
+ "c7": 518,
585
+ "OC(C)=O)": 519,
586
+ "c3cn": 520,
587
+ "n4": 521,
588
+ "CCN(C": 522,
589
+ "CN(C": 523,
590
+ "(CCO)": 524,
591
+ "SC": 525,
592
+ "c5ccccc5": 526,
593
+ "=C1": 527,
594
+ "c1cs": 528,
595
+ "c1ccc(F)": 529,
596
+ "oc(": 530,
597
+ "(C)C2": 531,
598
+ "C(C(=O)": 532,
599
+ "c(N2": 533,
600
+ "CCOCC2)": 534,
601
+ "CCc1ccc(": 535,
602
+ "(CCCC": 536,
603
+ "c2cc(C": 537,
604
+ "c2ccc(Br": 538,
605
+ "c3ccc(C": 539,
606
+ "[nH]c(": 540,
607
+ "3)CC2)": 541,
608
+ "NC(=O)C": 542,
609
+ "OCCC": 543,
610
+ "(c2ccccc": 544,
611
+ "no1": 545,
612
+ "(=O)N1": 546,
613
+ "c(OC)c1": 547,
614
+ "c2cc(C)": 548,
615
+ "ccc4": 549,
616
+ "C(O)C(O)": 550,
617
+ "CCOC1": 551,
618
+ "OCC1": 552,
619
+ "c2ccc(C)": 553,
620
+ "c1cc2c(": 554,
621
+ "c2cccc3": 555,
622
+ "O)cc": 556,
623
+ "o1)": 557,
624
+ "=O)cc": 558,
625
+ "c[nH+]": 559,
626
+ "CCOC": 560,
627
+ "O=S(=O)": 561,
628
+ "CCCC(C)": 562,
629
+ "N=C": 563,
630
+ "CCCn1": 564,
631
+ "3CCO": 565,
632
+ "c4cccc": 566,
633
+ "c2C)": 567,
634
+ "c2ncn": 568,
635
+ "C(=": 569,
636
+ "c(Br)": 570,
637
+ "CCC4": 571,
638
+ "c2cs": 572,
639
+ "c2cccc(C": 573,
640
+ "c(O": 574,
641
+ "[n+]": 575,
642
+ "CCCCC2)": 576,
643
+ "(C)C)c1": 577,
644
+ "C1CCCCC1": 578,
645
+ "F)cc3)": 579,
646
+ ")C(=O)N": 580,
647
+ "Cc1cc(C": 581,
648
+ "(Cl)ccc1": 582,
649
+ "CCCC2)": 583,
650
+ "c1nnc(": 584,
651
+ "c12)": 585,
652
+ "nc2)": 586,
653
+ "C(=C": 587,
654
+ "c1c(C)": 588,
655
+ "(Cl)cc2)": 589,
656
+ "ccccc6": 590,
657
+ "C12": 591,
658
+ "%1": 592,
659
+ "C(NC(=O)": 593,
660
+ "OC(CO)": 594,
661
+ "(CC2": 595,
662
+ "c2cc(F)": 596,
663
+ "c1ccc(N2": 597,
664
+ "o2)cc1": 598,
665
+ "c1cccs1": 599,
666
+ "[O-])cc": 600,
667
+ "C2=O)": 601,
668
+ "c1cccnc1": 602,
669
+ "=C(N)": 603,
670
+ "C=C1": 604,
671
+ "c1cc(N": 605,
672
+ "3CCCCC": 606,
673
+ "CCC)": 607,
674
+ "C(=S)N": 608,
675
+ "c(C#N)": 609,
676
+ "c21": 610,
677
+ "[N-]": 611,
678
+ "CCO)": 612,
679
+ "n2cc": 613,
680
+ "c(S(=O)": 614,
681
+ "CCCN(": 615,
682
+ "C(C(=O)N": 616,
683
+ "c1ncn": 617,
684
+ "n1C": 618,
685
+ "c2ccc(F)": 619,
686
+ "C[NH+]2": 620,
687
+ "NC(=O)CS": 621,
688
+ "c2nnc(": 622,
689
+ "(O": 623,
690
+ "n2cn": 624,
691
+ "(C(C)C)": 625,
692
+ "c3ccccc2": 626,
693
+ "n2)c1": 627,
694
+ "[NH2+]C2": 628,
695
+ "Cc1cc2": 629,
696
+ "N)=O)": 630,
697
+ "s3)": 631,
698
+ "OC(=O)N": 632,
699
+ "C1CCC": 633,
700
+ "F)cc3": 634,
701
+ "CCCCC1)": 635,
702
+ "Oc1ccc(": 636,
703
+ "(C)C)cc": 637,
704
+ "NC(C)": 638,
705
+ "CN1C(=O)": 639,
706
+ "=[NH2+]": 640,
707
+ "C1O": 641,
708
+ "c(OC": 642,
709
+ "c6ccccc6": 643,
710
+ "S1": 644,
711
+ "CC1(": 645,
712
+ "SCC(=O)N": 646,
713
+ "c1[nH]": 647,
714
+ "c2)C1": 648,
715
+ "c2c(C)": 649,
716
+ "=CC(=O)": 650,
717
+ "c3ccc4c(": 651,
718
+ "OC(F)(F)": 652,
719
+ "(N)(=O)": 653,
720
+ "CCC(CC)": 654,
721
+ "c1c[nH]": 655,
722
+ "co": 656,
723
+ "CC(C(=O)": 657,
724
+ "CO)": 658,
725
+ "no": 659,
726
+ "CCN(CC": 660,
727
+ "s2)cc1": 661,
728
+ "OCCCC": 662,
729
+ "C(=O)OC": 663,
730
+ "c2n1": 664,
731
+ "C2)cc1": 665,
732
+ "F)c1": 666,
733
+ "nc12": 667,
734
+ "Br)cc": 668,
735
+ "NC1": 669,
736
+ "CCNC(N": 670,
737
+ "3)CC1": 671,
738
+ "c2)n1": 672,
739
+ "c2[nH]": 673,
740
+ "C=C(": 674,
741
+ "3CCOCC3)": 675,
742
+ "=C(O)": 676,
743
+ "ncc2": 677,
744
+ "C#N)": 678,
745
+ "c1ccncc1": 679,
746
+ "c(Br)c1": 680,
747
+ "CCCCCC1": 681,
748
+ ")cc21": 682,
749
+ "-2": 683,
750
+ "C2)c1": 684,
751
+ "Cc1cs": 685,
752
+ "N3": 686,
753
+ "O=[N+]": 687,
754
+ "Br)ccc1": 688,
755
+ "c2=O)": 689,
756
+ "Cc1ccc2": 690,
757
+ "Cc2ccc": 691,
758
+ "NC(=O)CO": 692,
759
+ "C1CCCC1": 693,
760
+ "3)c1": 694,
761
+ "c(F)c(F)": 695,
762
+ "C[NH+](C": 696,
763
+ "C)c1": 697,
764
+ "c1cc(C)": 698,
765
+ "C#N": 699,
766
+ "NC(=S)N": 700,
767
+ "F)cc1)": 701,
768
+ "nnc1": 702,
769
+ "CC(C)O": 703,
770
+ "c5ccc": 704,
771
+ "O)c1": 705,
772
+ ")cc2)c1": 706,
773
+ "S(N)(=O)": 707,
774
+ "CC2)CC1": 708,
775
+ "C5": 709,
776
+ "CC#": 710,
777
+ "4)CC3)": 711,
778
+ "O[Si](C)": 712,
779
+ "CCCC(C": 713,
780
+ "CCN2": 714,
781
+ "CC12": 715,
782
+ "c1c(F)cc": 716,
783
+ "nn2": 717,
784
+ "COc1ccc2": 718,
785
+ "CC(N": 719,
786
+ "c2nc(C": 720,
787
+ "O=C(NC1": 721,
788
+ "C=C(C)": 722,
789
+ "cc(N": 723,
790
+ "n(CC": 724,
791
+ "3CC3)": 725,
792
+ "n2)CC1": 726,
793
+ "oc(C": 727,
794
+ "nc3)": 728,
795
+ "c4c(": 729,
796
+ "CC1CCC": 730,
797
+ "(Cl)cc3": 731,
798
+ "Cl)cc1": 732,
799
+ "c2cc(Br)": 733,
800
+ "OC(F)": 734,
801
+ "c2ccncc": 735,
802
+ "n[nH]": 736,
803
+ "OCC2": 737,
804
+ "(Cl)cc3)": 738,
805
+ "Cc1nc(": 739,
806
+ "CN2": 740,
807
+ "nc(N)": 741,
808
+ "C2=O)cc1": 742,
809
+ "nc2c1": 743,
810
+ "CCCC(=O)": 744,
811
+ "(F)F)": 745,
812
+ "Cc2ccccc": 746,
813
+ "(C)CC1": 747,
814
+ "c3s": 748,
815
+ "NC(=O)N2": 749,
816
+ "CCCC1)": 750,
817
+ "OP(=O)": 751,
818
+ "Nc1ccc(": 752,
819
+ "C(N)=O": 753,
820
+ ")cc2)CC1": 754,
821
+ "6)": 755,
822
+ "CC2)n1": 756,
823
+ "ncn1": 757,
824
+ "CCCS": 758,
825
+ "OCC(=O)": 759,
826
+ "2)C1=O": 760,
827
+ "CC2)ccc1": 761,
828
+ ")cc4": 762,
829
+ ")cccc1": 763,
830
+ "C(=O)N(C": 764,
831
+ "sc2c1": 765,
832
+ "C(=O)C1": 766,
833
+ "o3)": 767,
834
+ "c1cc(Cl)": 768,
835
+ "OC)c(OC)": 769,
836
+ "Cc1ccc(N": 770,
837
+ "c3ccc(OC": 771,
838
+ "c2cc1": 772,
839
+ "CC1=": 773,
840
+ "C(CC)": 774,
841
+ "CC=C": 775,
842
+ "c2c(F)cc": 776,
843
+ "C(OC": 777,
844
+ "c1)OCO": 778,
845
+ "c3ncc": 779,
846
+ "N(CC": 780,
847
+ "c1nc(N": 781,
848
+ "c(=O)n2": 782,
849
+ "c3ccc(N": 783,
850
+ "c8": 784,
851
+ "c3C)": 785,
852
+ "CC1(C": 786,
853
+ "CC1C": 787,
854
+ "c1)C(=O)": 788,
855
+ "cc4": 789,
856
+ "CN(CC": 790,
857
+ "ccc2c1": 791,
858
+ "ccccc4": 792,
859
+ "c4ccc5": 793,
860
+ "C(=O)NC1": 794,
861
+ "(C)C3": 795,
862
+ "CC(=": 796,
863
+ "CC(F)(F)": 797,
864
+ "cc(Br)": 798,
865
+ "(C(=O)C2": 799,
866
+ "CC[NH+]2": 800,
867
+ "CC(O": 801,
868
+ "C(C)O": 802,
869
+ "3)C1": 803,
870
+ "CCC(CC": 804,
871
+ "[O-])CC1": 805,
872
+ "CCC=CCC=": 806,
873
+ "[NH3+]C1": 807,
874
+ "c(=O)n1": 808,
875
+ "cn2": 809,
876
+ "[NH+]3": 810,
877
+ "[NH2+]1": 811,
878
+ "CCC2)": 812,
879
+ "(OC)": 813,
880
+ "ccnc1": 814,
881
+ "c5ccc(": 815,
882
+ "CC2)C1": 816,
883
+ "NC(=O)N1": 817,
884
+ "c1N": 818,
885
+ "(C(C)=O)": 819,
886
+ "CCC(N": 820,
887
+ "c3c2": 821,
888
+ "CCl)": 822,
889
+ "C(Cl)": 823,
890
+ "c2cccs2)": 824,
891
+ "Cc1c(": 825,
892
+ "CC(C)C1": 826,
893
+ "3CCCC3": 827,
894
+ "C(N)=O)": 828,
895
+ "[O-])cc2": 829,
896
+ "nc4": 830,
897
+ "(c3ccccc": 831,
898
+ "[NH2+]C)": 832,
899
+ "(C)C)C1": 833,
900
+ "n1cn": 834,
901
+ "O2": 835,
902
+ "c3nn": 836,
903
+ "OC(C)(C)": 837,
904
+ "c1nc(C": 838,
905
+ "c3cccc4": 839,
906
+ "c2cccnc2": 840,
907
+ ")c1ccc(": 841,
908
+ "CC1(C)C": 842,
909
+ "c3cc4": 843,
910
+ "nc1)": 844,
911
+ "COc1cccc": 845,
912
+ "NC2": 846,
913
+ "Cc1s": 847,
914
+ "CCCC3": 848,
915
+ "CC(=O)N2": 849,
916
+ "=NNC(=O)": 850,
917
+ "=C(C)": 851,
918
+ "[NH+](C)": 852,
919
+ "CC(CC(C": 853,
920
+ "=C(C": 854,
921
+ "#C": 855,
922
+ "F)CC1": 856,
923
+ "=C(C(=O)": 857,
924
+ "C(=O)C(": 858,
925
+ "sc2": 859,
926
+ "c1ccco1": 860,
927
+ "[nH]c1": 861,
928
+ "(=O)C1": 862,
929
+ "[NH2+]C(": 863,
930
+ "c2cc3c(": 864,
931
+ "=C([O-])": 865,
932
+ "c3cc(Cl)": 866,
933
+ "c1ccccn1": 867,
934
+ "oc1": 868,
935
+ "(NC(=O)": 869,
936
+ "CC(C)(O)": 870,
937
+ "c1ncccc1": 871,
938
+ "OCC(C": 872,
939
+ "CCO1": 873,
940
+ "3C)": 874,
941
+ "Cl)c1": 875,
942
+ "CCC(C)C": 876,
943
+ "Sc1n": 877,
944
+ "ccccc7": 878,
945
+ "ccccc12": 879,
946
+ "#N)cc1": 880,
947
+ "Oc2ccc(": 881,
948
+ "CCOC2": 882,
949
+ "c3nc(": 883,
950
+ "C(=O)C2": 884,
951
+ "Cc1no": 885,
952
+ "c5)": 886,
953
+ "n2ccc": 887,
954
+ "Cc1[nH]": 888,
955
+ "3)n2)cc1": 889,
956
+ "(Cl)c2": 890,
957
+ "C1C": 891,
958
+ "CC(CO)": 892,
959
+ "OCC(O)": 893,
960
+ "=C(C#N)": 894,
961
+ "CCCO1": 895,
962
+ ")cc4)": 896,
963
+ "n12": 897,
964
+ "[nH]c2c1": 898,
965
+ "cccc1": 899,
966
+ "CCCN1": 900,
967
+ "c1c(C": 901,
968
+ "Cc1nn(C)": 902,
969
+ "c3cc(C)": 903,
970
+ "NC": 904,
971
+ "2)nc1": 905,
972
+ "C3=O)": 906,
973
+ "OC(C)C)": 907,
974
+ "CCOCC1)": 908,
975
+ "OC(C)C": 909,
976
+ "(Cl)c2)": 910,
977
+ "4CCCC": 911,
978
+ "c2c[nH]": 912,
979
+ "C=CC(=O)": 913,
980
+ "OC(=O)N1": 914,
981
+ "C1CCN": 915,
982
+ "S)": 916,
983
+ "c2nc(N": 917,
984
+ "[NH2+]CC": 918,
985
+ "C=N": 919,
986
+ "c%1": 920,
987
+ "CCN(C)": 921,
988
+ "=[NH2+])": 922,
989
+ "c(C)n1": 923,
990
+ "C(=[NH+]": 924,
991
+ "n[nH]1": 925,
992
+ "(Cl)cc1)": 926,
993
+ "=CC=": 927,
994
+ "c2)OCO": 928,
995
+ "cc21": 929,
996
+ "c1cc(Br)": 930,
997
+ "F)cc1F": 931,
998
+ "c1cc(F)": 932,
999
+ "CCCC)": 933,
1000
+ "(F)c2)": 934,
1001
+ "CCC(O": 935,
1002
+ "C(=O)O": 936,
1003
+ ")N(C": 937,
1004
+ "(C(=O)C": 938,
1005
+ "=C2S": 939,
1006
+ "=C)": 940,
1007
+ "NC(=O)CC": 941,
1008
+ "NC(=O)C3": 942,
1009
+ "c(Cl)c2)": 943,
1010
+ "s2)c1": 944,
1011
+ "C=CC=": 945,
1012
+ "CCNS(=O)": 946,
1013
+ "CC(N)=O)": 947,
1014
+ "c(Cl)c3)": 948,
1015
+ "OC)c1OC": 949,
1016
+ "(C)(C)": 950,
1017
+ "OCCO2": 951,
1018
+ "FC(F)(F)": 952,
1019
+ "c4cc(": 953,
1020
+ "3CCCC3)": 954,
1021
+ "Cl)CC1": 955,
1022
+ "ccccc2c1": 956,
1023
+ "ccc(C": 957,
1024
+ "cn2)": 958,
1025
+ "COCCO": 959,
1026
+ "CC1(O)": 960,
1027
+ "nc(N2": 961,
1028
+ "[nH]1)": 962,
1029
+ "CC(C)C(": 963,
1030
+ "o2)c1": 964,
1031
+ "F)C1": 965,
1032
+ "c7ccccc7": 966,
1033
+ "cc(C)": 967,
1034
+ "F)cc(": 968,
1035
+ "c3c(C)": 969,
1036
+ "[nH]2)": 970,
1037
+ "(Cl)c3)": 971,
1038
+ "c2ccco2)": 972,
1039
+ "SC1": 973,
1040
+ "n2c(": 974,
1041
+ "C2=O": 975,
1042
+ "nn3": 976,
1043
+ "on1": 977,
1044
+ "[nH]c2": 978,
1045
+ "CC5": 979,
1046
+ "4)c3)": 980,
1047
+ "C)CC1": 981,
1048
+ "C2CCCC": 982,
1049
+ "c3ccc(Br": 983,
1050
+ "=C2C(=O)": 984,
1051
+ "3CCCCC3)": 985,
1052
+ "c2ccc(O": 986,
1053
+ "2)ccc1": 987,
1054
+ "(C(=O)N3": 988,
1055
+ "c3ccncc": 989,
1056
+ "CC1CCCC": 990,
1057
+ "n2c(=O)": 991,
1058
+ "N)c1": 992,
1059
+ ")NC1": 993,
1060
+ "CNS(=O)": 994,
1061
+ "C2CC2)": 995,
1062
+ "Cn1cn": 996,
1063
+ "CC(C)(C": 997,
1064
+ "Cc1cccc2": 998,
1065
+ "(C)C)CC1": 999,
1066
+ "cccc(": 1000,
1067
+ "cnc1": 1001,
1068
+ "[C": 1002,
1069
+ "4CCO": 1003,
1070
+ "nc2cccc": 1004,
1071
+ "[nH]c1=O": 1005,
1072
+ "CCc1cc": 1006,
1073
+ "C=C(C": 1007,
1074
+ "c1)C1": 1008,
1075
+ "F)cc(C": 1009,
1076
+ "CCc1": 1010,
1077
+ "c(C)c(C)": 1011,
1078
+ "C2)n1": 1012,
1079
+ "CCCS(=O)": 1013,
1080
+ "OC)c(": 1014,
1081
+ "N=C(": 1015,
1082
+ "(C)(C)C)": 1016,
1083
+ "(C)O": 1017,
1084
+ "=CC(=O)N": 1018,
1085
+ "CCC12": 1019,
1086
+ "c1cccc(N": 1020,
1087
+ "Cc1nc(C": 1021,
1088
+ "=C3": 1022,
1089
+ "4)c3": 1023,
1090
+ "c2cc(N": 1024,
1091
+ "C3CC3)": 1025,
1092
+ "NC(=S)": 1026,
1093
+ "C=CC1": 1027,
1094
+ "CCCO)": 1028,
1095
+ "CCCCCC": 1029,
1096
+ "Cc1cc(N": 1030,
1097
+ "c2ccs": 1031,
1098
+ "CC(C)CC(": 1032,
1099
+ "SC)": 1033,
1100
+ "C(C)=O)": 1034,
1101
+ "=[N+]": 1035,
1102
+ "[NH+](C": 1036,
1103
+ "COCC1": 1037,
1104
+ "3)c2": 1038,
1105
+ "c2c(C)cc": 1039,
1106
+ "s2)CC1": 1040,
1107
+ "CCl": 1041,
1108
+ "CCC2(": 1042,
1109
+ ")(": 1043,
1110
+ "c2nn": 1044,
1111
+ "Cc2ccc(": 1045,
1112
+ "=C(N)N)": 1046,
1113
+ "2)C(=O)": 1047,
1114
+ "c1cc(C2": 1048,
1115
+ "(C(=O)OC": 1049,
1116
+ "cc1Cl": 1050,
1117
+ "COc1": 1051,
1118
+ "C4)": 1052,
1119
+ "CCC3)": 1053,
1120
+ ")ccn1": 1054,
1121
+ "c3cccc(C": 1055,
1122
+ "CC(=O)N(": 1056,
1123
+ "c1ccc(OC": 1057,
1124
+ "CCCc1n": 1058,
1125
+ "c3cccs3)": 1059,
1126
+ "CC(N)": 1060,
1127
+ "ccn1": 1061,
1128
+ "Br)cc(": 1062,
1129
+ "[O-])c(": 1063,
1130
+ "c2o": 1064,
1131
+ "C(OC(=O)": 1065,
1132
+ ")NC(=O)": 1066,
1133
+ "2)cc1OC": 1067,
1134
+ "C=CCO": 1068,
1135
+ ")ccc1OC": 1069,
1136
+ "c2ncccc2": 1070,
1137
+ "2)s1": 1071,
1138
+ "O=S(=O)(": 1072,
1139
+ "c2ccc(N3": 1073,
1140
+ "4)cc3": 1074,
1141
+ "[nH]c3": 1075,
1142
+ "(C)C)n1": 1076,
1143
+ "CSc1n": 1077,
1144
+ "C2CCC": 1078,
1145
+ "C=CC": 1079,
1146
+ "c3ncn": 1080,
1147
+ "C2)CC1": 1081,
1148
+ "C2O)": 1082,
1149
+ "c3ccc(C)": 1083,
1150
+ "c(=O)o": 1084,
1151
+ "O=C(C1": 1085,
1152
+ "[nH+]1": 1086,
1153
+ ")cc2c1": 1087,
1154
+ "C(OC)": 1088,
1155
+ "c(Cl)cc1": 1089,
1156
+ "[n-]": 1090,
1157
+ "C2)C1": 1091,
1158
+ "NN": 1092,
1159
+ "(C(=O)O": 1093,
1160
+ "c1cccs1)": 1094,
1161
+ "[P": 1095,
1162
+ "c1ccco1)": 1096,
1163
+ "OCCCO": 1097,
1164
+ "(F)c3)": 1098,
1165
+ "Cc1o": 1099,
1166
+ "CC(C)n1": 1100,
1167
+ "C=CCn1": 1101,
1168
+ "cc1C": 1102,
1169
+ "c4n": 1103,
1170
+ "CC1CC1": 1104,
1171
+ "c2nnc(C": 1105,
1172
+ "c(N)c1": 1106,
1173
+ "CCCCCCC": 1107,
1174
+ ")N1CCN(": 1108,
1175
+ "c(N3": 1109,
1176
+ "c(O)c1": 1110,
1177
+ "c3cc(F)": 1111,
1178
+ "(C)CC": 1112,
1179
+ "C)C1": 1113,
1180
+ "2)cc1)": 1114,
1181
+ ")cc(C": 1115,
1182
+ "C3CCCCC": 1116,
1183
+ "c3ccc(F)": 1117,
1184
+ "c(=O)c1": 1118,
1185
+ "c1)OCO2": 1119,
1186
+ "c4cn": 1120,
1187
+ "c2ccc(O)": 1121,
1188
+ "n2)C1": 1122,
1189
+ "cc(Cl)": 1123,
1190
+ "c(F)c2)": 1124,
1191
+ "C1(": 1125,
1192
+ "OS(=O)": 1126,
1193
+ "NC(C)=O)": 1127,
1194
+ "C(CC(=O)": 1128,
1195
+ "Cn1cc(": 1129,
1196
+ "(C)cc2)": 1130,
1197
+ "S1(=O)": 1131,
1198
+ "c2ccc(CC": 1132,
1199
+ "(CC)CC": 1133,
1200
+ "2)c1C": 1134,
1201
+ "c1ncc(": 1135,
1202
+ "(C)cc3)": 1136,
1203
+ "CC(=O)O": 1137,
1204
+ "(F)c2": 1138,
1205
+ "4)ccc3": 1139,
1206
+ "ccc5": 1140,
1207
+ "c1ccc(CN": 1141,
1208
+ "c(=O)c2": 1142,
1209
+ "c1cncc(": 1143,
1210
+ "c2nc(C)": 1144,
1211
+ "[P+]": 1145,
1212
+ "c3[nH]": 1146,
1213
+ "[n+]1": 1147,
1214
+ "CCOc1ccc": 1148,
1215
+ "C[NH3+]": 1149,
1216
+ "4CCCCC": 1150,
1217
+ "4)cc3)": 1151,
1218
+ "C(=O)NC": 1152,
1219
+ "ncc(": 1153,
1220
+ "3)C1)": 1154,
1221
+ "CC(F)": 1155,
1222
+ "C(C)C1": 1156,
1223
+ "Nc1cc": 1157,
1224
+ "[O-])c2)": 1158,
1225
+ "C1=O)": 1159,
1226
+ "(C)C(": 1160,
1227
+ "O2)": 1161,
1228
+ "=NN": 1162,
1229
+ "CCCC3)": 1163,
1230
+ "c2c(C": 1164,
1231
+ "c2nccc": 1165,
1232
+ "nc(C2": 1166,
1233
+ "C1(C": 1167,
1234
+ "c(NC": 1168,
1235
+ "nc2n1": 1169,
1236
+ "c2sccc2": 1170,
1237
+ "n5": 1171,
1238
+ "=[N-]": 1172,
1239
+ "[S-]": 1173,
1240
+ "CCOC(C": 1174,
1241
+ "COP(=O)": 1175,
1242
+ "c1ccc(n2": 1176,
1243
+ "(Br)": 1177,
1244
+ "c3cc(C": 1178,
1245
+ "c1o": 1179,
1246
+ "=[NH+]O)": 1180,
1247
+ "CC1CCN": 1181,
1248
+ "CC2CCC1": 1182,
1249
+ ")ccc1Cl": 1183,
1250
+ "C(=O)NC2": 1184,
1251
+ "I)c1": 1185,
1252
+ "c3ccco3)": 1186,
1253
+ "nnn1": 1187,
1254
+ "CCC(CO)": 1188,
1255
+ "n2c1": 1189,
1256
+ "nc21": 1190,
1257
+ "n3cn": 1191,
1258
+ "cnn1": 1192,
1259
+ "n(C2": 1193,
1260
+ "c4ccc(C": 1194,
1261
+ ")S(=O)": 1195,
1262
+ "cc12": 1196,
1263
+ "Nc1cc(": 1197,
1264
+ "I)cc1": 1198,
1265
+ "nc1C": 1199,
1266
+ "NC(N)": 1200,
1267
+ "cnc2": 1201,
1268
+ "(CCC(=O)": 1202,
1269
+ "n4)": 1203,
1270
+ "c2cc(OC)": 1204,
1271
+ "2)cn1": 1205,
1272
+ "CCc1cccc": 1206,
1273
+ "Cn1c(=O)": 1207,
1274
+ ")cc2)n1": 1208,
1275
+ "C1C2": 1209,
1276
+ "(CC)CC)": 1210,
1277
+ "(CC(C)C)": 1211,
1278
+ "C1CCN(": 1212,
1279
+ "C(=N": 1213,
1280
+ "P(": 1214,
1281
+ "O=C(CO": 1215,
1282
+ "(F)c1)": 1216,
1283
+ "c2ncc(": 1217,
1284
+ "C=C(C)C": 1218,
1285
+ "NC(=O)CN": 1219,
1286
+ "CCc1cn": 1220,
1287
+ "n2)n1": 1221,
1288
+ "c([O-])": 1222,
1289
+ "CC1O": 1223,
1290
+ "(C)cc(C)": 1224,
1291
+ "C(=O)OCC": 1225,
1292
+ "(C)ccc1": 1226,
1293
+ "Cc1c(C": 1227,
1294
+ "SC2": 1228,
1295
+ ")ccc1F": 1229,
1296
+ "cn2)cc1": 1230,
1297
+ "c1nc2c(": 1231,
1298
+ "COc1cc2": 1232,
1299
+ "CC21": 1233,
1300
+ "CCc1ccc": 1234,
1301
+ "C(F)F": 1235,
1302
+ "CC1CN(": 1236,
1303
+ "c2nnn": 1237,
1304
+ "(Cl)c1)": 1238,
1305
+ "c4cccc(": 1239,
1306
+ "CCOC(": 1240,
1307
+ "c(F)c1F": 1241,
1308
+ "Cl)C1": 1242,
1309
+ "c2C)cc1": 1243,
1310
+ "(Cl)c(": 1244,
1311
+ "CCCC(O)": 1245,
1312
+ "OC4": 1246,
1313
+ "c1csc(": 1247,
1314
+ "CC5)": 1248,
1315
+ "4CCOCC4)": 1249,
1316
+ "CC(Cl)": 1250,
1317
+ "C(O)C1": 1251,
1318
+ "3)cc2": 1252,
1319
+ "CCCCn1": 1253,
1320
+ "C(CS": 1254,
1321
+ "OC)C1": 1255,
1322
+ "OC(C)=O": 1256,
1323
+ "CC=C(": 1257,
1324
+ "NC(=O)OC": 1258,
1325
+ "CC[NH+](": 1259,
1326
+ "c4ccccc3": 1260,
1327
+ "3CC4": 1261,
1328
+ "C=O": 1262,
1329
+ "C1=C(C)": 1263,
1330
+ "F)cc2)c1": 1264,
1331
+ "CC[NH3+]": 1265,
1332
+ "CCc1cc(": 1266,
1333
+ "c9": 1267,
1334
+ "O)cc2)": 1268,
1335
+ ")cc2)C1": 1269,
1336
+ "c1ccsc1": 1270,
1337
+ "nccc1": 1271,
1338
+ "O[Si]": 1272,
1339
+ "c1nc(C)": 1273,
1340
+ "=[NH+]C": 1274,
1341
+ "c2ccc(C3": 1275,
1342
+ "CCCC2)c1": 1276,
1343
+ "c1)N": 1277,
1344
+ "CCCCN": 1278,
1345
+ "(=O)C": 1279,
1346
+ "nc(Cl)": 1280,
1347
+ "-3": 1281,
1348
+ "c1c(C)cc": 1282,
1349
+ "Cc1cc2c(": 1283,
1350
+ "C(OC2": 1284,
1351
+ "c4ccc5c(": 1285,
1352
+ "2)c(C)c1": 1286,
1353
+ "[O-])cc(": 1287,
1354
+ "C(CCCC": 1288,
1355
+ "CC1CC1)": 1289,
1356
+ "cc(C(=O)": 1290,
1357
+ "cc2c1": 1291,
1358
+ "CC1CO": 1292,
1359
+ "CC2)c1C": 1293,
1360
+ "N)cc1": 1294,
1361
+ ")ccc1O": 1295,
1362
+ "C)n1": 1296,
1363
+ "O=C(CS": 1297,
1364
+ "(C)O)": 1298,
1365
+ "(Cl)c(C": 1299,
1366
+ "c1c(N": 1300,
1367
+ "3)c2)": 1301,
1368
+ "C1CCCC": 1302,
1369
+ "(OCC)": 1303,
1370
+ "CC1CCN(": 1304,
1371
+ "c(=S)": 1305,
1372
+ "c(C3": 1306,
1373
+ "c2noc(C": 1307,
1374
+ "F)cc4)": 1308,
1375
+ "C(C(C)C)": 1309,
1376
+ "(CCC)": 1310,
1377
+ "OC)cc(": 1311,
1378
+ "O)cc1)": 1312,
1379
+ "cn1)": 1313,
1380
+ "c2n[nH]": 1314,
1381
+ "(C)CCC": 1315,
1382
+ "1)N": 1316,
1383
+ "c(Cl)c1)": 1317,
1384
+ "[O-])n1": 1318,
1385
+ "C(C)=O": 1319,
1386
+ "c(=O)n(": 1320,
1387
+ "O=C(Cn1": 1321,
1388
+ "OC)cc": 1322,
1389
+ "2)o1": 1323,
1390
+ "cc2Cl)": 1324,
1391
+ "c3cccnc3": 1325,
1392
+ "C(CC": 1326,
1393
+ "c(F)c3)": 1327,
1394
+ "CCc2c(": 1328,
1395
+ "CC(C)=": 1329,
1396
+ "c2nc3c(": 1330,
1397
+ "n3cc": 1331,
1398
+ "cn3": 1332,
1399
+ "Oc2ccc": 1333,
1400
+ "o2)CC1": 1334,
1401
+ "2)c(": 1335,
1402
+ "c(SC": 1336,
1403
+ "3)CC2": 1337,
1404
+ "C6": 1338,
1405
+ "=O)C1": 1339,
1406
+ "ccc3C)": 1340,
1407
+ "Cc1nn(": 1341,
1408
+ "nc(S": 1342,
1409
+ "O=c1": 1343,
1410
+ "=O)N": 1344,
1411
+ "sc3": 1345,
1412
+ "CCCOC1": 1346,
1413
+ "nn1C": 1347,
1414
+ "CCC(C)C(": 1348,
1415
+ "n(C)c1": 1349,
1416
+ "O=C1N": 1350,
1417
+ "c1ccc(S": 1351,
1418
+ "ncnc1": 1352,
1419
+ "2)N1": 1353,
1420
+ "F)n1": 1354,
1421
+ "CC=CC=": 1355,
1422
+ "c[nH]1)": 1356,
1423
+ "CCC3(CC": 1357,
1424
+ "(N)=S)": 1358,
1425
+ "[NH3+])N": 1359,
1426
+ "e]": 1360,
1427
+ "(=O)[nH]": 1361,
1428
+ "c1nn": 1362,
1429
+ "ncc3": 1363,
1430
+ "(Cc2ccc": 1364,
1431
+ "OCC3": 1365,
1432
+ "sc(": 1366,
1433
+ "CCC1)": 1367,
1434
+ "7)": 1368,
1435
+ "cccc3": 1369,
1436
+ "c5ccc6": 1370,
1437
+ "CCCN2": 1371,
1438
+ "3)ccc2": 1372,
1439
+ "c2=O)cc1": 1373,
1440
+ "C=CC(": 1374,
1441
+ "(C)C(C": 1375,
1442
+ "n2cc(": 1376,
1443
+ "2)C2": 1377,
1444
+ "n2nn": 1378,
1445
+ "C=CCN": 1379,
1446
+ ")N1CCC(": 1380,
1447
+ "cccn1": 1381,
1448
+ "CC1CC(": 1382,
1449
+ "3CCCCC3": 1383,
1450
+ "C[Si](C)": 1384,
1451
+ "Cn1cc(C": 1385,
1452
+ "n2n": 1386,
1453
+ "nnc2": 1387,
1454
+ "3)C2)cc1": 1388,
1455
+ "nc1N": 1389,
1456
+ "CC1CCC(": 1390,
1457
+ "(C(=O)N1": 1391,
1458
+ "B(O)": 1392,
1459
+ "COC(=O)N": 1393,
1460
+ "CC3)cc2": 1394,
1461
+ "N=N": 1395,
1462
+ "c5cccc": 1396,
1463
+ "c1cc(N2": 1397,
1464
+ "ccccc5": 1398,
1465
+ "(CC(O)": 1399,
1466
+ "4)C3": 1400,
1467
+ "cn2)CC1": 1401,
1468
+ "3CCC": 1402,
1469
+ "COC(": 1403,
1470
+ "c1c(N)": 1404,
1471
+ "H]": 1405,
1472
+ "(=O)o": 1406,
1473
+ "c2cncc": 1407,
1474
+ "c-2": 1408,
1475
+ "C[NH3+])": 1409,
1476
+ "ccccc8": 1410,
1477
+ "3)C2)": 1411,
1478
+ "CCC2C3": 1412,
1479
+ "(=O)[O-]": 1413,
1480
+ "F)cc1Cl": 1414,
1481
+ "CCCCCO": 1415,
1482
+ ")cc5": 1416,
1483
+ "CC1CCCC1": 1417,
1484
+ "COCCC": 1418,
1485
+ "3C(=O)": 1419,
1486
+ "Nc1ccc": 1420,
1487
+ "OCO4)": 1421,
1488
+ "CCCl)": 1422,
1489
+ "C(Cc1cn": 1423,
1490
+ "c3cccs": 1424,
1491
+ "c2sc(": 1425,
1492
+ "(CC(C)": 1426,
1493
+ "CCOCC2": 1427,
1494
+ "O=c1[nH]": 1428,
1495
+ "Oc2ccccc": 1429,
1496
+ "CCCN(C": 1430,
1497
+ "c1ccc(C(": 1431,
1498
+ "C(F)F)": 1432,
1499
+ "c1)OCCO2": 1433,
1500
+ "c(Cl)cc2": 1434,
1501
+ "sc1C": 1435,
1502
+ ")cc12": 1436,
1503
+ "c6ccc": 1437,
1504
+ "c2csc(": 1438,
1505
+ "c2cc(N)": 1439,
1506
+ "c4c3": 1440,
1507
+ "c3cs": 1441,
1508
+ "(CCl)": 1442,
1509
+ "c(C)cc1": 1443,
1510
+ "N#C": 1444,
1511
+ "C(O)C1O": 1445,
1512
+ "c2cnn(C)": 1446,
1513
+ "c2)OCCO": 1447,
1514
+ "c2)OCO3)": 1448,
1515
+ "C(C2": 1449,
1516
+ "F)cc4": 1450,
1517
+ "4CC4)": 1451,
1518
+ "c5cc": 1452,
1519
+ "c3c(c2": 1453,
1520
+ "Cn1cc": 1454,
1521
+ "CC2CCCO": 1455,
1522
+ "C(C)C)c1": 1456,
1523
+ "[C-]": 1457,
1524
+ "Cc1c(C)": 1458,
1525
+ "(=O)c1cc": 1459,
1526
+ "c1ccc(O)": 1460,
1527
+ "c2cco": 1461,
1528
+ "[O-])c2": 1462,
1529
+ ")cc1)": 1463,
1530
+ "CC1CCC(C": 1464,
1531
+ "c(C)c3)": 1465,
1532
+ "cc5": 1466,
1533
+ "(C)c2": 1467,
1534
+ "ccc2n1": 1468,
1535
+ "OC(F)F": 1469,
1536
+ "CCCC(C)C": 1470,
1537
+ "COC(C": 1471,
1538
+ "[O-])c3)": 1472,
1539
+ "[N+]1": 1473,
1540
+ "n2C": 1474,
1541
+ "Cc1cc(N2": 1475,
1542
+ "CNC(=O)N": 1476,
1543
+ "C(C)C2": 1477,
1544
+ "CCCO1)": 1478,
1545
+ "Cc1ccc(O": 1479,
1546
+ "CC(C)c1n": 1480,
1547
+ "n(C)n1": 1481,
1548
+ "COCCn1": 1482,
1549
+ "C(C)CC": 1483,
1550
+ "Br)cc2": 1484,
1551
+ "Cl)N": 1485,
1552
+ "c1sccc1": 1486,
1553
+ "(C)cc3": 1487,
1554
+ "CC(=N": 1488,
1555
+ "C2CCCCC": 1489,
1556
+ "c4cccc5": 1490,
1557
+ "C)C(=O)": 1491,
1558
+ "SC(=C": 1492,
1559
+ "(C3": 1493,
1560
+ "c(=O)n3": 1494,
1561
+ "CC(C#N)": 1495,
1562
+ "c(F)c2": 1496,
1563
+ "C1CCC(": 1497,
1564
+ "(C)CC2": 1498,
1565
+ "C=C2": 1499,
1566
+ "2)N": 1500,
1567
+ "cc1Cl)": 1501,
1568
+ "c1cnc(": 1502,
1569
+ "c1C#N": 1503,
1570
+ "N1CCCC1": 1504,
1571
+ "=O)CC1": 1505,
1572
+ "CC(C)(": 1506,
1573
+ "CCCCC=": 1507,
1574
+ "2)cc1C": 1508,
1575
+ "(c2ccc(": 1509,
1576
+ "(CCCC)": 1510,
1577
+ "C(C1": 1511,
1578
+ "4)C2)": 1512,
1579
+ "c1c(Cl)": 1513,
1580
+ "c(C(C)C)": 1514,
1581
+ "n2C)": 1515,
1582
+ "CCc2ccc(": 1516,
1583
+ "COCCN": 1517,
1584
+ "OCc1ccc(": 1518,
1585
+ "c(=O)n(C": 1519,
1586
+ "N(C)C": 1520,
1587
+ "3)C1)C2": 1521,
1588
+ "C(Br)": 1522,
1589
+ "c1nc(N)": 1523,
1590
+ "Br)c2)": 1524,
1591
+ "c(C)c(": 1525,
1592
+ "ccc2Cl)": 1526,
1593
+ "nc2s": 1527,
1594
+ "[nH]3)": 1528,
1595
+ "CSCCC(": 1529,
1596
+ "c8ccccc8": 1530,
1597
+ "CCC(C)C)": 1531,
1598
+ "CCCCC)": 1532,
1599
+ "(C)c(": 1533,
1600
+ "CO1": 1534,
1601
+ "=NC(=O)": 1535,
1602
+ "OC)C(=O)": 1536,
1603
+ "CCCCCC)": 1537,
1604
+ "O)C(O)": 1538,
1605
+ "c2c(c1)": 1539,
1606
+ "n2nc(": 1540,
1607
+ "c(F)cc2": 1541,
1608
+ "C2CC3": 1542,
1609
+ "3)n2)": 1543,
1610
+ "#[N+]": 1544,
1611
+ "c(C)c1C": 1545,
1612
+ "Cc1ccsc1": 1546,
1613
+ "O1)": 1547,
1614
+ "C2CCCCC2": 1548,
1615
+ "s2)C1": 1549,
1616
+ "F)cccc3": 1550,
1617
+ "c(=O)n": 1551,
1618
+ "C1CC": 1552,
1619
+ "[NH3+]C(": 1553,
1620
+ "C2=O)c1": 1554,
1621
+ "(Cl)s1": 1555,
1622
+ "[n+]2": 1556,
1623
+ "[O-])cc3": 1557,
1624
+ "CC(S": 1558,
1625
+ "Cc1ccco1": 1559,
1626
+ ")N1CCCC1": 1560,
1627
+ "cc2C)": 1561,
1628
+ "C(C)O)": 1562,
1629
+ "C1(C)": 1563,
1630
+ "(CNC(=O)": 1564,
1631
+ "C1CC1)": 1565,
1632
+ "O3)": 1566,
1633
+ "COc1c(": 1567,
1634
+ "Cc1nc(N": 1568,
1635
+ "(c2ccc": 1569,
1636
+ "N1CCN": 1570,
1637
+ "(CC[NH+]": 1571,
1638
+ "(C)(C)C": 1572,
1639
+ "c1nc(Cl)": 1573,
1640
+ "c2cc(O)": 1574,
1641
+ "cs2)cc1": 1575,
1642
+ "c2noc(": 1576,
1643
+ "c1cc(O)": 1577,
1644
+ "nc2)cc1": 1578,
1645
+ "ccccc23)": 1579,
1646
+ "2)CC1)": 1580,
1647
+ "N1CCN(": 1581,
1648
+ "(=O)NC": 1582,
1649
+ "O=C(C=C": 1583,
1650
+ "OC)c(C": 1584,
1651
+ "OC(F)F)": 1585,
1652
+ "nc4)": 1586,
1653
+ "c1cnn(": 1587,
1654
+ "(C)cc1": 1588,
1655
+ "S2": 1589,
1656
+ "c1nc(C2": 1590,
1657
+ "c(C)c2)": 1591,
1658
+ "C1=C(": 1592,
1659
+ "COC(C)": 1593,
1660
+ "c3ccc(O": 1594,
1661
+ "CCC(C2": 1595,
1662
+ ")ccc12": 1596,
1663
+ "ncn2": 1597,
1664
+ "c1cncc": 1598,
1665
+ "c3)OCO4)": 1599,
1666
+ "N2CCN": 1600,
1667
+ "CC1CC": 1601,
1668
+ "CC(C)N": 1602,
1669
+ "s2)n1": 1603,
1670
+ "(=O)N(C)": 1604,
1671
+ "Nc1ncn": 1605,
1672
+ "c2cn3": 1606,
1673
+ "(Cl)c3": 1607,
1674
+ "CCC1CCCC": 1608,
1675
+ "CC(=C": 1609,
1676
+ "Oc1ccc(C": 1610,
1677
+ "CC(O)C1": 1611,
1678
+ "=CN": 1612,
1679
+ "(=O)NC2": 1613,
1680
+ "[O-])C(": 1614,
1681
+ "CCCOC": 1615,
1682
+ "(C)C)C2": 1616,
1683
+ "2)cc1Cl": 1617,
1684
+ ")Nc1ccc(": 1618,
1685
+ "c1n[nH]": 1619,
1686
+ ")ccc1N": 1620,
1687
+ "cc(S(=O)": 1621,
1688
+ "CCOCC": 1622,
1689
+ "cn2)c1": 1623,
1690
+ "c4cc5": 1624,
1691
+ "3CCOCC3": 1625,
1692
+ "(C)=O)": 1626,
1693
+ "nnc(": 1627,
1694
+ "c2c(c1": 1628,
1695
+ "CN2C(=O)": 1629,
1696
+ "C1CC2": 1630,
1697
+ "2)C(=O)N": 1631,
1698
+ "NC(N": 1632,
1699
+ "c1nccs1": 1633,
1700
+ "c2C1": 1634,
1701
+ "ccccc12)": 1635,
1702
+ "CCc1nc(": 1636,
1703
+ "nnnc1": 1637,
1704
+ "c2c1C": 1638,
1705
+ "3)n2)c1": 1639,
1706
+ "c5c(": 1640,
1707
+ "=C(N)N": 1641,
1708
+ "2)C(": 1642,
1709
+ "cn3)": 1643,
1710
+ "CCC#N)": 1644,
1711
+ "c1ccc(N)": 1645,
1712
+ "c1ncc(C": 1646,
1713
+ "c3ccco": 1647,
1714
+ "C2C3": 1648,
1715
+ "c2no": 1649,
1716
+ "c2C)CC1": 1650,
1717
+ "c1no": 1651,
1718
+ "c2c(Cl)": 1652,
1719
+ "Br)C1": 1653,
1720
+ "=CC2": 1654,
1721
+ "(C)n1": 1655,
1722
+ "=CC": 1656,
1723
+ "CCn1cn": 1657,
1724
+ "CCCCC(": 1658,
1725
+ "OCC1OC(": 1659,
1726
+ "cs1)": 1660,
1727
+ "[O-])C2": 1661,
1728
+ "=C(Cl)": 1662,
1729
+ "=CC1": 1663,
1730
+ "oc(=O)": 1664,
1731
+ "Oc1ccc": 1665,
1732
+ "C(=S)": 1666,
1733
+ "C=CCCC": 1667,
1734
+ "cc1)": 1668,
1735
+ "c3=O)": 1669,
1736
+ "[nH]2)c1": 1670,
1737
+ "(C1": 1671,
1738
+ "(C)c3)": 1672,
1739
+ "c(F)c1)": 1673,
1740
+ "C=CC(C": 1674,
1741
+ "ccc(N": 1675,
1742
+ "C(=O)OC1": 1676,
1743
+ "CC)c1": 1677,
1744
+ "C12CC3": 1678,
1745
+ "CCC(C(C)": 1679,
1746
+ "Cc1nnc(": 1680,
1747
+ "[O-])c1)": 1681,
1748
+ "O=C(NCC1": 1682,
1749
+ "CCSC1": 1683,
1750
+ "c2)cc1OC": 1684,
1751
+ "s4)": 1685,
1752
+ "c2ccnc(N": 1686,
1753
+ "C(=O)C3": 1687,
1754
+ "c6)": 1688,
1755
+ "(C(N)=O)": 1689,
1756
+ "[NH3+])C": 1690,
1757
+ "=N)": 1691,
1758
+ "(CCCCC": 1692,
1759
+ "3)C2=O)": 1693,
1760
+ "2)n": 1694,
1761
+ "CCOC(C)": 1695,
1762
+ "C(N)=S": 1696,
1763
+ "ccc3Cl)": 1697,
1764
+ "ccccc34)": 1698,
1765
+ "CCCCCCO": 1699,
1766
+ "CC(=O)OC": 1700,
1767
+ "Cn1ccnc1": 1701,
1768
+ "3CC[NH+]": 1702,
1769
+ "(=O)O": 1703,
1770
+ "C21": 1704,
1771
+ "O=C(CC": 1705,
1772
+ "CCCCC3": 1706,
1773
+ "F)cc2)n1": 1707,
1774
+ "c2ccncc2": 1708,
1775
+ "C1(C)C": 1709,
1776
+ "C(=C)": 1710,
1777
+ "c2c(N": 1711,
1778
+ "n2n1": 1712,
1779
+ "c1cnc(N": 1713,
1780
+ "OCC)c1": 1714,
1781
+ "c(CO": 1715,
1782
+ "c1cn2": 1716,
1783
+ "CC1CCCO1": 1717,
1784
+ "nc2)c1": 1718,
1785
+ "c1[nH+]": 1719,
1786
+ "c(F)cc1": 1720,
1787
+ "CC(O)CO": 1721,
1788
+ "ccc2F)": 1722,
1789
+ "c3cc4c(": 1723,
1790
+ "CCOC3": 1724,
1791
+ "(Cc2ccc(": 1725,
1792
+ "c(CO)": 1726,
1793
+ "c2nc(=O)": 1727,
1794
+ "cc1F": 1728,
1795
+ "C(=O)NCC": 1729,
1796
+ "n2nc(C)": 1730,
1797
+ ")ccc1Br": 1731,
1798
+ "N1CCC(": 1732,
1799
+ "c4)CC3)": 1733,
1800
+ "c2nc(N)": 1734,
1801
+ "COCCN1": 1735,
1802
+ "F)ccc1F": 1736,
1803
+ "=[N-])": 1737,
1804
+ "C3)cc1": 1738,
1805
+ "c1ccc(CO": 1739,
1806
+ "c(C)c2": 1740,
1807
+ "cc(O)": 1741,
1808
+ "Cl)n1": 1742,
1809
+ "3)c2)cc1": 1743,
1810
+ "nc5": 1744,
1811
+ "c2C)c1": 1745,
1812
+ "O=C(CC1": 1746,
1813
+ "C(C)S": 1747,
1814
+ "(CCO": 1748,
1815
+ "NC1=O": 1749,
1816
+ "c2ncc(C": 1750,
1817
+ "#N)c1": 1751,
1818
+ "C1CCCN": 1752,
1819
+ "c2n(": 1753,
1820
+ "N1CC": 1754,
1821
+ "C(O)=C(": 1755,
1822
+ "Cc1nn": 1756,
1823
+ "Nc1": 1757,
1824
+ "SC(C)": 1758,
1825
+ "CCC1(C": 1759,
1826
+ "OCO2": 1760,
1827
+ "CCCCCC=": 1761,
1828
+ "CC#CC#": 1762,
1829
+ "c1cc(N)": 1763,
1830
+ "F)cc2F)": 1764,
1831
+ "2)cc(": 1765,
1832
+ "(C)C(C)C": 1766,
1833
+ "cs2)": 1767,
1834
+ "c3cc(OC)": 1768,
1835
+ "CCC21": 1769,
1836
+ "c1=O)": 1770,
1837
+ "(CCOC)": 1771,
1838
+ "c23)": 1772,
1839
+ "C(=O)OC)": 1773,
1840
+ "(C)C)cc2": 1774,
1841
+ "c2nc3": 1775,
1842
+ "OO": 1776,
1843
+ "c2nnnn2": 1777,
1844
+ "3)cc2)": 1778,
1845
+ "=CCCC": 1779,
1846
+ "(Cl)c1Cl": 1780,
1847
+ "N#Cc1": 1781,
1848
+ "c(CN": 1782,
1849
+ "coc(": 1783,
1850
+ "(C(C)(C)": 1784,
1851
+ "OCc1ccc": 1785,
1852
+ "C=CC2": 1786,
1853
+ "%10": 1787,
1854
+ "O=C(NC": 1788,
1855
+ "NC(=O)N(": 1789,
1856
+ "c4ncc": 1790,
1857
+ "=C(S": 1791,
1858
+ "CCOP(=O)": 1792,
1859
+ "Cc1csc(": 1793,
1860
+ "CC(C)O1": 1794,
1861
+ ")cc(C)c1": 1795,
1862
+ "c1ccc(N(": 1796,
1863
+ "CSC1": 1797,
1864
+ "CC2)nc1": 1798,
1865
+ "c4)cc3": 1799,
1866
+ "c1cc(=O)": 1800,
1867
+ "N=[N+]": 1801,
1868
+ "C(c1ccc(": 1802,
1869
+ ")cc5)": 1803,
1870
+ "(C(=O)C3": 1804,
1871
+ "N1CCOCC1": 1805,
1872
+ "Cc2cccc": 1806,
1873
+ "CC2(": 1807,
1874
+ "nn(C)": 1808,
1875
+ "n2cc(C": 1809,
1876
+ "c1nn(": 1810,
1877
+ "CCC1(CC)": 1811,
1878
+ "C(O": 1812,
1879
+ "n3)n": 1813,
1880
+ "o2)C1": 1814,
1881
+ "Cl)C(=O)": 1815,
1882
+ "n3ccc": 1816,
1883
+ "(C)c2)": 1817,
1884
+ "c(Br)cc1": 1818,
1885
+ "n2c(C)": 1819,
1886
+ "Br)s1": 1820,
1887
+ "CC(O)(C": 1821,
1888
+ "C1CC(": 1822,
1889
+ "nc(C)n1": 1823,
1890
+ "c6ccc(": 1824,
1891
+ "C(C)(C": 1825,
1892
+ "F)cc2)C1": 1826,
1893
+ "F)C(=O)": 1827,
1894
+ ")N1CC": 1828,
1895
+ "(Cl)(Cl)": 1829,
1896
+ "c1cccc(O": 1830,
1897
+ "=O)cc2": 1831,
1898
+ "CC)cc1": 1832,
1899
+ "4C)": 1833,
1900
+ "CC2(CC": 1834,
1901
+ "c1co": 1835,
1902
+ "C1CCC2": 1836,
1903
+ "c2c(N)": 1837,
1904
+ "c2ccsc2)": 1838,
1905
+ "(Cl)cc4)": 1839,
1906
+ "C[NH2+]1": 1840,
1907
+ "c[nH]1": 1841,
1908
+ "CCC5": 1842,
1909
+ "c(=O)c3": 1843,
1910
+ "c1cc(OC": 1844,
1911
+ "CCCCCCC1": 1845,
1912
+ "c4c3)": 1846,
1913
+ "CCO2": 1847,
1914
+ "[NH2+]1)": 1848,
1915
+ "C[NH2+]C": 1849,
1916
+ "c1c[nH+]": 1850,
1917
+ "Br)cc1)": 1851,
1918
+ "CCCCC(C)": 1852,
1919
+ "ccc2C)": 1853,
1920
+ "CCn1c(": 1854,
1921
+ "(C(=O)CC": 1855,
1922
+ "CN(S(=O)": 1856,
1923
+ "c(F)c3": 1857,
1924
+ "CC2CCC": 1858,
1925
+ "=CC(": 1859,
1926
+ "N2CC": 1860,
1927
+ "=[NH+]O": 1861,
1928
+ "ccc12": 1862,
1929
+ "C2C1": 1863,
1930
+ "Cc1nc(C)": 1864,
1931
+ "(C(=O)CS": 1865,
1932
+ "OC)n1": 1866,
1933
+ "2)c1=O": 1867,
1934
+ "c%10": 1868,
1935
+ "COCC(C)": 1869,
1936
+ "3)CC2)c1": 1870,
1937
+ "c(NC2": 1871,
1938
+ "c3ccc(O)": 1872,
1939
+ "NC(=O)NC": 1873,
1940
+ "-c2ccc(": 1874,
1941
+ "n2nc(C": 1875,
1942
+ "c2cc(=O)": 1876,
1943
+ "CCC(N)": 1877,
1944
+ "CCn1c(S": 1878,
1945
+ "ncnc3": 1879,
1946
+ "CCCl": 1880,
1947
+ "c1nnc(C": 1881,
1948
+ "(c4ccccc": 1882,
1949
+ "Fc1ccc(": 1883,
1950
+ "c3cc(Br)": 1884,
1951
+ "=Cc1ccc(": 1885,
1952
+ "c2nc(Cl)": 1886,
1953
+ "CCS1": 1887,
1954
+ "COC2": 1888,
1955
+ "SCC(=O)": 1889,
1956
+ "c2[nH+]": 1890,
1957
+ "C(C)(O)": 1891,
1958
+ "COc1cc(N": 1892,
1959
+ "n2ccnc2)": 1893,
1960
+ "(C)C)c(": 1894,
1961
+ "2)c1)": 1895,
1962
+ "(F)c3": 1896,
1963
+ "(F)(F)F": 1897,
1964
+ "CCC4)": 1898,
1965
+ "c(Cl)c2": 1899,
1966
+ "[nH]n1": 1900,
1967
+ "n2c(C": 1901,
1968
+ "(C2CC2)": 1902,
1969
+ "C=CCC1": 1903,
1970
+ "N=C1": 1904,
1971
+ "OC12": 1905,
1972
+ "C4CCCCC": 1906,
1973
+ "(=O)C2": 1907,
1974
+ "CCCC2)C1": 1908,
1975
+ "OC)CC1": 1909,
1976
+ "O=C(NCC": 1910,
1977
+ "nc2c(": 1911,
1978
+ "S1(=O)=O": 1912,
1979
+ "N#CC1": 1913,
1980
+ "Oc3ccc(": 1914,
1981
+ "C(=O)C(C": 1915,
1982
+ "C3O)": 1916,
1983
+ "F)cc1)N": 1917,
1984
+ "CC2(O)": 1918,
1985
+ "c3cc2": 1919,
1986
+ "cnn1C": 1920,
1987
+ "nn2)cc1": 1921,
1988
+ "Cc1cccs1": 1922,
1989
+ "2)no1": 1923,
1990
+ "CC2C1": 1924,
1991
+ ")C2": 1925,
1992
+ "OCC[NH+]": 1926,
1993
+ "SC(": 1927,
1994
+ "3CCN(": 1928,
1995
+ "CCCC4)": 1929,
1996
+ "CCC(C#N)": 1930,
1997
+ "OCC(": 1931,
1998
+ "(C(=O)CO": 1932,
1999
+ "n(C)c1=O": 1933,
2000
+ "[Se]": 1934,
2001
+ "c4ccc(OC": 1935,
2002
+ "F)cc3F)": 1936,
2003
+ "n(CC)": 1937,
2004
+ "[S-])": 1938,
2005
+ "3)C(=O)": 1939,
2006
+ "N#Cc1cc(": 1940,
2007
+ "CCCNC(N)": 1941,
2008
+ "2)nn1": 1942,
2009
+ "c(Cl)cc(": 1943,
2010
+ "Cc1ccnc(": 1944,
2011
+ "C(C)N": 1945,
2012
+ "oc1C": 1946,
2013
+ "cc1OC)": 1947,
2014
+ "4)CC3": 1948,
2015
+ "c3ncccc3": 1949,
2016
+ "cnc3": 1950,
2017
+ "CCC1(C)": 1951,
2018
+ "c1c(O)": 1952,
2019
+ "Cc1nn(C": 1953,
2020
+ "CCC(C1)": 1954,
2021
+ "c3c[nH]": 1955,
2022
+ "(Cl)cc4": 1956,
2023
+ "CCOc1cc": 1957,
2024
+ "CC(Br)": 1958,
2025
+ "CN(CC1": 1959,
2026
+ "c4cc(F)": 1960,
2027
+ "Cc1c(N": 1961,
2028
+ "Cc1cc(F)": 1962,
2029
+ "C(C)CC)": 1963,
2030
+ "c3o": 1964,
2031
+ "c2c[nH+]": 1965,
2032
+ "[O-])N": 1966,
2033
+ "OC)c2)": 1967,
2034
+ "C2CCC1": 1968,
2035
+ "3)nc2": 1969,
2036
+ "Cc1ccc(S": 1970,
2037
+ "=O)ccc1": 1971,
2038
+ ")cccc2": 1972,
2039
+ "CCSCC1": 1973,
2040
+ "N(C)C)": 1974,
2041
+ "c3nccc": 1975,
2042
+ ")cc3)C2": 1976,
2043
+ "OC)cc1)": 1977,
2044
+ "3)n1": 1978,
2045
+ "CC1=N": 1979,
2046
+ "CC(C1": 1980,
2047
+ "n1cc": 1981,
2048
+ "2CCN(": 1982,
2049
+ "CC(CC)": 1983,
2050
+ "(NN)": 1984,
2051
+ "(C)CC2)": 1985,
2052
+ "F)cc1F)": 1986,
2053
+ "Br)cc(C": 1987,
2054
+ "Cn1c(": 1988,
2055
+ "2)cc1F": 1989,
2056
+ "Cc1nc(C2": 1990,
2057
+ "c1ccnc(N": 1991,
2058
+ "OCC(C)C)": 1992,
2059
+ "(C)C)cc(": 1993,
2060
+ "csc1": 1994,
2061
+ "3)c(": 1995,
2062
+ "SS": 1996,
2063
+ "c2c3c(": 1997,
2064
+ "CCCC2)n1": 1998,
2065
+ "C#CCO": 1999,
2066
+ "c1ccoc1": 2000,
2067
+ "C(O)C2": 2001,
2068
+ "4CC[NH+]": 2002,
2069
+ "(C)CC3)": 2003,
2070
+ "CC1CCCC(": 2004,
2071
+ "c1[nH]c(": 2005,
2072
+ "(F)c1F": 2006,
2073
+ ")N(": 2007,
2074
+ "c1nc(=O)": 2008,
2075
+ "c2c(O)": 2009,
2076
+ "c2nn(": 2010,
2077
+ "CCC(C)C1": 2011,
2078
+ "2C(": 2012,
2079
+ "C3)n": 2013,
2080
+ "cnn2": 2014,
2081
+ "2)C(C)": 2015,
2082
+ "CC4)cc3": 2016,
2083
+ "no2)": 2017,
2084
+ "C2(": 2018,
2085
+ "[O-])cn1": 2019,
2086
+ "=C(C)C": 2020,
2087
+ "n6": 2021,
2088
+ "c1noc(": 2022,
2089
+ "c2cnn(": 2023,
2090
+ ")N1CCN": 2024,
2091
+ "N2CCCC2": 2025,
2092
+ ")cc(=O)": 2026,
2093
+ "o2)n1": 2027,
2094
+ "OC)c3)": 2028,
2095
+ "c5cc(": 2029,
2096
+ "c1O": 2030,
2097
+ "Cc1c[nH]": 2031,
2098
+ "OCC(C)C": 2032,
2099
+ "2CC[NH+]": 2033,
2100
+ "O=S1(=O)": 2034,
2101
+ "C(C)(": 2035,
2102
+ "[Si](": 2036,
2103
+ "c2cc1OC": 2037,
2104
+ "c(C)s1": 2038,
2105
+ "c[nH+]1": 2039,
2106
+ ")cc3)n": 2040,
2107
+ ")C": 2041,
2108
+ "c-3": 2042,
2109
+ "CC2(C": 2043,
2110
+ ")ccc1C": 2044,
2111
+ "N3C(=O)": 2045,
2112
+ "Nc1cn": 2046,
2113
+ "CC1CN": 2047,
2114
+ "Br)c1)": 2048,
2115
+ "CC4)cc3)": 2049,
2116
+ "OC)c2": 2050,
2117
+ "3CCN": 2051,
2118
+ "C1(O)": 2052,
2119
+ "nn(": 2053,
2120
+ "=O)cc2)": 2054,
2121
+ "(C)CCO": 2055,
2122
+ "c4nc(": 2056,
2123
+ "c(NS(=O)": 2057,
2124
+ "c2nn[n-]": 2058,
2125
+ "ccc6": 2059,
2126
+ "c3nnc(": 2060,
2127
+ "c2cnc(N": 2061,
2128
+ "co1": 2062,
2129
+ "4)C3)": 2063,
2130
+ "c(C[NH+]": 2064,
2131
+ "CCn1cc": 2065,
2132
+ ")cc1F": 2066,
2133
+ "c2nnc3": 2067,
2134
+ "OCCO)": 2068,
2135
+ "3)n2": 2069,
2136
+ "n2ccnc2": 2070,
2137
+ "c2cn(C)": 2071,
2138
+ "c1ncn2": 2072,
2139
+ "C1CCC1": 2073,
2140
+ "#N)cc2": 2074,
2141
+ "c2ccnn2": 2075,
2142
+ "N1S(=O)": 2076,
2143
+ "[CH]": 2077,
2144
+ "C2CC2)c1": 2078,
2145
+ "CCC12C": 2079,
2146
+ "c6ccc7": 2080,
2147
+ "no1)": 2081,
2148
+ "C2(C)": 2082,
2149
+ "Nc1nc(": 2083,
2150
+ "2CC3": 2084,
2151
+ "c2co": 2085,
2152
+ "c1ccs": 2086,
2153
+ "CC2)cn1": 2087,
2154
+ "c1csc(C": 2088,
2155
+ "C(C)=": 2089,
2156
+ "CCCN(CCC": 2090,
2157
+ "c(N)n": 2091,
2158
+ "Cc1c(Cl)": 2092,
2159
+ "oc2c1": 2093,
2160
+ "F)cc3)n": 2094,
2161
+ "c1)C(": 2095,
2162
+ "CC1CC2": 2096,
2163
+ "C2CCN": 2097,
2164
+ "c(N)n1": 2098,
2165
+ ")C(C)C": 2099,
2166
+ "(CC=C)": 2100,
2167
+ "COCC(": 2101,
2168
+ "C(O)C(": 2102,
2169
+ "c2ccc(I": 2103,
2170
+ "[O-])c(N": 2104,
2171
+ "c1)CCC2": 2105,
2172
+ "c(OC)c3)": 2106,
2173
+ ")C(": 2107,
2174
+ "2)O1": 2108,
2175
+ "c2cn[nH]": 2109,
2176
+ "CCn1cc(": 2110,
2177
+ "C=CCN1": 2111,
2178
+ "c1)OCCO": 2112,
2179
+ "C)CC3)": 2113,
2180
+ "CCC1=O": 2114,
2181
+ "OCCO4)": 2115,
2182
+ "CNc1n": 2116,
2183
+ "(CC3": 2117,
2184
+ "sc(N": 2118,
2185
+ "2)cs1": 2119,
2186
+ "C1(C(=O)": 2120,
2187
+ "CCC1O": 2121,
2188
+ "(C)C)cc3": 2122,
2189
+ "c1ccnc(": 2123,
2190
+ "=N1": 2124,
2191
+ "Cn1nccc1": 2125,
2192
+ "C1=N": 2126,
2193
+ "O=C(OC": 2127,
2194
+ "c1c(Br)": 2128,
2195
+ "c4ccc(N": 2129,
2196
+ "c3ccc(n4": 2130,
2197
+ "c1sc(": 2131,
2198
+ "COc1c(C)": 2132,
2199
+ "CSc1ccc(": 2133,
2200
+ "c1ccc2n": 2134,
2201
+ "3)CC2)n1": 2135,
2202
+ "CC(O)C(": 2136,
2203
+ "c2ccc(N)": 2137,
2204
+ "c4ccncc": 2138,
2205
+ "CCn1cc(C": 2139,
2206
+ "C1=C": 2140,
2207
+ "[O-])s1": 2141,
2208
+ "(=O)CC1": 2142,
2209
+ "(CCN": 2143,
2210
+ "n3C)": 2144,
2211
+ "c2ccn": 2145,
2212
+ "CC(C)Cn1": 2146,
2213
+ "c1ccn": 2147,
2214
+ ")cc(N": 2148,
2215
+ "2CCC": 2149,
2216
+ "ccc2o1": 2150,
2217
+ "COc1cn": 2151,
2218
+ "c(=O)c(": 2152,
2219
+ "Cc1ncc": 2153,
2220
+ "OC5": 2154,
2221
+ "c4ccco": 2155,
2222
+ "Oc2ccc(C": 2156,
2223
+ "(=O)NCC": 2157,
2224
+ "CC3CCC2": 2158,
2225
+ "CCOc1cc(": 2159,
2226
+ "Nc1nc(N": 2160,
2227
+ "oc2": 2161,
2228
+ "CC2)s1": 2162,
2229
+ "nn2)": 2163,
2230
+ "[NH2+]C3": 2164,
2231
+ "cc2c(": 2165,
2232
+ "Cn1nc(": 2166,
2233
+ "cc2F)": 2167,
2234
+ "n2cnc3": 2168,
2235
+ "CCCCN(": 2169,
2236
+ "c1)OCO2)": 2170,
2237
+ "(Cc2cccc": 2171,
2238
+ "C(c2ccc": 2172,
2239
+ "CC2)cc1C": 2173,
2240
+ "c3ccc(C4": 2174,
2241
+ "C2CCCC2": 2175,
2242
+ "CC(OC)": 2176,
2243
+ "c(C=O)": 2177,
2244
+ "C(C#N)=C": 2178,
2245
+ "CCc1c(C)": 2179,
2246
+ "Nc1cccc(": 2180,
2247
+ "-4": 2181,
2248
+ "c2ccc(n3": 2182,
2249
+ "Br)c2": 2183,
2250
+ "CC1CCCN": 2184,
2251
+ "c3ccs": 2185,
2252
+ "Br)CC1": 2186,
2253
+ ")[NH+]1": 2187,
2254
+ "[O-])cn": 2188,
2255
+ "C#N)cc1": 2189,
2256
+ "c(I)": 2190,
2257
+ "F)cc21": 2191,
2258
+ "c(C#N)c1": 2192,
2259
+ "C[NH+](": 2193,
2260
+ "[NH+]2CC": 2194,
2261
+ "c3)cc2": 2195,
2262
+ "c2cnc(": 2196,
2263
+ "cc(F)": 2197,
2264
+ "c2)cn1": 2198,
2265
+ "c(OC)cc2": 2199,
2266
+ "C2(C)C": 2200,
2267
+ "N1CCCCC1": 2201,
2268
+ "CC(C)C(C": 2202,
2269
+ "B(": 2203,
2270
+ "CC(=O)NC": 2204,
2271
+ "=S)N1": 2205,
2272
+ "cccc2": 2206,
2273
+ "O)C1": 2207,
2274
+ "F)cc(F)": 2208,
2275
+ "c34)": 2209,
2276
+ "C=CCC": 2210,
2277
+ "(C)c(C": 2211,
2278
+ "N1C": 2212,
2279
+ "c1ccc(NC": 2213,
2280
+ "(C(=O)C(": 2214,
2281
+ "Cc1cn2": 2215,
2282
+ "c(C)cc2": 2216,
2283
+ "O)ccc1": 2217,
2284
+ "c(C)o1": 2218,
2285
+ ")c1ccc": 2219,
2286
+ "c1cnn2": 2220,
2287
+ "CC3)cc2)": 2221,
2288
+ "C=C=": 2222,
2289
+ ")N(C)": 2223,
2290
+ "OP": 2224,
2291
+ "[NH+](CC": 2225,
2292
+ "C1CO": 2226,
2293
+ "C2C": 2227,
2294
+ "CC#N)": 2228,
2295
+ "[O-])c(C": 2229,
2296
+ "(=S)": 2230,
2297
+ "CCCC(N": 2231,
2298
+ "FC(F)": 2232,
2299
+ "Cc1nccn1": 2233,
2300
+ "CCc2cc(": 2234,
2301
+ "3c(": 2235,
2302
+ "cccnc2": 2236,
2303
+ "CCC1C": 2237,
2304
+ "Sc2n": 2238,
2305
+ "C=C(C#N)": 2239,
2306
+ "COc1n": 2240,
2307
+ "O)CC1": 2241,
2308
+ "[nH]c(C": 2242,
2309
+ ")N(C)C": 2243,
2310
+ "5)ccc4": 2244,
2311
+ "4)C2)C3)": 2245,
2312
+ ")cc1Cl": 2246,
2313
+ "Br)c(": 2247,
2314
+ "(F)F": 2248,
2315
+ "cc1F)": 2249,
2316
+ "4CCCCC4)": 2250,
2317
+ "C=O)": 2251,
2318
+ "5)cc4": 2252,
2319
+ "OC1(C)C": 2253,
2320
+ "5)c4": 2254,
2321
+ "Cc3ccccc": 2255,
2322
+ "CCc2ccc": 2256,
2323
+ "c1ccc(C#": 2257,
2324
+ ")cc1C": 2258,
2325
+ "O=C(CCC": 2259
2326
+ },
2327
+ "merges": [
2328
+ "c c",
2329
+ "C C",
2330
+ "( C",
2331
+ "c 1",
2332
+ "O )",
2333
+ "= O)",
2334
+ "( =O)",
2335
+ "cc c",
2336
+ "(C )",
2337
+ "c 2",
2338
+ "C (=O)",
2339
+ ") cc",
2340
+ "+ ]",
2341
+ "[ N",
2342
+ "CC C",
2343
+ "c1 cc",
2344
+ "[N H",
2345
+ "c1 ccc",
2346
+ "c (",
2347
+ "C (",
2348
+ "c 3",
2349
+ "2 )",
2350
+ "F )",
2351
+ "C 1",
2352
+ "CC CC",
2353
+ "c2 cc",
2354
+ "O C",
2355
+ "c1cc cc",
2356
+ "N C(=O)",
2357
+ ")cc 1",
2358
+ "CC 1",
2359
+ "(=O) N",
2360
+ "(C) C",
2361
+ "- ]",
2362
+ "C O",
2363
+ "c1ccc (",
2364
+ "[ O",
2365
+ "[O -]",
2366
+ "n 1",
2367
+ "[NH +]",
2368
+ "c2 ccc",
2369
+ "3 )",
2370
+ "(C l",
2371
+ "( F)",
2372
+ "c1cccc c1",
2373
+ "cc ccc",
2374
+ "CC O",
2375
+ "C(=O) N",
2376
+ "2 +]",
2377
+ "[NH 2+]",
2378
+ "c2cc ccc",
2379
+ "( CC",
2380
+ "C 2",
2381
+ "[O-] )",
2382
+ "c n",
2383
+ "c1 n",
2384
+ "S (=O)",
2385
+ "[ n",
2386
+ "N )",
2387
+ "O =",
2388
+ "CC N",
2389
+ "(C (=O)",
2390
+ "[n H",
2391
+ "(C (=O)N",
2392
+ "c 4",
2393
+ "(Cl )",
2394
+ "B r",
2395
+ "CC (C)",
2396
+ "C (C)",
2397
+ "[nH ]",
2398
+ "(C)C )",
2399
+ "CC (",
2400
+ "2 )cc1",
2401
+ "c (C",
2402
+ "3 +]",
2403
+ "[NH 3+]",
2404
+ "c3 ccc",
2405
+ "c2ccc (",
2406
+ "C N",
2407
+ "C (C",
2408
+ "c (C)",
2409
+ "c3 ccccc",
2410
+ "C l",
2411
+ "CC CCC",
2412
+ "C =",
2413
+ "cc (",
2414
+ "c2 )",
2415
+ "c2 n",
2416
+ "cc 1",
2417
+ "OC )",
2418
+ "c2ccccc 2",
2419
+ "O= C(",
2420
+ "c1cc (",
2421
+ "F )cc",
2422
+ "c1ccc (C",
2423
+ "CC (=O)N",
2424
+ ") N",
2425
+ "n 2",
2426
+ "CC 2",
2427
+ "[N +]",
2428
+ "2) c1",
2429
+ "C )",
2430
+ "[NH3+] )",
2431
+ "CC [NH+]",
2432
+ "Br )",
2433
+ "4 )",
2434
+ "c( N",
2435
+ "CCC (",
2436
+ "= O",
2437
+ "(Cl )cc",
2438
+ "(F) (F)",
2439
+ "c1 )",
2440
+ "c (=O)",
2441
+ "c3 cc",
2442
+ "[N+] (=O)",
2443
+ "C c1ccc(",
2444
+ "CC (=O)",
2445
+ "c2cc cc",
2446
+ "c1ccc 2",
2447
+ "c1cccc (",
2448
+ "CC 2)",
2449
+ "N 1",
2450
+ "C( F)",
2451
+ "C 3",
2452
+ "s 1",
2453
+ "c3ccccc 3",
2454
+ "C [NH+]",
2455
+ "CCC 1",
2456
+ "ccc 2",
2457
+ "C c1",
2458
+ "n c(",
2459
+ "n c1",
2460
+ "O CC",
2461
+ "C c1cc",
2462
+ "CCCC CCCC",
2463
+ "C( O)",
2464
+ "N 2",
2465
+ "= C",
2466
+ "c3ccc (",
2467
+ "OC (C)",
2468
+ "C c1n",
2469
+ "c3 )",
2470
+ "CO C(=O)",
2471
+ "Cl )",
2472
+ "c (Cl)",
2473
+ "# N)",
2474
+ "C(F) (F)",
2475
+ "c 5",
2476
+ "2) CC1",
2477
+ "(CC )",
2478
+ "O C(=O)",
2479
+ "( O)",
2480
+ "CC [NH2+]",
2481
+ "1 )",
2482
+ "cc 2",
2483
+ "= C(",
2484
+ "C [NH2+]",
2485
+ ")cc c1",
2486
+ "CCN (",
2487
+ "O=C( N",
2488
+ "F )cc1",
2489
+ "(F)(F) F)",
2490
+ "n n",
2491
+ "= N",
2492
+ ")cc 2",
2493
+ "CO c1ccc(",
2494
+ "c4 ccccc",
2495
+ "2) C1",
2496
+ "C S",
2497
+ "CC(C) (C)",
2498
+ "CCCC 1",
2499
+ "c( F)",
2500
+ "c1 cn",
2501
+ "CCO C(=O)",
2502
+ "c2cc (",
2503
+ "CCC N",
2504
+ "CCC (C)",
2505
+ "CC 3)",
2506
+ "n c2",
2507
+ "NC(=O) N",
2508
+ "C (C)C",
2509
+ "= S",
2510
+ "c4 ccc",
2511
+ "CC( O)",
2512
+ "CC 3",
2513
+ "o 1",
2514
+ "c s",
2515
+ "CCC O",
2516
+ "CCC 2",
2517
+ "(C (C)",
2518
+ "(Cl )cc1",
2519
+ "c1ccc2 c(",
2520
+ "c n1",
2521
+ "CC (C",
2522
+ "C(=O)N 1",
2523
+ "( N)",
2524
+ "c2 c(",
2525
+ "[ S",
2526
+ "C n1",
2527
+ "= [NH+]",
2528
+ "C c1ccc",
2529
+ "CCCCC 1",
2530
+ "n 3",
2531
+ "C c1cc(",
2532
+ "O= C(C",
2533
+ "c2 c1",
2534
+ "n cc",
2535
+ "c1cc (C",
2536
+ "2) n1",
2537
+ "c1cccc (C",
2538
+ "CCC (C",
2539
+ "c2ccc 3",
2540
+ "CC )",
2541
+ "c2 cn",
2542
+ "c (C(=O)N",
2543
+ "c1 2",
2544
+ ")N 1",
2545
+ "[nH +]",
2546
+ "[S i",
2547
+ "(CC (=O)N",
2548
+ "c3cc cc",
2549
+ "ccc 3",
2550
+ "C NC(=O)",
2551
+ "[NH+] 1",
2552
+ "CC =",
2553
+ ")cc (",
2554
+ "CC (C)C",
2555
+ "O C1",
2556
+ "n (",
2557
+ "c2cc cc(",
2558
+ "[Si ]",
2559
+ "[NH+] 2",
2560
+ "OC 2",
2561
+ "CC1 )",
2562
+ "c4ccccc 4",
2563
+ "CC n1",
2564
+ "cc cc",
2565
+ "c2ccc (C",
2566
+ "c(C) c1",
2567
+ "(C) cc",
2568
+ "N #",
2569
+ ")cc 2)",
2570
+ "CC NC(=O)",
2571
+ "c1 c(",
2572
+ "CC 2)cc1",
2573
+ "CC S",
2574
+ "3) n",
2575
+ "OC (C",
2576
+ "=O) cc1",
2577
+ "c1cc 2",
2578
+ "c2 )cc1",
2579
+ "n (C)",
2580
+ "5 )",
2581
+ "N S(=O)",
2582
+ "NC(=O) C(",
2583
+ "c1 C",
2584
+ "c [nH]",
2585
+ "N C(",
2586
+ "( [O-])",
2587
+ "c3 n",
2588
+ "(C) C(=O)",
2589
+ "c( OC)",
2590
+ "# N",
2591
+ ")cc 3)",
2592
+ "CCCC 2",
2593
+ "CN 1",
2594
+ "c( N)",
2595
+ "C c1ccc(C",
2596
+ "(C) (=O)",
2597
+ "C (C)C)",
2598
+ "c 6",
2599
+ "O= C1",
2600
+ "n c(N",
2601
+ "C[NH+] 1",
2602
+ ")cc 3",
2603
+ ") C(=O)",
2604
+ "c (C(=O)",
2605
+ "C 2)",
2606
+ "CC 2)c1",
2607
+ "c1cccc 2",
2608
+ "Br )cc1",
2609
+ "N (",
2610
+ "cc (C",
2611
+ "C1 CC1",
2612
+ "S (C)(=O)",
2613
+ "n c(C",
2614
+ "CC C(=O)",
2615
+ "ccc (",
2616
+ "CC C(=O)N",
2617
+ "[O-] )cc1",
2618
+ "c( NC(=O)",
2619
+ "C( N)",
2620
+ "CN (",
2621
+ "CCN 1",
2622
+ "c1ccc( N",
2623
+ "c3 c(",
2624
+ "C 4",
2625
+ "[O-]) c1",
2626
+ "O C(",
2627
+ "c1ccc (C)",
2628
+ "(C(=O)N 2",
2629
+ ")cc 2)cc1",
2630
+ "[nH] 1",
2631
+ "c (Cl)cc",
2632
+ "O CCO",
2633
+ "C1 =O",
2634
+ "C c1cccc(",
2635
+ "= C2",
2636
+ "n (C",
2637
+ "CCC [NH+]",
2638
+ "CCCC (",
2639
+ "=S )",
2640
+ "O 1",
2641
+ "n n1",
2642
+ "CCC 3",
2643
+ "Br) c1",
2644
+ "NC(=O) C1",
2645
+ "[Si] (C)",
2646
+ "(CC (=O)",
2647
+ "cc 3",
2648
+ "OC O",
2649
+ ") C1",
2650
+ "c4ccc (",
2651
+ "N1 C(=O)",
2652
+ "n 2)",
2653
+ "c2) c1",
2654
+ "C(C) (C)C",
2655
+ "n c3",
2656
+ "O CC(=O)N",
2657
+ "c2ccc3 c(",
2658
+ "c4 )",
2659
+ "=S )N",
2660
+ "N c1n",
2661
+ "Cc1 cn",
2662
+ "c5 ccccc",
2663
+ "NC(=O) C2",
2664
+ "(N) =O)",
2665
+ "CC S(=O)",
2666
+ "F)cc 2",
2667
+ "P (=O)",
2668
+ "ccccc 2",
2669
+ "(Cl) c1",
2670
+ "O) cc1",
2671
+ "c1ccc(C 2",
2672
+ "CC c1n",
2673
+ "C(C) (C)",
2674
+ "c(Cl) c1",
2675
+ "c2ccc( N",
2676
+ "C( N",
2677
+ "n cn",
2678
+ "(C 2",
2679
+ "c( S",
2680
+ "c3 cc(",
2681
+ "( CCC",
2682
+ "C #",
2683
+ "c(F) c1",
2684
+ "c2 s",
2685
+ "3) C2",
2686
+ "C S(=O)",
2687
+ "CCO CC1",
2688
+ "CC1 (C)",
2689
+ "OCC )",
2690
+ "CN (C(=O)",
2691
+ "c( O)",
2692
+ "n cc1",
2693
+ "cc c1",
2694
+ "CO c1cc(",
2695
+ "3 CCCC",
2696
+ "Cc1cc (C)",
2697
+ "N2 C(=O)",
2698
+ "CC (CC",
2699
+ "CC[NH+] 1",
2700
+ "c1 =O",
2701
+ "N =",
2702
+ "C 3)",
2703
+ "c s1",
2704
+ "n 3)",
2705
+ "c3ccc 4",
2706
+ "I )",
2707
+ "c2cc 3",
2708
+ "CC (C)C)",
2709
+ "CC 4",
2710
+ "C )cc1",
2711
+ "c2n c(",
2712
+ "s 2)",
2713
+ "C(F)(F) F",
2714
+ "C= C",
2715
+ "C(=O)N C(",
2716
+ "c(C 2",
2717
+ "c2) CC1",
2718
+ "c1n cc",
2719
+ "(C) C1",
2720
+ "(C O)",
2721
+ "CC(=O)N 1",
2722
+ "(C) c1",
2723
+ "CCC( O)",
2724
+ "c4 cc",
2725
+ "C(=O)N 2",
2726
+ "s c1",
2727
+ "( [NH3+])",
2728
+ "CO C1",
2729
+ "[O-]) C1",
2730
+ "OC )cc1",
2731
+ "c1ccc( O",
2732
+ "C(=O)N (",
2733
+ "CO c1ccc",
2734
+ "(=O)N 2",
2735
+ "C c1cccc",
2736
+ "(C)C )cc1",
2737
+ "n1 )",
2738
+ "3 )cc1",
2739
+ "=C( N",
2740
+ "l )",
2741
+ "CCC =",
2742
+ "(F) c1",
2743
+ "c(C) cc",
2744
+ "c2n cc",
2745
+ "(Cl)cc 2",
2746
+ "(C #N)",
2747
+ "OC 3",
2748
+ "n 2)cc1",
2749
+ "ccc2 1",
2750
+ "c1 s",
2751
+ "(C)C (C)",
2752
+ "(C(=O)N C",
2753
+ "CN (C)",
2754
+ "[NH2+] C",
2755
+ "OC) c1",
2756
+ "C(C #N)",
2757
+ "c1n c(",
2758
+ "CC 4)",
2759
+ "CO c1cc",
2760
+ "( N",
2761
+ "CCCCC 2",
2762
+ "C1 =",
2763
+ "F)cc 2)",
2764
+ "C1 )",
2765
+ "s1 )",
2766
+ "n c(C)",
2767
+ "ccccc 3",
2768
+ "=O) c1",
2769
+ "C OC",
2770
+ "o 2)",
2771
+ "CO c1cc(C",
2772
+ "c2cc (Cl)",
2773
+ "CCO CCO",
2774
+ "CCCC O",
2775
+ "c3cc cc(",
2776
+ "CCN (CC)",
2777
+ "c2ccc( OC",
2778
+ "c(C (C)",
2779
+ "N (C",
2780
+ "N (C)",
2781
+ "F)cc c1",
2782
+ "C(C O)",
2783
+ "N (C(=O)",
2784
+ "[NH2+] C1",
2785
+ "c 7",
2786
+ "OC(C) =O)",
2787
+ "c3 cn",
2788
+ "n 4",
2789
+ "CCN (C",
2790
+ "CN (C",
2791
+ "(CC O)",
2792
+ "S C",
2793
+ "c5ccccc 5",
2794
+ "= C1",
2795
+ "c1 cs",
2796
+ "c1ccc( F)",
2797
+ "o c(",
2798
+ "(C)C 2",
2799
+ "C (C(=O)",
2800
+ "c(N 2",
2801
+ "CCO CC2)",
2802
+ "CC c1ccc(",
2803
+ "( CCCC",
2804
+ "c2cc (C",
2805
+ "c2ccc( Br",
2806
+ "c3ccc (C",
2807
+ "[nH] c(",
2808
+ "3) CC2)",
2809
+ "NC(=O) C",
2810
+ "O CCC",
2811
+ "( c2ccccc",
2812
+ "n o1",
2813
+ "(=O)N 1",
2814
+ "c(OC) c1",
2815
+ "c2cc (C)",
2816
+ "ccc 4",
2817
+ "C(O) C(O)",
2818
+ "CCO C1",
2819
+ "O CC1",
2820
+ "c2ccc (C)",
2821
+ "c1cc2 c(",
2822
+ "c2cccc 3",
2823
+ "O) cc",
2824
+ "o 1)",
2825
+ "=O) cc",
2826
+ "c [nH+]",
2827
+ "CC OC",
2828
+ "O= S(=O)",
2829
+ "CCCC (C)",
2830
+ "N =C",
2831
+ "CCC n1",
2832
+ "3 CCO",
2833
+ "c4 cccc",
2834
+ "c2 C)",
2835
+ "c2n cn",
2836
+ "C( =",
2837
+ "c( Br)",
2838
+ "CCC 4",
2839
+ "c2 cs",
2840
+ "c2cccc (C",
2841
+ "c( O",
2842
+ "[n +]",
2843
+ "CCCCC 2)",
2844
+ "(C)C) c1",
2845
+ "C1 CCCCC1",
2846
+ "F)cc 3)",
2847
+ ") C(=O)N",
2848
+ "Cc1cc (C",
2849
+ "(Cl)cc c1",
2850
+ "CCCC 2)",
2851
+ "c1n nc(",
2852
+ "c1 2)",
2853
+ "n c2)",
2854
+ "C( =C",
2855
+ "c1 c(C)",
2856
+ "(Cl)cc 2)",
2857
+ "ccccc 6",
2858
+ "C1 2",
2859
+ "% 1",
2860
+ "C( NC(=O)",
2861
+ "OC(C O)",
2862
+ "(CC 2",
2863
+ "c2cc (F)",
2864
+ "c1ccc( N2",
2865
+ "o 2)cc1",
2866
+ "c1ccc s1",
2867
+ "[O-] )cc",
2868
+ "C2 =O)",
2869
+ "c1ccc nc1",
2870
+ "=C( N)",
2871
+ "C= C1",
2872
+ "c1cc( N",
2873
+ "3 CCCCC",
2874
+ "CCC )",
2875
+ "C( =S)N",
2876
+ "c(C #N)",
2877
+ "c2 1",
2878
+ "[N -]",
2879
+ "CC O)",
2880
+ "n2 cc",
2881
+ "c( S(=O)",
2882
+ "CCCN (",
2883
+ "C (C(=O)N",
2884
+ "c1n cn",
2885
+ "n1 C",
2886
+ "c2ccc (F)",
2887
+ "C[NH+] 2",
2888
+ "NC(=O) CS",
2889
+ "c2n nc(",
2890
+ "( O",
2891
+ "n2 cn",
2892
+ "(C (C)C)",
2893
+ "c3ccccc 2",
2894
+ "n 2)c1",
2895
+ "[NH2+] C2",
2896
+ "Cc1cc 2",
2897
+ "N) =O)",
2898
+ "s 3)",
2899
+ "O C(=O)N",
2900
+ "C1 CCC",
2901
+ "F)cc 3",
2902
+ "CCCCC 1)",
2903
+ "O c1ccc(",
2904
+ "(C)C )cc",
2905
+ "N C(C)",
2906
+ "CN1 C(=O)",
2907
+ "= [NH2+]",
2908
+ "C1 O",
2909
+ "c( OC",
2910
+ "c6 ccccc6",
2911
+ "S 1",
2912
+ "CC1 (",
2913
+ "S CC(=O)N",
2914
+ "c1 [nH]",
2915
+ "c2) C1",
2916
+ "c2 c(C)",
2917
+ "= CC(=O)",
2918
+ "c3ccc4 c(",
2919
+ "O C(F)(F)",
2920
+ "(N) (=O)",
2921
+ "CCC (CC)",
2922
+ "c1 c[nH]",
2923
+ "c o",
2924
+ "CC (C(=O)",
2925
+ "C O)",
2926
+ "n o",
2927
+ "CCN (CC",
2928
+ "s 2)cc1",
2929
+ "O CCCC",
2930
+ "C(=O) OC",
2931
+ "c2 n1",
2932
+ "C2 )cc1",
2933
+ "F) c1",
2934
+ "nc1 2",
2935
+ "Br )cc",
2936
+ "N C1",
2937
+ "CCN C(N",
2938
+ "3) CC1",
2939
+ "c2) n1",
2940
+ "c2 [nH]",
2941
+ "C= C(",
2942
+ "3CCO CC3)",
2943
+ "= C(O)",
2944
+ "n cc2",
2945
+ "C #N)",
2946
+ "c1cc ncc1",
2947
+ "c( Br)c1",
2948
+ "CCCC CC1",
2949
+ ")cc2 1",
2950
+ "- 2",
2951
+ "C 2)c1",
2952
+ "Cc1 cs",
2953
+ "N 3",
2954
+ "O= [N+]",
2955
+ "Br )ccc1",
2956
+ "c2 =O)",
2957
+ "C c1ccc2",
2958
+ "C c2ccc",
2959
+ "NC(=O) CO",
2960
+ "C1 CCCC1",
2961
+ "3) c1",
2962
+ "c(F) c(F)",
2963
+ "C[NH+] (C",
2964
+ "C) c1",
2965
+ "c1cc (C)",
2966
+ "C #N",
2967
+ "NC( =S)N",
2968
+ "F)cc1 )",
2969
+ "n nc1",
2970
+ "CC(C) O",
2971
+ "c5 ccc",
2972
+ "O) c1",
2973
+ ")cc 2)c1",
2974
+ "S (N)(=O)",
2975
+ "CC2) CC1",
2976
+ "C 5",
2977
+ "CC #",
2978
+ "4) CC3)",
2979
+ "O [Si](C)",
2980
+ "CCCC (C",
2981
+ "CCN 2",
2982
+ "CC1 2",
2983
+ "c1c( F)cc",
2984
+ "n n2",
2985
+ "CO c1ccc2",
2986
+ "CC( N",
2987
+ "c2n c(C",
2988
+ "O=C(N C1",
2989
+ "C= C(C)",
2990
+ "cc( N",
2991
+ "n (CC",
2992
+ "3 CC3)",
2993
+ "n 2)CC1",
2994
+ "o c(C",
2995
+ "n c3)",
2996
+ "c4 c(",
2997
+ "CC1 CCC",
2998
+ "(Cl)cc 3",
2999
+ "Cl )cc1",
3000
+ "c2cc( Br)",
3001
+ "O C(F)",
3002
+ "c2cc ncc",
3003
+ "n [nH]",
3004
+ "O CC2",
3005
+ "(Cl)cc 3)",
3006
+ "Cc1n c(",
3007
+ "CN 2",
3008
+ "nc( N)",
3009
+ "C2 =O)cc1",
3010
+ "nc2 c1",
3011
+ "CCCC (=O)",
3012
+ "(F) F)",
3013
+ "C c2ccccc",
3014
+ "(C) CC1",
3015
+ "c3 s",
3016
+ "NC(=O) N2",
3017
+ "CCCC 1)",
3018
+ "O P(=O)",
3019
+ "N c1ccc(",
3020
+ "C(N) =O",
3021
+ ")cc 2)CC1",
3022
+ "6 )",
3023
+ "CC2) n1",
3024
+ "n cn1",
3025
+ "CCC S",
3026
+ "O CC(=O)",
3027
+ "2)C1 =O",
3028
+ "CC2 )ccc1",
3029
+ ")cc 4",
3030
+ ")cc cc1",
3031
+ "C(=O)N (C",
3032
+ "s c2c1",
3033
+ "C(=O) C1",
3034
+ "o 3)",
3035
+ "c1cc (Cl)",
3036
+ "OC) c(OC)",
3037
+ "Cc1ccc( N",
3038
+ "c3ccc( OC",
3039
+ "c2cc 1",
3040
+ "CC1 =",
3041
+ "C( CC)",
3042
+ "CC =C",
3043
+ "c2c( F)cc",
3044
+ "C( OC",
3045
+ "c1) OCO",
3046
+ "c3 ncc",
3047
+ "N (CC",
3048
+ "c1n c(N",
3049
+ "c(=O) n2",
3050
+ "c3ccc( N",
3051
+ "c 8",
3052
+ "c3 C)",
3053
+ "CC1 (C",
3054
+ "CC1 C",
3055
+ "c1) C(=O)",
3056
+ "cc 4",
3057
+ "CN (CC",
3058
+ "ccc2 c1",
3059
+ "ccccc 4",
3060
+ "c4ccc 5",
3061
+ "C(=O)N C1",
3062
+ "(C)C 3",
3063
+ "CC( =",
3064
+ "CC (F)(F)",
3065
+ "cc( Br)",
3066
+ "(C(=O) C2",
3067
+ "CC[NH+] 2",
3068
+ "CC( O",
3069
+ "C(C) O",
3070
+ "3) C1",
3071
+ "CCC (CC",
3072
+ "[O-]) CC1",
3073
+ "CCC= CCC=",
3074
+ "[NH3+] C1",
3075
+ "c(=O) n1",
3076
+ "cn 2",
3077
+ "[NH+] 3",
3078
+ "[NH2+] 1",
3079
+ "CCC 2)",
3080
+ "( OC)",
3081
+ "cc nc1",
3082
+ "c5 ccc(",
3083
+ "CC2) C1",
3084
+ "NC(=O) N1",
3085
+ "c1 N",
3086
+ "(C(C) =O)",
3087
+ "CCC( N",
3088
+ "c3 c2",
3089
+ "CC l)",
3090
+ "C (Cl)",
3091
+ "c2ccc s2)",
3092
+ "Cc1 c(",
3093
+ "CC(C) C1",
3094
+ "3CCCC 3",
3095
+ "C(N) =O)",
3096
+ "[O-] )cc2",
3097
+ "n c4",
3098
+ "( c3ccccc",
3099
+ "[NH2+] C)",
3100
+ "(C)C) C1",
3101
+ "n1 cn",
3102
+ "O 2",
3103
+ "c3 nn",
3104
+ "OC(C) (C)",
3105
+ "c1n c(C",
3106
+ "c3cccc 4",
3107
+ "c2ccc nc2",
3108
+ ") c1ccc(",
3109
+ "CC1 (C)C",
3110
+ "c3cc 4",
3111
+ "n c1)",
3112
+ "CO c1cccc",
3113
+ "N C2",
3114
+ "Cc1 s",
3115
+ "CCCC 3",
3116
+ "CC(=O)N 2",
3117
+ "=N NC(=O)",
3118
+ "= C(C)",
3119
+ "[NH+] (C)",
3120
+ "CC(CC (C",
3121
+ "= C(C",
3122
+ "# C",
3123
+ "F) CC1",
3124
+ "=C (C(=O)",
3125
+ "C(=O) C(",
3126
+ "s c2",
3127
+ "c1ccc o1",
3128
+ "[nH] c1",
3129
+ "(=O) C1",
3130
+ "[NH2+] C(",
3131
+ "c2cc3 c(",
3132
+ "=C( [O-])",
3133
+ "c3cc (Cl)",
3134
+ "c1cccc n1",
3135
+ "o c1",
3136
+ "( NC(=O)",
3137
+ "CC(C) (O)",
3138
+ "c1ncc cc1",
3139
+ "OCC (C",
3140
+ "CCO 1",
3141
+ "3 C)",
3142
+ "Cl) c1",
3143
+ "CCC (C)C",
3144
+ "S c1n",
3145
+ "ccccc 7",
3146
+ "cccc c12",
3147
+ "#N )cc1",
3148
+ "O c2ccc(",
3149
+ "CC OC2",
3150
+ "c3 nc(",
3151
+ "C(=O) C2",
3152
+ "Cc1n o",
3153
+ "c5 )",
3154
+ "n2 ccc",
3155
+ "Cc1 [nH]",
3156
+ "3)n 2)cc1",
3157
+ "(Cl) c2",
3158
+ "C1 C",
3159
+ "CC(C O)",
3160
+ "O CC(O)",
3161
+ "= C(C#N)",
3162
+ "CCCO 1",
3163
+ ")cc 4)",
3164
+ "n1 2",
3165
+ "[nH] c2c1",
3166
+ "cc cc1",
3167
+ "CCC N1",
3168
+ "c1 c(C",
3169
+ "Cc1n n(C)",
3170
+ "c3cc (C)",
3171
+ "N C",
3172
+ "2) nc1",
3173
+ "C3 =O)",
3174
+ "OC (C)C)",
3175
+ "CCO CC1)",
3176
+ "OC (C)C",
3177
+ "(Cl) c2)",
3178
+ "4 CCCC",
3179
+ "c2 c[nH]",
3180
+ "C= CC(=O)",
3181
+ "O C(=O)N1",
3182
+ "C1 CCN",
3183
+ "S )",
3184
+ "c2n c(N",
3185
+ "[NH2+] CC",
3186
+ "C= N",
3187
+ "c %1",
3188
+ "CCN (C)",
3189
+ "=[NH2+] )",
3190
+ "c(C) n1",
3191
+ "C( =[NH+]",
3192
+ "n [nH]1",
3193
+ "(Cl)cc1 )",
3194
+ "= CC=",
3195
+ "c2) OCO",
3196
+ "cc2 1",
3197
+ "c1cc( Br)",
3198
+ "F)cc1 F",
3199
+ "c1cc (F)",
3200
+ "CCCC )",
3201
+ "(F) c2)",
3202
+ "CCC( O",
3203
+ "C(=O) O",
3204
+ ")N (C",
3205
+ "(C(=O) C",
3206
+ "=C2 S",
3207
+ "= C)",
3208
+ "NC(=O) CC",
3209
+ "NC(=O) C3",
3210
+ "c(Cl) c2)",
3211
+ "s 2)c1",
3212
+ "C= CC=",
3213
+ "CCN S(=O)",
3214
+ "CC( N)=O)",
3215
+ "c(Cl) c3)",
3216
+ "OC)c1 OC",
3217
+ "(C) (C)",
3218
+ "OCCO 2",
3219
+ "F C(F)(F)",
3220
+ "c4 cc(",
3221
+ "3CCCC 3)",
3222
+ "Cl) CC1",
3223
+ "ccccc2 c1",
3224
+ "ccc (C",
3225
+ "cn 2)",
3226
+ "CO CCO",
3227
+ "CC1 (O)",
3228
+ "nc(N 2",
3229
+ "[nH] 1)",
3230
+ "CC(C) C(",
3231
+ "o 2)c1",
3232
+ "F) C1",
3233
+ "c7 ccccc7",
3234
+ "cc (C)",
3235
+ "F)cc (",
3236
+ "c3 c(C)",
3237
+ "[nH] 2)",
3238
+ "(Cl) c3)",
3239
+ "c2ccc o2)",
3240
+ "S C1",
3241
+ "n2 c(",
3242
+ "C2 =O",
3243
+ "nn 3",
3244
+ "o n1",
3245
+ "[nH] c2",
3246
+ "CC 5",
3247
+ "4) c3)",
3248
+ "C) CC1",
3249
+ "C2 CCCC",
3250
+ "c3ccc( Br",
3251
+ "=C2 C(=O)",
3252
+ "3CCCCC 3)",
3253
+ "c2ccc( O",
3254
+ "2 )ccc1",
3255
+ "(C(=O)N 3",
3256
+ "c3cc ncc",
3257
+ "CC1 CCCC",
3258
+ "n2 c(=O)",
3259
+ "N) c1",
3260
+ ")N C1",
3261
+ "CN S(=O)",
3262
+ "C2 CC2)",
3263
+ "Cn1 cn",
3264
+ "CC(C) (C",
3265
+ "C c1cccc2",
3266
+ "(C)C) CC1",
3267
+ "cc cc(",
3268
+ "cn c1",
3269
+ "[ C",
3270
+ "4 CCO",
3271
+ "n c2cccc",
3272
+ "[nH] c1=O",
3273
+ "CC c1cc",
3274
+ "C= C(C",
3275
+ "c1) C1",
3276
+ "F)cc (C",
3277
+ "CC c1",
3278
+ "c(C) c(C)",
3279
+ "C 2)n1",
3280
+ "CCC S(=O)",
3281
+ "OC) c(",
3282
+ "N =C(",
3283
+ "(C) (C)C)",
3284
+ "(C) O",
3285
+ "= CC(=O)N",
3286
+ "CCC1 2",
3287
+ "c1cccc( N",
3288
+ "Cc1n c(C",
3289
+ "= C3",
3290
+ "4) c3",
3291
+ "c2cc( N",
3292
+ "C3 CC3)",
3293
+ "NC( =S)",
3294
+ "C= CC1",
3295
+ "CCC O)",
3296
+ "CCCC CC",
3297
+ "Cc1cc( N",
3298
+ "c2cc s",
3299
+ "CC(C) CC(",
3300
+ "S C)",
3301
+ "C(C) =O)",
3302
+ "= [N+]",
3303
+ "[NH+] (C",
3304
+ "CO CC1",
3305
+ "3) c2",
3306
+ "c2 c(C)cc",
3307
+ "s 2)CC1",
3308
+ "CC l",
3309
+ "CCC2 (",
3310
+ ") (",
3311
+ "c2n n",
3312
+ "C c2ccc(",
3313
+ "=C(N) N)",
3314
+ "2) C(=O)",
3315
+ "c1cc(C 2",
3316
+ "(C(=O) OC",
3317
+ "cc1 Cl",
3318
+ "CO c1",
3319
+ "C 4)",
3320
+ "CCC 3)",
3321
+ ")cc n1",
3322
+ "c3cccc (C",
3323
+ "CC(=O)N (",
3324
+ "c1ccc( OC",
3325
+ "CCC c1n",
3326
+ "c3ccc s3)",
3327
+ "CC( N)",
3328
+ "cc n1",
3329
+ "Br )cc(",
3330
+ "[O-]) c(",
3331
+ "c2 o",
3332
+ "C( OC(=O)",
3333
+ ") NC(=O)",
3334
+ "2)cc1 OC",
3335
+ "C= CCO",
3336
+ ")ccc1 OC",
3337
+ "c2ncc cc2",
3338
+ "2) s1",
3339
+ "O=S(=O) (",
3340
+ "c2ccc(N 3",
3341
+ "4 )cc3",
3342
+ "[nH] c3",
3343
+ "(C)C) n1",
3344
+ "CS c1n",
3345
+ "C2 CCC",
3346
+ "C= CC",
3347
+ "c3n cn",
3348
+ "C 2)CC1",
3349
+ "C2 O)",
3350
+ "c3ccc (C)",
3351
+ "c(=O) o",
3352
+ "O=C(C 1",
3353
+ "[nH+] 1",
3354
+ ")cc2 c1",
3355
+ "C( OC)",
3356
+ "c (Cl)cc1",
3357
+ "[n -]",
3358
+ "C 2)C1",
3359
+ "N N",
3360
+ "(C(=O) O",
3361
+ "c1ccc s1)",
3362
+ "[ P",
3363
+ "c1ccc o1)",
3364
+ "O CCCO",
3365
+ "(F) c3)",
3366
+ "Cc1 o",
3367
+ "CC(C) n1",
3368
+ "C= CCn1",
3369
+ "cc1 C",
3370
+ "c4 n",
3371
+ "CC1 CC1",
3372
+ "c2n nc(C",
3373
+ "c(N) c1",
3374
+ "CCCC CCC",
3375
+ ")N1 CCN(",
3376
+ "c(N 3",
3377
+ "c(O) c1",
3378
+ "c3cc (F)",
3379
+ "(C) CC",
3380
+ "C) C1",
3381
+ "2)cc1 )",
3382
+ ")cc (C",
3383
+ "C3 CCCCC",
3384
+ "c3ccc (F)",
3385
+ "c(=O) c1",
3386
+ "c1)OCO 2",
3387
+ "c4 cn",
3388
+ "c2ccc( O)",
3389
+ "n 2)C1",
3390
+ "cc (Cl)",
3391
+ "c(F) c2)",
3392
+ "C1 (",
3393
+ "O S(=O)",
3394
+ "NC(C) =O)",
3395
+ "C( CC(=O)",
3396
+ "Cn1 cc(",
3397
+ "(C)cc 2)",
3398
+ "S1 (=O)",
3399
+ "c2ccc (CC",
3400
+ "(CC) CC",
3401
+ "2)c1 C",
3402
+ "c1n cc(",
3403
+ "(C)cc 3)",
3404
+ "CC(=O) O",
3405
+ "(F) c2",
3406
+ "4) ccc3",
3407
+ "ccc 5",
3408
+ "c1ccc(C N",
3409
+ "c(=O) c2",
3410
+ "c1cn cc(",
3411
+ "c2n c(C)",
3412
+ "[P +]",
3413
+ "c3 [nH]",
3414
+ "[n+] 1",
3415
+ "CCO c1ccc",
3416
+ "C [NH3+]",
3417
+ "4 CCCCC",
3418
+ "4 )cc3)",
3419
+ "C(=O)N C",
3420
+ "n cc(",
3421
+ "3) C1)",
3422
+ "CC (F)",
3423
+ "C(C) C1",
3424
+ "N c1cc",
3425
+ "[O-]) c2)",
3426
+ "C1 =O)",
3427
+ "(C) C(",
3428
+ "O 2)",
3429
+ "=N N",
3430
+ "CCCC 3)",
3431
+ "c2 c(C",
3432
+ "c2n ccc",
3433
+ "nc(C 2",
3434
+ "C1 (C",
3435
+ "c(N C",
3436
+ "nc2 n1",
3437
+ "c2s ccc2",
3438
+ "n 5",
3439
+ "= [N-]",
3440
+ "[S -]",
3441
+ "CC OC(C",
3442
+ "CO P(=O)",
3443
+ "c1ccc( n2",
3444
+ "( Br)",
3445
+ "c3cc (C",
3446
+ "c1 o",
3447
+ "=[NH+] O)",
3448
+ "CC1 CCN",
3449
+ "CC2 CCC1",
3450
+ ")ccc1 Cl",
3451
+ "C(=O)N C2",
3452
+ "I) c1",
3453
+ "c3ccc o3)",
3454
+ "nn n1",
3455
+ "CCC(C O)",
3456
+ "n2 c1",
3457
+ "nc2 1",
3458
+ "n3 cn",
3459
+ "cn n1",
3460
+ "n(C 2",
3461
+ "c4ccc (C",
3462
+ ") S(=O)",
3463
+ "cc1 2",
3464
+ "N c1cc(",
3465
+ "I )cc1",
3466
+ "nc1 C",
3467
+ "NC( N)",
3468
+ "cn c2",
3469
+ "(CC C(=O)",
3470
+ "n 4)",
3471
+ "c2cc( OC)",
3472
+ "2) cn1",
3473
+ "CC c1cccc",
3474
+ "Cn1 c(=O)",
3475
+ ")cc 2)n1",
3476
+ "C1 C2",
3477
+ "(CC) CC)",
3478
+ "(CC (C)C)",
3479
+ "C1 CCN(",
3480
+ "C( =N",
3481
+ "P (",
3482
+ "O=C(C O",
3483
+ "(F) c1)",
3484
+ "c2n cc(",
3485
+ "C= C(C)C",
3486
+ "NC(=O) CN",
3487
+ "CC c1cn",
3488
+ "n 2)n1",
3489
+ "c( [O-])",
3490
+ "CC1 O",
3491
+ "(C)cc (C)",
3492
+ "C(=O) OCC",
3493
+ "(C)cc c1",
3494
+ "Cc1 c(C",
3495
+ "S C2",
3496
+ ")ccc1 F",
3497
+ "cn 2)cc1",
3498
+ "c1n c2c(",
3499
+ "CO c1cc2",
3500
+ "CC2 1",
3501
+ "CC c1ccc",
3502
+ "C(F) F",
3503
+ "CC1 CN(",
3504
+ "c2n nn",
3505
+ "(Cl) c1)",
3506
+ "c4cc cc(",
3507
+ "CCO C(",
3508
+ "c(F)c1 F",
3509
+ "Cl) C1",
3510
+ "c2 C)cc1",
3511
+ "(Cl) c(",
3512
+ "CCCC (O)",
3513
+ "OC 4",
3514
+ "c1cs c(",
3515
+ "CC 5)",
3516
+ "4CCO CC4)",
3517
+ "CC (Cl)",
3518
+ "C(O) C1",
3519
+ "3 )cc2",
3520
+ "CCCC n1",
3521
+ "C(C S",
3522
+ "OC) C1",
3523
+ "OC(C) =O",
3524
+ "CC =C(",
3525
+ "NC(=O) OC",
3526
+ "CC[NH+] (",
3527
+ "c4ccccc 3",
3528
+ "3 CC4",
3529
+ "C= O",
3530
+ "C1= C(C)",
3531
+ "F)cc 2)c1",
3532
+ "CC [NH3+]",
3533
+ "CC c1cc(",
3534
+ "c 9",
3535
+ "O)cc 2)",
3536
+ ")cc 2)C1",
3537
+ "c1cc sc1",
3538
+ "ncc c1",
3539
+ "O [Si]",
3540
+ "c1n c(C)",
3541
+ "=[NH+] C",
3542
+ "c2ccc(C 3",
3543
+ "CCCC 2)c1",
3544
+ "c1 )N",
3545
+ "CCCC N",
3546
+ "(=O) C",
3547
+ "n c(Cl)",
3548
+ "- 3",
3549
+ "c1 c(C)cc",
3550
+ "Cc1cc2 c(",
3551
+ "C( OC2",
3552
+ "c4ccc5 c(",
3553
+ "2) c(C)c1",
3554
+ "[O-] )cc(",
3555
+ "C( CCCC",
3556
+ "CC1 CC1)",
3557
+ "cc (C(=O)",
3558
+ "cc2 c1",
3559
+ "CC1 CO",
3560
+ "CC2)c1 C",
3561
+ "N )cc1",
3562
+ ")ccc1 O",
3563
+ "C) n1",
3564
+ "O=C(C S",
3565
+ "(C) O)",
3566
+ "(Cl) c(C",
3567
+ "c1 c(N",
3568
+ "3) c2)",
3569
+ "C1 CCCC",
3570
+ "( OCC)",
3571
+ "CC1 CCN(",
3572
+ "c( =S)",
3573
+ "c(C 3",
3574
+ "c2n oc(C",
3575
+ "F)cc 4)",
3576
+ "C(C (C)C)",
3577
+ "(CCC )",
3578
+ "OC )cc(",
3579
+ "O)cc1 )",
3580
+ "cn1 )",
3581
+ "c2n [nH]",
3582
+ "(C) CCC",
3583
+ "1 )N",
3584
+ "c(Cl) c1)",
3585
+ "[O-]) n1",
3586
+ "C(C) =O",
3587
+ "c(=O) n(",
3588
+ "O=C(C n1",
3589
+ "OC )cc",
3590
+ "2) o1",
3591
+ "cc2 Cl)",
3592
+ "c3ccc nc3",
3593
+ "C( CC",
3594
+ "c(F) c3)",
3595
+ "CC c2c(",
3596
+ "CC(C) =",
3597
+ "c2n c3c(",
3598
+ "n3 cc",
3599
+ "cn 3",
3600
+ "O c2ccc",
3601
+ "o 2)CC1",
3602
+ "2) c(",
3603
+ "c(S C",
3604
+ "3) CC2",
3605
+ "C 6",
3606
+ "=O) C1",
3607
+ "ccc3 C)",
3608
+ "Cc1n n(",
3609
+ "nc( S",
3610
+ "O= c1",
3611
+ "=O) N",
3612
+ "s c3",
3613
+ "CCCO C1",
3614
+ "nn1 C",
3615
+ "CCC(C) C(",
3616
+ "n(C) c1",
3617
+ "O=C1 N",
3618
+ "c1ccc( S",
3619
+ "ncn c1",
3620
+ "2) N1",
3621
+ "F) n1",
3622
+ "CC= CC=",
3623
+ "c[nH] 1)",
3624
+ "CCC3 (CC",
3625
+ "(N) =S)",
3626
+ "[NH3+] )N",
3627
+ "e ]",
3628
+ "(=O) [nH]",
3629
+ "c1n n",
3630
+ "ncc 3",
3631
+ "(C c2ccc",
3632
+ "OCC 3",
3633
+ "s c(",
3634
+ "CCC1 )",
3635
+ "7 )",
3636
+ "cccc 3",
3637
+ "c5ccc 6",
3638
+ "CCC N2",
3639
+ "3) ccc2",
3640
+ "c2 =O)cc1",
3641
+ "C= CC(",
3642
+ "(C)C (C",
3643
+ "n2 cc(",
3644
+ "2) C2",
3645
+ "n2 nn",
3646
+ "C= CCN",
3647
+ ")N1 CCC(",
3648
+ "ccc n1",
3649
+ "CC1 CC(",
3650
+ "3CCCCC 3",
3651
+ "C [Si](C)",
3652
+ "Cn1 cc(C",
3653
+ "n2 n",
3654
+ "nn c2",
3655
+ "3)C2 )cc1",
3656
+ "nc1 N",
3657
+ "CC1 CCC(",
3658
+ "(C(=O)N 1",
3659
+ "B (O)",
3660
+ "CO C(=O)N",
3661
+ "CC3 )cc2",
3662
+ "N =N",
3663
+ "c5 cccc",
3664
+ "c1cc( N2",
3665
+ "ccccc 5",
3666
+ "(CC (O)",
3667
+ "4) C3",
3668
+ "cn 2)CC1",
3669
+ "3 CCC",
3670
+ "CO C(",
3671
+ "c1c( N)",
3672
+ "H ]",
3673
+ "(=O) o",
3674
+ "c2cn cc",
3675
+ "c -2",
3676
+ "C [NH3+])",
3677
+ "ccccc 8",
3678
+ "3) C2)",
3679
+ "CCC2 C3",
3680
+ "(=O) [O-]",
3681
+ "F)cc1 Cl",
3682
+ "CCCCC O",
3683
+ ")cc 5",
3684
+ "CC1 CCCC1",
3685
+ "CO CCC",
3686
+ "3 C(=O)",
3687
+ "N c1ccc",
3688
+ "OCO 4)",
3689
+ "CCC l)",
3690
+ "C(C c1cn",
3691
+ "c3ccc s",
3692
+ "c2s c(",
3693
+ "(CC (C)",
3694
+ "CCO CC2",
3695
+ "O= c1[nH]",
3696
+ "O c2ccccc",
3697
+ "CCCN (C",
3698
+ "c1ccc(C (",
3699
+ "C(F) F)",
3700
+ "c1) OCCO2",
3701
+ "c(Cl)cc 2",
3702
+ "s c1C",
3703
+ ")cc1 2",
3704
+ "c6 ccc",
3705
+ "c2cs c(",
3706
+ "c2cc( N)",
3707
+ "c4 c3",
3708
+ "c3 cs",
3709
+ "(CC l)",
3710
+ "c(C) cc1",
3711
+ "N# C",
3712
+ "C(O) C1O",
3713
+ "c2cn n(C)",
3714
+ "c2) OCCO",
3715
+ "c2)OCO 3)",
3716
+ "C(C 2",
3717
+ "F)cc 4",
3718
+ "4 CC4)",
3719
+ "c5 cc",
3720
+ "c3c( c2",
3721
+ "Cn1 cc",
3722
+ "CC2 CCCO",
3723
+ "C(C)C) c1",
3724
+ "[C -]",
3725
+ "Cc1 c(C)",
3726
+ "(=O) c1cc",
3727
+ "c1ccc( O)",
3728
+ "c2cc o",
3729
+ "[O-]) c2",
3730
+ ")cc1 )",
3731
+ "CC1 CCC(C",
3732
+ "c(C) c3)",
3733
+ "cc 5",
3734
+ "(C) c2",
3735
+ "ccc2 n1",
3736
+ "OC(F) F",
3737
+ "CCCC (C)C",
3738
+ "C OC(C",
3739
+ "[O-]) c3)",
3740
+ "[N+] 1",
3741
+ "n2 C",
3742
+ "Cc1cc( N2",
3743
+ "C NC(=O)N",
3744
+ "C(C)C 2",
3745
+ "CCCO 1)",
3746
+ "Cc1ccc( O",
3747
+ "CC(C) c1n",
3748
+ "n(C) n1",
3749
+ "CO CCn1",
3750
+ "C(C) CC",
3751
+ "Br )cc2",
3752
+ "Cl )N",
3753
+ "c1s ccc1",
3754
+ "(C)cc 3",
3755
+ "CC( =N",
3756
+ "C2 CCCCC",
3757
+ "c4cccc 5",
3758
+ "C) C(=O)",
3759
+ "S C(=C",
3760
+ "(C 3",
3761
+ "c(=O) n3",
3762
+ "CC(C #N)",
3763
+ "c(F) c2",
3764
+ "C1 CCC(",
3765
+ "(C) CC2",
3766
+ "C= C2",
3767
+ "2) N",
3768
+ "cc1 Cl)",
3769
+ "c1cn c(",
3770
+ "c1C #N",
3771
+ "N1 CCCC1",
3772
+ "=O) CC1",
3773
+ "CC(C) (",
3774
+ "CCCCC =",
3775
+ "2)cc1 C",
3776
+ "( c2ccc(",
3777
+ "(CCCC )",
3778
+ "C(C 1",
3779
+ "4) C2)",
3780
+ "c1 c(Cl)",
3781
+ "c(C (C)C)",
3782
+ "n2 C)",
3783
+ "CC c2ccc(",
3784
+ "CO CCN",
3785
+ "OC c1ccc(",
3786
+ "c(=O) n(C",
3787
+ "N (C)C",
3788
+ "3)C1) C2",
3789
+ "C( Br)",
3790
+ "c1n c(N)",
3791
+ "Br) c2)",
3792
+ "c(C) c(",
3793
+ "ccc2 Cl)",
3794
+ "nc2 s",
3795
+ "[nH] 3)",
3796
+ "CS CCC(",
3797
+ "c8 ccccc8",
3798
+ "CCC (C)C)",
3799
+ "CCCCC )",
3800
+ "(C) c(",
3801
+ "CO 1",
3802
+ "= NC(=O)",
3803
+ "OC) C(=O)",
3804
+ "CCCC CC)",
3805
+ "O) C(O)",
3806
+ "c2c( c1)",
3807
+ "n2 nc(",
3808
+ "c( F)cc2",
3809
+ "C2 CC3",
3810
+ "3)n 2)",
3811
+ "# [N+]",
3812
+ "c(C)c1 C",
3813
+ "Cc1cc sc1",
3814
+ "O 1)",
3815
+ "C2 CCCCC2",
3816
+ "s 2)C1",
3817
+ "F)cc cc3",
3818
+ "c(=O) n",
3819
+ "C1 CC",
3820
+ "[NH3+] C(",
3821
+ "C2 =O)c1",
3822
+ "(Cl) s1",
3823
+ "[n+] 2",
3824
+ "[O-] )cc3",
3825
+ "CC( S",
3826
+ "Cc1ccc o1",
3827
+ ")N1 CCCC1",
3828
+ "cc2 C)",
3829
+ "C(C) O)",
3830
+ "C1 (C)",
3831
+ "(C NC(=O)",
3832
+ "C1 CC1)",
3833
+ "O 3)",
3834
+ "CO c1c(",
3835
+ "Cc1n c(N",
3836
+ "( c2ccc",
3837
+ "N1 CCN",
3838
+ "(CC [NH+]",
3839
+ "(C) (C)C",
3840
+ "c1n c(Cl)",
3841
+ "c2cc (O)",
3842
+ "cs 2)cc1",
3843
+ "c2n oc(",
3844
+ "c1cc( O)",
3845
+ "nc2 )cc1",
3846
+ "ccccc2 3)",
3847
+ "2)CC1 )",
3848
+ "N1 CCN(",
3849
+ "(=O)N C",
3850
+ "O=C(C =C",
3851
+ "OC) c(C",
3852
+ "OC(F) F)",
3853
+ "n c4)",
3854
+ "c1cn n(",
3855
+ "(C) cc1",
3856
+ "S 2",
3857
+ "c1n c(C2",
3858
+ "c(C) c2)",
3859
+ "C1 =C(",
3860
+ "C OC(C)",
3861
+ "c3ccc( O",
3862
+ "CCC(C 2",
3863
+ ")ccc1 2",
3864
+ "ncn 2",
3865
+ "c1cn cc",
3866
+ "c3) OCO4)",
3867
+ "N2 CCN",
3868
+ "CC1 CC",
3869
+ "CC(C) N",
3870
+ "s 2)n1",
3871
+ "(=O)N (C)",
3872
+ "Nc1n cn",
3873
+ "c2cn 3",
3874
+ "(Cl) c3",
3875
+ "CCC1 CCCC",
3876
+ "CC( =C",
3877
+ "O c1ccc(C",
3878
+ "CC(O) C1",
3879
+ "= CN",
3880
+ "(=O)N C2",
3881
+ "[O-]) C(",
3882
+ "CCC OC",
3883
+ "(C)C) C2",
3884
+ "2)cc1 Cl",
3885
+ ")N c1ccc(",
3886
+ "c1n [nH]",
3887
+ ")ccc1 N",
3888
+ "cc( S(=O)",
3889
+ "CCO CC",
3890
+ "cn 2)c1",
3891
+ "c4cc 5",
3892
+ "3CCO CC3",
3893
+ "(C) =O)",
3894
+ "n nc(",
3895
+ "c2c( c1",
3896
+ "CN2 C(=O)",
3897
+ "C1 CC2",
3898
+ "2) C(=O)N",
3899
+ "NC( N",
3900
+ "c1ncc s1",
3901
+ "c2 C1",
3902
+ "cccc c12)",
3903
+ "CCc1n c(",
3904
+ "nn nc1",
3905
+ "c2c1 C",
3906
+ "3)n 2)c1",
3907
+ "c5 c(",
3908
+ "=C(N) N",
3909
+ "2) C(",
3910
+ "cn 3)",
3911
+ "CCC #N)",
3912
+ "c1ccc( N)",
3913
+ "c1n cc(C",
3914
+ "c3ccc o",
3915
+ "C2 C3",
3916
+ "c2n o",
3917
+ "c2C) CC1",
3918
+ "c1n o",
3919
+ "c2 c(Cl)",
3920
+ "Br) C1",
3921
+ "= CC2",
3922
+ "(C) n1",
3923
+ "= CC",
3924
+ "CCn1 cn",
3925
+ "CCCCC (",
3926
+ "OCC1 OC(",
3927
+ "cs1 )",
3928
+ "[O-]) C2",
3929
+ "=C (Cl)",
3930
+ "= CC1",
3931
+ "o c(=O)",
3932
+ "O c1ccc",
3933
+ "C( =S)",
3934
+ "C= CCCC",
3935
+ "cc1 )",
3936
+ "c3 =O)",
3937
+ "[nH] 2)c1",
3938
+ "(C 1",
3939
+ "(C) c3)",
3940
+ "c(F) c1)",
3941
+ "C= CC(C",
3942
+ "ccc( N",
3943
+ "C(=O) OC1",
3944
+ "CC) c1",
3945
+ "C12 CC3",
3946
+ "CCC (C(C)",
3947
+ "Cc1n nc(",
3948
+ "[O-]) c1)",
3949
+ "O=C(N CC1",
3950
+ "CCS C1",
3951
+ "c2)cc1 OC",
3952
+ "s 4)",
3953
+ "c2cc nc(N",
3954
+ "C(=O) C3",
3955
+ "c6 )",
3956
+ "(C (N)=O)",
3957
+ "[NH3+]) C",
3958
+ "= N)",
3959
+ "(CC CCC",
3960
+ "3)C2 =O)",
3961
+ "2) n",
3962
+ "CC OC(C)",
3963
+ "C(N) =S",
3964
+ "ccc3 Cl)",
3965
+ "ccccc3 4)",
3966
+ "CCCC CCO",
3967
+ "CC(=O) OC",
3968
+ "Cn1 ccnc1",
3969
+ "3 CC[NH+]",
3970
+ "(=O) O",
3971
+ "C2 1",
3972
+ "O=C( CC",
3973
+ "CCCCC 3",
3974
+ "F)cc 2)n1",
3975
+ "c2cc ncc2",
3976
+ "C1 (C)C",
3977
+ "C(= C)",
3978
+ "c2 c(N",
3979
+ "n2 n1",
3980
+ "c1cn c(N",
3981
+ "OCC) c1",
3982
+ "c(C O",
3983
+ "c1cn 2",
3984
+ "CC1 CCCO1",
3985
+ "n c2)c1",
3986
+ "c1 [nH+]",
3987
+ "c( F)cc1",
3988
+ "CC(O) CO",
3989
+ "ccc2 F)",
3990
+ "c3cc4 c(",
3991
+ "CC OC3",
3992
+ "(C c2ccc(",
3993
+ "c(C O)",
3994
+ "c2n c(=O)",
3995
+ "cc1 F",
3996
+ "C(=O)N CC",
3997
+ "n2 nc(C)",
3998
+ ")ccc1 Br",
3999
+ "N1 CCC(",
4000
+ "c4) CC3)",
4001
+ "c2n c(N)",
4002
+ "CO CCN1",
4003
+ "F)ccc1 F",
4004
+ "=[N-] )",
4005
+ "C3 )cc1",
4006
+ "c1ccc(C O",
4007
+ "c(C) c2",
4008
+ "cc( O)",
4009
+ "Cl) n1",
4010
+ "3) c2)cc1",
4011
+ "n c5",
4012
+ "c2C) c1",
4013
+ "O=C( CC1",
4014
+ "C(C) S",
4015
+ "( CCO",
4016
+ "N C1=O",
4017
+ "c2n cc(C",
4018
+ "#N) c1",
4019
+ "C1 CCCN",
4020
+ "c2n (",
4021
+ "N1 CC",
4022
+ "C(O) =C(",
4023
+ "Cc1n n",
4024
+ "N c1",
4025
+ "S C(C)",
4026
+ "CCC1 (C",
4027
+ "OCO 2",
4028
+ "CCCC CC=",
4029
+ "CC# CC#",
4030
+ "c1cc( N)",
4031
+ "F)cc2 F)",
4032
+ "2 )cc(",
4033
+ "(C)C (C)C",
4034
+ "cs 2)",
4035
+ "c3cc( OC)",
4036
+ "CCC2 1",
4037
+ "c1 =O)",
4038
+ "(CC OC)",
4039
+ "c2 3)",
4040
+ "C(=O) OC)",
4041
+ "(C)C )cc2",
4042
+ "c2n c3",
4043
+ "O O",
4044
+ "c2nnn n2",
4045
+ "3 )cc2)",
4046
+ "= CCCC",
4047
+ "(Cl)c1 Cl",
4048
+ "N# Cc1",
4049
+ "c(C N",
4050
+ "c oc(",
4051
+ "(C(C) (C)",
4052
+ "OC c1ccc",
4053
+ "C= CC2",
4054
+ "%1 0",
4055
+ "O=C(N C",
4056
+ "NC(=O)N (",
4057
+ "c4 ncc",
4058
+ "=C( S",
4059
+ "CCO P(=O)",
4060
+ "Cc1cs c(",
4061
+ "CC(C) O1",
4062
+ ")cc (C)c1",
4063
+ "c1ccc( N(",
4064
+ "CS C1",
4065
+ "CC2) nc1",
4066
+ "c4 )cc3",
4067
+ "c1cc (=O)",
4068
+ "N= [N+]",
4069
+ "C( c1ccc(",
4070
+ ")cc 5)",
4071
+ "(C(=O) C3",
4072
+ "N1 CCOCC1",
4073
+ "C c2cccc",
4074
+ "CC2 (",
4075
+ "nn (C)",
4076
+ "n2 cc(C",
4077
+ "c1n n(",
4078
+ "CCC1 (CC)",
4079
+ "C( O",
4080
+ "n 3)n",
4081
+ "o 2)C1",
4082
+ "Cl) C(=O)",
4083
+ "n3 ccc",
4084
+ "(C) c2)",
4085
+ "c( Br)cc1",
4086
+ "n2 c(C)",
4087
+ "Br) s1",
4088
+ "CC(O) (C",
4089
+ "C1 CC(",
4090
+ "nc(C) n1",
4091
+ "c6 ccc(",
4092
+ "C(C) (C",
4093
+ "F)cc 2)C1",
4094
+ "F) C(=O)",
4095
+ ")N1 CC",
4096
+ "(Cl) (Cl)",
4097
+ "c1cccc( O",
4098
+ "=O) cc2",
4099
+ "CC )cc1",
4100
+ "4 C)",
4101
+ "CC2 (CC",
4102
+ "c1 co",
4103
+ "C1 CCC2",
4104
+ "c2c( N)",
4105
+ "c2ccs c2)",
4106
+ "(Cl)cc 4)",
4107
+ "C[NH2+] 1",
4108
+ "c[nH] 1",
4109
+ "CCC 5",
4110
+ "c(=O) c3",
4111
+ "c1cc( OC",
4112
+ "CCCC CCC1",
4113
+ "c4 c3)",
4114
+ "CCO 2",
4115
+ "[NH2+] 1)",
4116
+ "C[NH2+] C",
4117
+ "c1 c[nH+]",
4118
+ "Br)cc1 )",
4119
+ "CCCCC (C)",
4120
+ "ccc2 C)",
4121
+ "CCn1 c(",
4122
+ "(C(=O) CC",
4123
+ "CN( S(=O)",
4124
+ "c(F) c3",
4125
+ "CC2 CCC",
4126
+ "= CC(",
4127
+ "N2 CC",
4128
+ "=[NH+] O",
4129
+ "cc c12",
4130
+ "C2 C1",
4131
+ "Cc1n c(C)",
4132
+ "(C(=O) CS",
4133
+ "OC) n1",
4134
+ "2)c1 =O",
4135
+ "c%1 0",
4136
+ "CO CC(C)",
4137
+ "3) CC2)c1",
4138
+ "c(N C2",
4139
+ "c3ccc( O)",
4140
+ "NC(=O)N C",
4141
+ "- c2ccc(",
4142
+ "n2 nc(C",
4143
+ "c2cc (=O)",
4144
+ "CCC( N)",
4145
+ "CCn1 c(S",
4146
+ "ncn c3",
4147
+ "CCC l",
4148
+ "c1n nc(C",
4149
+ "( c4ccccc",
4150
+ "F c1ccc(",
4151
+ "c3cc( Br)",
4152
+ "= Cc1ccc(",
4153
+ "c2n c(Cl)",
4154
+ "CCS 1",
4155
+ "C OC2",
4156
+ "S CC(=O)",
4157
+ "c2 [nH+]",
4158
+ "C(C) (O)",
4159
+ "COc1cc( N",
4160
+ "n2cc nc2)",
4161
+ "(C)C) c(",
4162
+ "2)c1 )",
4163
+ "(F) c3",
4164
+ "(F)(F) F",
4165
+ "CCC 4)",
4166
+ "c(Cl) c2",
4167
+ "[nH] n1",
4168
+ "n2 c(C",
4169
+ "(C2 CC2)",
4170
+ "C= CCC1",
4171
+ "N= C1",
4172
+ "OC1 2",
4173
+ "C4 CCCCC",
4174
+ "(=O) C2",
4175
+ "CCCC 2)C1",
4176
+ "OC) CC1",
4177
+ "O=C(N CC",
4178
+ "nc2 c(",
4179
+ "S1(=O) =O",
4180
+ "N# CC1",
4181
+ "O c3ccc(",
4182
+ "C(=O) C(C",
4183
+ "C3 O)",
4184
+ "F)cc1 )N",
4185
+ "CC2 (O)",
4186
+ "c3cc 2",
4187
+ "cn n1C",
4188
+ "nn 2)cc1",
4189
+ "Cc1ccc s1",
4190
+ "2) no1",
4191
+ "CC2 C1",
4192
+ ") C2",
4193
+ "O CC[NH+]",
4194
+ "S C(",
4195
+ "3 CCN(",
4196
+ "CCCC 4)",
4197
+ "CCC(C #N)",
4198
+ "O CC(",
4199
+ "(C(=O) CO",
4200
+ "n(C) c1=O",
4201
+ "[S e]",
4202
+ "c4ccc( OC",
4203
+ "F)cc3 F)",
4204
+ "n (CC)",
4205
+ "[S-] )",
4206
+ "3) C(=O)",
4207
+ "N# Cc1cc(",
4208
+ "CCCN C(N)",
4209
+ "2) nn1",
4210
+ "c(Cl)cc (",
4211
+ "Cc1cc nc(",
4212
+ "C(C) N",
4213
+ "o c1C",
4214
+ "cc1 OC)",
4215
+ "4) CC3",
4216
+ "c3ncc cc3",
4217
+ "cn c3",
4218
+ "CCC1 (C)",
4219
+ "c1c( O)",
4220
+ "Cc1n n(C",
4221
+ "CCC(C 1)",
4222
+ "c3 c[nH]",
4223
+ "(Cl)cc 4",
4224
+ "CCO c1cc",
4225
+ "CC( Br)",
4226
+ "CN( CC1",
4227
+ "c4cc (F)",
4228
+ "Cc1 c(N",
4229
+ "Cc1cc (F)",
4230
+ "C(C) CC)",
4231
+ "c3 o",
4232
+ "c2 c[nH+]",
4233
+ "[O-]) N",
4234
+ "OC) c2)",
4235
+ "C2 CCC1",
4236
+ "3) nc2",
4237
+ "Cc1ccc( S",
4238
+ "=O) ccc1",
4239
+ ")cc cc2",
4240
+ "CCS CC1",
4241
+ "N (C)C)",
4242
+ "c3n ccc",
4243
+ ")cc3) C2",
4244
+ "OC)cc1 )",
4245
+ "3) n1",
4246
+ "CC1 =N",
4247
+ "CC(C 1",
4248
+ "n1 cc",
4249
+ "2 CCN(",
4250
+ "CC (CC)",
4251
+ "(N N)",
4252
+ "(C) CC2)",
4253
+ "F)cc1 F)",
4254
+ "Br)cc (C",
4255
+ "Cn1 c(",
4256
+ "2)cc1 F",
4257
+ "Cc1n c(C2",
4258
+ "c1cc nc(N",
4259
+ "OCC (C)C)",
4260
+ "(C)C )cc(",
4261
+ "cs c1",
4262
+ "3) c(",
4263
+ "S S",
4264
+ "c2 c3c(",
4265
+ "CCCC 2)n1",
4266
+ "C# CCO",
4267
+ "c1cc oc1",
4268
+ "C(O) C2",
4269
+ "4 CC[NH+]",
4270
+ "(C) CC3)",
4271
+ "CC1 CCCC(",
4272
+ "c1 [nH]c(",
4273
+ "(F)c1 F",
4274
+ ")N (",
4275
+ "c1n c(=O)",
4276
+ "c2c( O)",
4277
+ "c2n n(",
4278
+ "CCC(C) C1",
4279
+ "2 C(",
4280
+ "C 3)n",
4281
+ "cn n2",
4282
+ "2) C(C)",
4283
+ "CC4 )cc3",
4284
+ "n o2)",
4285
+ "C2 (",
4286
+ "[O-]) cn1",
4287
+ "=C (C)C",
4288
+ "n 6",
4289
+ "c1n oc(",
4290
+ "c2cn n(",
4291
+ ")N1 CCN",
4292
+ "N2 CCCC2",
4293
+ ")cc (=O)",
4294
+ "o 2)n1",
4295
+ "OC) c3)",
4296
+ "c5 cc(",
4297
+ "c1 O",
4298
+ "Cc1 c[nH]",
4299
+ "OCC (C)C",
4300
+ "2 CC[NH+]",
4301
+ "O= S1(=O)",
4302
+ "C(C) (",
4303
+ "[Si] (",
4304
+ "c2cc1 OC",
4305
+ "c(C) s1",
4306
+ "c[nH+] 1",
4307
+ ")cc 3)n",
4308
+ ") C",
4309
+ "c -3",
4310
+ "CC2 (C",
4311
+ ")ccc1 C",
4312
+ "N3 C(=O)",
4313
+ "N c1cn",
4314
+ "CC1 CN",
4315
+ "Br) c1)",
4316
+ "CC4 )cc3)",
4317
+ "OC) c2",
4318
+ "3 CCN",
4319
+ "C1 (O)",
4320
+ "nn (",
4321
+ "=O)cc 2)",
4322
+ "(C) CCO",
4323
+ "c4 nc(",
4324
+ "c(N S(=O)",
4325
+ "c2nn [n-]",
4326
+ "ccc 6",
4327
+ "c3n nc(",
4328
+ "c2cn c(N",
4329
+ "c o1",
4330
+ "4) C3)",
4331
+ "c(C [NH+]",
4332
+ "CCn1 cc",
4333
+ ")cc1 F",
4334
+ "c2n nc3",
4335
+ "OCC O)",
4336
+ "3) n2",
4337
+ "n2cc nc2",
4338
+ "c2cn (C)",
4339
+ "c1ncn 2",
4340
+ "C1 CCC1",
4341
+ "#N )cc2",
4342
+ "c2cc nn2",
4343
+ "N1 S(=O)",
4344
+ "[C H]",
4345
+ "C2 CC2)c1",
4346
+ "CCC12 C",
4347
+ "c6ccc 7",
4348
+ "n o1)",
4349
+ "C2 (C)",
4350
+ "Nc1n c(",
4351
+ "2 CC3",
4352
+ "c2 co",
4353
+ "c1cc s",
4354
+ "CC2) cn1",
4355
+ "c1cs c(C",
4356
+ "C(C) =",
4357
+ "CCCN (CCC",
4358
+ "c(N) n",
4359
+ "Cc1 c(Cl)",
4360
+ "o c2c1",
4361
+ "F)cc 3)n",
4362
+ "c1) C(",
4363
+ "CC1 CC2",
4364
+ "C2 CCN",
4365
+ "c(N) n1",
4366
+ ") C(C)C",
4367
+ "(CC =C)",
4368
+ "CO CC(",
4369
+ "C(O) C(",
4370
+ "c2ccc( I",
4371
+ "[O-]) c(N",
4372
+ "c1) CCC2",
4373
+ "c(OC) c3)",
4374
+ ") C(",
4375
+ "2) O1",
4376
+ "c2cn [nH]",
4377
+ "CCn1 cc(",
4378
+ "C= CCN1",
4379
+ "c1) OCCO",
4380
+ "C) CC3)",
4381
+ "CCC1 =O",
4382
+ "OCCO 4)",
4383
+ "CN c1n",
4384
+ "(CC 3",
4385
+ "s c(N",
4386
+ "2) cs1",
4387
+ "C1 (C(=O)",
4388
+ "CCC1 O",
4389
+ "(C)C )cc3",
4390
+ "c1cc nc(",
4391
+ "= N1",
4392
+ "Cn1 nccc1",
4393
+ "C1 =N",
4394
+ "O=C( OC",
4395
+ "c1c( Br)",
4396
+ "c4ccc( N",
4397
+ "c3ccc( n4",
4398
+ "c1s c(",
4399
+ "CO c1c(C)",
4400
+ "CS c1ccc(",
4401
+ "c1ccc2 n",
4402
+ "3)CC2) n1",
4403
+ "CC(O) C(",
4404
+ "c2ccc( N)",
4405
+ "c4cc ncc",
4406
+ "CCn1 cc(C",
4407
+ "C1 =C",
4408
+ "[O-]) s1",
4409
+ "(=O) CC1",
4410
+ "(CC N",
4411
+ "n3 C)",
4412
+ "c2cc n",
4413
+ "CC(C)C n1",
4414
+ "c1cc n",
4415
+ ")cc( N",
4416
+ "2 CCC",
4417
+ "ccc2 o1",
4418
+ "CO c1cn",
4419
+ "c(=O) c(",
4420
+ "Cc1n cc",
4421
+ "OC 5",
4422
+ "c4ccc o",
4423
+ "O c2ccc(C",
4424
+ "(=O)N CC",
4425
+ "CC3 CCC2",
4426
+ "CCO c1cc(",
4427
+ "Nc1n c(N",
4428
+ "o c2",
4429
+ "CC2) s1",
4430
+ "nn 2)",
4431
+ "[NH2+] C3",
4432
+ "cc2 c(",
4433
+ "Cn1 nc(",
4434
+ "cc2 F)",
4435
+ "n2cn c3",
4436
+ "CCCC N(",
4437
+ "c1)OCO 2)",
4438
+ "(C c2cccc",
4439
+ "C( c2ccc",
4440
+ "CC2)cc1 C",
4441
+ "c3ccc(C 4",
4442
+ "C2 CCCC2",
4443
+ "CC( OC)",
4444
+ "c(C =O)",
4445
+ "C(C#N) =C",
4446
+ "CC c1c(C)",
4447
+ "N c1cccc(",
4448
+ "- 4",
4449
+ "c2ccc( n3",
4450
+ "Br) c2",
4451
+ "CC1 CCCN",
4452
+ "c3cc s",
4453
+ "Br) CC1",
4454
+ ") [NH+]1",
4455
+ "[O-]) cn",
4456
+ "C#N )cc1",
4457
+ "c( I)",
4458
+ "F)cc2 1",
4459
+ "c(C#N) c1",
4460
+ "C[NH+] (",
4461
+ "[NH+]2 CC",
4462
+ "c3 )cc2",
4463
+ "c2cn c(",
4464
+ "cc (F)",
4465
+ "c2) cn1",
4466
+ "c(OC )cc2",
4467
+ "C2 (C)C",
4468
+ "N1 CCCCC1",
4469
+ "CC(C)C (C",
4470
+ "B (",
4471
+ "CC(=O)N C",
4472
+ "=S )N1",
4473
+ "cc cc2",
4474
+ "O) C1",
4475
+ "F)cc (F)",
4476
+ "c3 4)",
4477
+ "C= CCC",
4478
+ "(C) c(C",
4479
+ "N1 C",
4480
+ "c1ccc(N C",
4481
+ "(C(=O) C(",
4482
+ "Cc1cn 2",
4483
+ "c(C) cc2",
4484
+ "O) ccc1",
4485
+ "c(C) o1",
4486
+ ") c1ccc",
4487
+ "c1cn n2",
4488
+ "CC3 )cc2)",
4489
+ "C= C=",
4490
+ ")N (C)",
4491
+ "O P",
4492
+ "[NH+] (CC",
4493
+ "C1 CO",
4494
+ "C2 C",
4495
+ "CC #N)",
4496
+ "[O-]) c(C",
4497
+ "( =S)",
4498
+ "CCCC( N",
4499
+ "F C(F)",
4500
+ "Cc1n ccn1",
4501
+ "CC c2cc(",
4502
+ "3 c(",
4503
+ "ccc nc2",
4504
+ "CCC1 C",
4505
+ "S c2n",
4506
+ "C= C(C#N)",
4507
+ "CO c1n",
4508
+ "O) CC1",
4509
+ "[nH] c(C",
4510
+ ")N (C)C",
4511
+ "5) ccc4",
4512
+ "4)C2) C3)",
4513
+ ")cc1 Cl",
4514
+ "Br) c(",
4515
+ "(F) F",
4516
+ "cc1 F)",
4517
+ "4CCCCC 4)",
4518
+ "C =O)",
4519
+ "5 )cc4",
4520
+ "OC1 (C)C",
4521
+ "5) c4",
4522
+ "C c3ccccc",
4523
+ "CC c2ccc",
4524
+ "c1ccc(C #",
4525
+ ")cc1 C",
4526
+ "O=C( CCC"
4527
+ ]
4528
+ }
4529
+ }
tokenizer_config.json ADDED
@@ -0,0 +1,52 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "<pad>",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "1": {
12
+ "content": "<s>",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "2": {
20
+ "content": "<unk>",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "3": {
28
+ "content": "<mask>",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "2260": {
36
+ "content": "</s>",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "bos_token": "<s>",
45
+ "clean_up_tokenization_spaces": true,
46
+ "eos_token": "</s>",
47
+ "mask_token": "<mask>",
48
+ "model_max_length": 512,
49
+ "pad_token": "<pad>",
50
+ "tokenizer_class": "PreTrainedTokenizerFast",
51
+ "unk_token": "<unk>"
52
+ }