“…A number of previous studies investigated quantitative AD: (1) range setting of each variable or latent variable after dimensional reduction [21] by Principal Component Analysis (PCA) [22], (2) distance from neighboring point [23] using kNN [20], (3) data density estimation [24] by OneClassSVM (OCSVM) [25], and (4) variation of predicted values [26] by ensemble learning [8,9]. [2,3,4,5,6,7,8,9,10,15,20] In this study, we proposed to use Tanimoto distance of binary unhashed Morgan fingerprint (radius = 4) with the training data as the AD estimation model. First, Tanimoto distances were calculated for each compound against all compounds in the training data.…”