Traditional drug
production is a long and complex process that
leads to new drug production. The virtual screening technique is a
computational method that allows chemical compounds to be screened
at an acceptable time and cost. Several databases contain information
on various aspects of biologically active substances. Simple statistical
tools are difficult to use because of the enormous amount of information
and complex data samples of molecules that are structurally heterogeneous
recorded in these databases. Many techniques for capturing the biological
similarity between a test compound and a known target ligand in LBVS
have been established. However, despite the good performances of the
above methods compared to their prior, especially when dealing with
molecules that have homogeneous active structural elements, they are
not satisfied when dealing with molecules that are structurally heterogeneous.
Deep learning models have recently achieved considerable success in
a variety of disciplines due to their powerful generalization and
feature extraction capabilities. Also, the Siamese network has been
used in similarity models for more complicated data samples, especially
with heterogeneous data samples. The main aim of this study is to
enhance the performance of similarity searching, especially with molecules
that are structurally heterogeneous. The Siamese architecture will
be enhanced using two similarity distance layers with one fusion layer
to further improve the similarity measurements between molecules and
then adding many layers after the fusion layer for some models to
improve the retrieval recall. In this architecture, several methods
of deep learning have been used, which are long short-term memory
(LSTM), gated recurrent unit (GRU), convolutional neural network-one
dimension (CNN1D), and convolutional neural network-two dimensions
(CNN2D). A series of experiments have been carried out on real-world
data sets, and the results have shown that the proposed methods outperformed
the existing methods.