Screening of “ drug-like” molecule from the molecular database produced through high throughput techniques and their large
repositories requires robust classification. In our work, a set of heuristically chosen nine molecular descriptors including four
from Lipinski's rule, were used as classification parameter for screening “drug-like” molecules. The robustness of
classification was compared with four fundamental descriptors of Lipinski. Back propagation neural network based classifier
was applied on a database of 60000 molecules for classification of, “ drug-like” and “non drug-like” molecules. Classification
result using nine descriptors showed high classification accuracy of 96.1% in comparison to that using four Lipinski's
descriptors which yielded an accuracy of 82.48%. Also a significant decrease of false positives resulted while using nine
descriptors causing a sharp 18% increase of specificity of classification. From this study it appeared that Lipinski's
descriptors which mainly deal with pharmacokinetic properties of molecules form the basis for identification of “drug-like”
molecules that can be substantially improved by adding more descriptors representing pharmacodynamics properties of
molecules.
Current work targeted to predicate parametric relationship between aggregate and individual property of a protein. In this approach, we considered individual
property of a protein as its Surface Roughness Index (SRI) which was shown to have potential to classify SCOP protein families. The bulk property was however
considered as Intensity Level based Multi-fractal Dimension (ILMFD) of ordinary microscopic images of heat denatured protein aggregates which was known to
have potential to serve as protein marker. The protocol used multiple ILMFD inputs obtained for a protein to produce a set of mapped outputs as possible SRI
candidates. The outputs were further clustered and largest cluster centre after normalization was found to be a close approximation of expected SRI that was
calculated from known PDB structure. The outcome showed that faster derivation of individual protein’s surface property might be possible using its bulk form,
heat denatured aggregates.
Online retrieval of the homologous nucleotide sequences through existing alignment techniques is a common practice against the given database of sequences. The salient point of these techniques is their dependence on local alignment techniques and scoring matrices the reliability of which is limited by computational complexity and accuracy. Toward this direction, this work offers a novel way for numerical representation of genes which can further help in dividing the data space into smaller partitions helping formation of a search tree. In this context, this paper introduces a 36-dimensional Periodicity Count Value (PCV) which is representative of a particular nucleotide sequence and created through adaptation from the concept of stochastic model of Kolekar et al. (American Institute of Physics 1298:307-312, 2010. doi: 10.1063/1.3516320 ). The PCV construct uses information on physicochemical properties of nucleotides and their positional distribution pattern within a gene. It is observed that PCV representation of gene reduces computational cost in the calculation of distances between a pair of genes while being consistent with the existing methods. The validity of PCV-based method was further tested through their use in molecular phylogeny constructs in comparison with that using existing sequence alignment methods.
Study on geometric properties of nanoparticles and their relation with biomolecular activities, especially protein is quite a new
field to explore. This work was carried out towards this direction where images of gold nanoparticles obtained from transmission
electron microscopy were processed to extract their size and area profile at different experimental conditions including and
excluding a protein, citrate synthase. Since the images were ill-posed, texture of a context-window for each pixel was used as input
to a back-propagation network architecture to obtain decision on its membership as nanoparticle. The segmented images were
further analysed by k-means clustering to derive geometric properties of individual nanoparticles even from their assembled form.
The extracted geometric information was found to be crucial to give a model featuring porous cage like configuration of
nanoparticle assembly using which the chaperone like activity of gold nanoparticles can be explained.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.