2019
DOI: 10.1093/bioinformatics/btz521
|View full text |Cite
|
Sign up to set email alerts
|

MaNGA: a novel multi-niche multi-objective genetic algorithm for QSAR modelling

Abstract: Summary Quantitative structure–activity relationship (QSAR) modelling is currently used in multiple fields to relate structural properties of compounds to their biological activities. This technique is also used for drug design purposes with the aim of predicting parameters that determine drug behaviour. To this end, a sophisticated process, involving various analytical steps concatenated in series, is employed to identify and fine-tune the optimal set of predictors from a large dataset of mo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
7
2

Relationship

3
6

Authors

Journals

citations
Cited by 18 publications
(15 citation statements)
references
References 42 publications
0
15
0
Order By: Relevance
“…To critically examine the predictivity of models MO59 and CO15, we compared their Williams plots [ 35 , 36 ], as presented in Figure 4 . As expected, model CO15 had a larger number (129 with h * = 0.0399) of structural outliers as compared to model MO59 (25 with h * = 0.0533).…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…To critically examine the predictivity of models MO59 and CO15, we compared their Williams plots [ 35 , 36 ], as presented in Figure 4 . As expected, model CO15 had a larger number (129 with h * = 0.0399) of structural outliers as compared to model MO59 (25 with h * = 0.0533).…”
Section: Resultsmentioning
confidence: 99%
“…To do so, we built the so-called Williams plot, in which standardized residuals were plotted against leverage values. Doing so permitted us to identify response and structural outliers [ 35 , 36 ]. All plots shown in the present work were conceived with Matplotlib [ 37 ].…”
Section: Methodsmentioning
confidence: 99%
“…Due to the homogenized preprocessing and manual curation of the metadata, this collection is a relevant resource for identification of toxicity biomarkers. This can be addressed by using multiple feature selection approaches 35,36 or more advanced data modelling techniques [37][38][39] . Biomarkers could also be detected by means of gene co-expression network analysis, under the assumption that central network genes play a key role in the adaptation to the exposure 40,41 .…”
Section: Usage Notesmentioning
confidence: 99%
“…Recently, a new methodology for feature selection from complex data, called MaNGA has been proposed [114]. MaNGA uses a multi-objective optimization strategy to identify the minimum set of predictive features with the widest AD, better predictivity capability and high stability.…”
Section: Stability and Applicability Domainmentioning
confidence: 99%