2020
DOI: 10.1007/978-1-0716-0150-1_32
|View full text |Cite
|
Sign up to set email alerts
|

alvaDesc: A Tool to Calculate and Analyze Molecular Descriptors and Fingerprints

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
133
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 196 publications
(134 citation statements)
references
References 56 publications
1
133
0
Order By: Relevance
“…The internal database contains approximately 200k unique structures with wSFLogD data and approximately 80k unique structures with wELogD data. Models on both datasets are constructed using descriptors generated by MoKa (Molecular Discovery Ltd 2020) , alvaDesc (Mauri 2020), and counts of a set of internally developed SMARTS fragments (calculated using OpenEye OEChemTK (OpenEye Scientific Software 2020). For a MoKa pKa prediction where no value is given (e.g., neutrals), the value of ApKa is set to 14 and BpKa is set to 0.…”
Section: Methodsmentioning
confidence: 99%
“…The internal database contains approximately 200k unique structures with wSFLogD data and approximately 80k unique structures with wELogD data. Models on both datasets are constructed using descriptors generated by MoKa (Molecular Discovery Ltd 2020) , alvaDesc (Mauri 2020), and counts of a set of internally developed SMARTS fragments (calculated using OpenEye OEChemTK (OpenEye Scientific Software 2020). For a MoKa pKa prediction where no value is given (e.g., neutrals), the value of ApKa is set to 14 and BpKa is set to 0.…”
Section: Methodsmentioning
confidence: 99%
“…We used alvaDesc to calculate molecular descriptors [ 35 ]. The alvaDesc software can be used to calculate various molecular descriptors and molecular fingerprints.…”
Section: Methodsmentioning
confidence: 99%
“…The dataset was first randomly divided into a training set containing 3867 data points (70% of the data) and a validation set containing 1656 data-points (30% of the data) using the random division technique of QSAR-Co tool [22]. A total of 5305 input descriptors (D i ) were calculated for the training set by employing the alvaDesc tool [38], and these descriptors were subsequently converted to 15,915 deviation descriptors with the help of QSAR-Co tool [22]. It must be noted here that all models were set up solely on the basis of the training set, which was further split into a sub-training set containing 2707 data points (70% of the training set) and a test set containing 1160 data points (30% of the training set) by applying the random division technique of QSAR-Co tool [22] for models' development purposes.…”
Section: Linear Interpretable Mt-qsar Modelsmentioning
confidence: 99%
“…The topological distance between Firstly, it is noteworthy that all the experimental elements considered in this work (i.e., a t , m e , and b t ) consistently appeared in the final LDA models demonstrating their importance (Table 4). To set up the models, 30 distinct categories of descriptors (available in alvaDesc [38]) were considered but only 10 persisted. The latter pertained more frequently to 2D atom pairs (pairs of atoms at a given topological distance) and functional group counts descriptors, as well as to atom-centered fragments [49].…”
Section: Interpretation Of Molecular Descriptorsmentioning
confidence: 99%
See 1 more Smart Citation