2021
DOI: 10.3390/app11031323
|View full text |Cite
|
Sign up to set email alerts
|

Cocrystal Prediction Using Machine Learning Models and Descriptors

Abstract: Cocrystals are of much interest in industrial application as well as academic research, and screening of suitable coformers for active pharmaceutical ingredients is the most crucial and challenging step in cocrystal development. Recently, machine learning techniques are attracting researchers in many fields including pharmaceutical research such as quantitative structure-activity/property relationship. In this paper, we develop machine learning models to predict cocrystal formation. We extract descriptor value… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
32
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 32 publications
(37 citation statements)
references
References 62 publications
0
32
0
Order By: Relevance
“…Quantitative structure–property relationships (QSPR) models are frequently developed using molecular descriptors, and PaDEL is amongst the attractive and well-known tools to extract descriptors [ 33 ]. There are various tools used in cheminformatics [ 31 ] such as Mordred [ 37 ], PyDPI [ 38 ], Rcpi [ 39 ], Dragon [ 40 ], and cinfony [ 41 ], which is a collection or a wrapper of other libraries such as Open Babel [ 42 ], RDKit [ 31 ] ( (accesssed on 22 June 2021)), and Chemistry Development Kit (CDK) [ 43 ]. We decided to utilize PaDEL because of its advantages: it provides approximately 1875 molecular descriptors within a brief execution time, and it is simple to install and utilize.…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…Quantitative structure–property relationships (QSPR) models are frequently developed using molecular descriptors, and PaDEL is amongst the attractive and well-known tools to extract descriptors [ 33 ]. There are various tools used in cheminformatics [ 31 ] such as Mordred [ 37 ], PyDPI [ 38 ], Rcpi [ 39 ], Dragon [ 40 ], and cinfony [ 41 ], which is a collection or a wrapper of other libraries such as Open Babel [ 42 ], RDKit [ 31 ] ( (accesssed on 22 June 2021)), and Chemistry Development Kit (CDK) [ 43 ]. We decided to utilize PaDEL because of its advantages: it provides approximately 1875 molecular descriptors within a brief execution time, and it is simple to install and utilize.…”
Section: Methodsmentioning
confidence: 99%
“…We have implemented various ML models such as artificial neural network (ANN), support vector machine (SVM) [ 46 ], random forest (RF) [ 47 ], extreme gradient boost (XGB) [ 48 ], and Logistic Regression (LR) [ 49 ]. The ANN is recognized to be useful in a variety of research fields, including image analysis, natural language processing, and speech recognition; if it has a deep structure, it is a deep learning model (i.e., multiple hidden layers) [ 31 ]. The SVM is known to be successful in many classification applications and tasks [ 50 ], and it identifies a decision boundary based on boundary examples or instances (i.e., support vectors).…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations