2017
DOI: 10.48550/arxiv.1703.00564
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

MoleculeNet: A Benchmark for Molecular Machine Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
51
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 32 publications
(51 citation statements)
references
References 9 publications
0
51
0
Order By: Relevance
“…Some of these can be achieved by using all the physical, chemical and structural properties [31], which all together are rarely well documented so getting this information is considered cumbersome task. Over time, this has been tackled by using several alternative approaches that work well for specific problems [32,33,34,35,36,37]. However, obtaining robust representations of molecules for diverse machine learning problems is still a challenging task and any gold standard method that works consistently for all kind of problems is yet to be known.…”
Section: Data Generation and Molecular Representationmentioning
confidence: 99%
“…Some of these can be achieved by using all the physical, chemical and structural properties [31], which all together are rarely well documented so getting this information is considered cumbersome task. Over time, this has been tackled by using several alternative approaches that work well for specific problems [32,33,34,35,36,37]. However, obtaining robust representations of molecules for diverse machine learning problems is still a challenging task and any gold standard method that works consistently for all kind of problems is yet to be known.…”
Section: Data Generation and Molecular Representationmentioning
confidence: 99%
“…Following an identical approach as the original Chemception paper, [9] we obtained the Tox21, HIV, and Free-Solv datasets (Table 1) from the MoleculeNet benchmark database, [30] to evaluate the performance of Chemception for predicting toxicity, activity and solvation free energy respectively. These datasets comprise of a mix of large vs small datasets, physical vs non-physical properties and regression vs classification problems.…”
Section: Dataset Descriptionmentioning
confidence: 99%
“…The baseline model for comparison is the multilayer perceptron (MLP) deep neural network model (random data splitting) reported in the Molecu-leNet benchmark. [30]. In addition, we also compare to a novel convolutional graph algorithm (ConvGraph).…”
Section: Augchemception Performance Against Contemporary Modelsmentioning
confidence: 99%
“…60,[67][68][69][70][71][72] Since graph theory offers a nature representation of molecular structure, it is a common approach for analyzing chemical datasets 56,[73][74][75][76][77] and biomolecular datasets. 60,[78][79][80][81][82][83] Although there was much effort in constructing various graph representations in the past, graph based quantitative models are often less accurate than other competitive models in the analysis and prediction of biomolecular properties from massive and diverse datasets. Indeed, in the protein stability changes upon mutation analysis, the other models 23,52,84 are more accurate than the graph-based approach.…”
Section: Introductionmentioning
confidence: 99%