2020
DOI: 10.19101/ijacr.2019.940150
|View full text |Cite
|
Sign up to set email alerts
|

Traditional machine learning and big data analytics in virtual screening: a comparative study

Abstract: An unprecedented development in biomedical data has been observed in latest years. The capability to analyze a large portion of this data will offer many opportunities that will in turn affect the future of health care [1]. In this age, traditional storage and processing techniques are not sufficient to meet the demand and hence, computing techniques must scale to handle the huge volume of data. The main difficulty in managing these data is the speed at which they are generated, that is, data generation is muc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 13 publications
(4 citation statements)
references
References 54 publications
0
4
0
Order By: Relevance
“…Spark can compare the cost of different methods and automatically select the optimal data recovery method. Method and the equivariant method are relatively stable, and the AST method has a certain degree of improvement in F1 value compared with the equivariant method [19].…”
Section: Experimental Analysis Of the Feature Large Datasetmentioning
confidence: 99%
“…Spark can compare the cost of different methods and automatically select the optimal data recovery method. Method and the equivariant method are relatively stable, and the AST method has a certain degree of improvement in F1 value compared with the equivariant method [19].…”
Section: Experimental Analysis Of the Feature Large Datasetmentioning
confidence: 99%
“…K-mean clustering is one of the simplest nonsupervised learning algorithms, which was first proposed by Macqueen in 1967. It has been applied by many researchers to solve some of the problems of known groups [28]. is technique classifies a particular dataset into a certain number of groups.…”
Section: Related Workmentioning
confidence: 99%
“…Clustering, as a subdomain of unsupervised machine learning, entails the grouping of data points into clusters or categories based on shared characteristics, thereby allowing for the exploration of latent patterns and the identification of homogeneous subsets within the data [8][9][10]. This fundamental process has found applications across diverse domains, including but not limited to marketing, healthcare, finance, and scientific research [11,12]. Its importance has been further accentuated with the advent of big data, where traditional data processing techniques prove inadequate in the face of sheer volume, velocity, and variety [13][14][15][16][17].…”
Section: Introductionmentioning
confidence: 99%