2021
DOI: 10.11591/ijece.v11i3.pp2525-2534
|View full text |Cite
|
Sign up to set email alerts
|

A smart method for spark using neural network for big data

Abstract: Apache spark, famously known for big data handling ability, is a distributed open-source framework that utilizes the idea of distributed memory to process big data. As the performance of the spark is mostly being affected by the spark predominant configuration parameters, it is challenging to achieve the optimal result from spark. The current practice of tuning the parameters is ineffective, as it is performed manually. Manual tuning is challenging for large space of parameters and complex interactions with an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 24 publications
0
2
0
Order By: Relevance
“…It has the advantage of having an automatic data balancing and a distributed design, making it a good choice for big data analysis. A collection of dominating people in occupations for numerous machine learning tasks, such as classification, regression, base compilation and extraction (and dimensional reduction), is introduced by Apache Spark [47]- [51]. Despite the fact that numerous research have been conducted on machine learning and its usefulness, ML libraries for big data analysis, such as Apache Spark MLlib, have received little attention.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…It has the advantage of having an automatic data balancing and a distributed design, making it a good choice for big data analysis. A collection of dominating people in occupations for numerous machine learning tasks, such as classification, regression, base compilation and extraction (and dimensional reduction), is introduced by Apache Spark [47]- [51]. Despite the fact that numerous research have been conducted on machine learning and its usefulness, ML libraries for big data analysis, such as Apache Spark MLlib, have received little attention.…”
Section: Introductionmentioning
confidence: 99%
“…Lunga et al [85] proposed a framework that makes use of Spark's distributed computing capabilities as well as deep learning architecture for multiple layers perceptron (MLP) using cascade learning to train multiple layers perceptrons is proposed. A framework for in-depth training learning models with Apache Spark has been created and developed in [47], [48], [50], [51], [57], [69], [86]- [92]. This framework shortens the training time by taking advantage of the advantages of both data and parity modeling at the same time.…”
Section: Introductionmentioning
confidence: 99%