2017
DOI: 10.14419/ijet.v7i1.3.9225
|View full text |Cite
|
Sign up to set email alerts
|

Technical challenges and perspectives in batch and stream big data machine learning

Abstract: Machine Learning is playing a predominant role across various domains. However traditional Machine Learning algorithms are becoming unsuitable for majority of applications as the data is acquiring new characteristics. Sensors, devices, servers, Internet, Social Networking, Smart phones and Internet of Things are contributing the major sources of data. Hence there is a paradigm shift in the Machine learning with the advent of Big Data. Research works are in evolution to deal with Big Data Batch and stream real … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 8 publications
0
1
0
Order By: Relevance
“…It is popular because of negligible training time requirement. In this technique the algorithm trains itself with every input value passed for the classification, hence no separate training phase is required resulting in faster execution of the algorithm [18]. During classification of the input data, the closeness of the data item to be classified is measured as a distance with other classified data items of different classes and the data item will be classified to the class, where the distance is minimum with more number of data points in a particular class.…”
Section: K Nearest Neighbourmentioning
confidence: 99%
“…It is popular because of negligible training time requirement. In this technique the algorithm trains itself with every input value passed for the classification, hence no separate training phase is required resulting in faster execution of the algorithm [18]. During classification of the input data, the closeness of the data item to be classified is measured as a distance with other classified data items of different classes and the data item will be classified to the class, where the distance is minimum with more number of data points in a particular class.…”
Section: K Nearest Neighbourmentioning
confidence: 99%
“…It consists of HDFS as the distributed storage and MapReduce as a parallel processing framework. It can handle the big data challenges in the cluster of commodity hardware [9], [10]. In this paper, we present a model to collect data from multiple databases to HDFS for the distributed storage.…”
mentioning
confidence: 99%