2014
DOI: 10.1145/2674026.2674028
|View full text |Cite
|
Sign up to set email alerts
|

Open challenges for data stream mining research

Abstract: Every day, huge volumes of sensory, transactional, and web data are continuously generated as streams, which need to be analyzed online as they arrive. Streaming data can be considered as one of the main sources of what is called big data. While predictive modeling for data streams and big data have received a lot of attention over the last decade, many research approaches are typically designed for well-behaved controlled problem settings, overlooking important challenges imposed by real-world applications. T… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
107
0
3

Year Published

2015
2015
2022
2022

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 254 publications
(118 citation statements)
references
References 37 publications
0
107
0
3
Order By: Relevance
“…By using our generative stream model as a tailored feature to describe stream patterns we believe that our organization model will be further improved. It is worthwhile to mention that there are several academic works based on stream prediction and mining [16], but the same cannot be said about stream similarity and stream characterization. Further work needs to be done to assert some ideas expressed in this paper, but our stream characterization model appears to be a viable option.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…By using our generative stream model as a tailored feature to describe stream patterns we believe that our organization model will be further improved. It is worthwhile to mention that there are several academic works based on stream prediction and mining [16], but the same cannot be said about stream similarity and stream characterization. Further work needs to be done to assert some ideas expressed in this paper, but our stream characterization model appears to be a viable option.…”
Section: Resultsmentioning
confidence: 99%
“…While there are several academic works based on stream prediction and mining [16], the same can not be said about stream similarity. Most methods are based on longest common sub-sequence algorithm [17], [18].…”
Section: Background and Related Workmentioning
confidence: 99%
“…When applying machine-learning techniques for M2M communication unexpected problems may arise due to the complexity and processing time added by the scale of applications. The large scale makes trivial operations costly in terms of computation time and memory requirement and forces the system designers to reconsider the applied machine learning algorithms since the cost of learning arises primarily from bandwidth and disk reads [25,40]. In this section, a literature review is presented to discuss the challenges in data mining and machine learning for M2M applications.…”
Section: Research Challengesmentioning
confidence: 99%
“…A naive solution of the delayed labeling problem will be requesting labels immediately at the time of concept drift [15]. Then the framework waits for the labeling process to finish before building any updated models.…”
Section: Introductionmentioning
confidence: 99%