2008
DOI: 10.1002/sam.10006
|View full text |Cite
|
Sign up to set email alerts
|

Distributed Decision‐Tree Induction in Peer‐to‐Peer Systems

Abstract: This paper offers a scalable and robust distributed algorithm for decision tree induction in large Peer-to-Peer (P2P) environments. Computing a decision tree in such large distributed systems using standard centralized algorithms can be very communication-expensive and impractical because of the synchronization requirements. The problem becomes even more challenging in the distributed stream monitoring scenario where the decision tree needs to be updated in response to changes in the data distribution. This pa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
57
0

Year Published

2008
2008
2019
2019

Publication Types

Select...
5
4

Relationship

4
5

Authors

Journals

citations
Cited by 64 publications
(57 citation statements)
references
References 46 publications
0
57
0
Order By: Relevance
“…Parallel programming is incomplete without discussing the most recent approach called MAP Reduce. It can process large sized data in highly parallel manner [8]. Map Reduce was introduced by Google in 2004.…”
Section: Parallel Approches For Data Miningmentioning
confidence: 99%
“…Parallel programming is incomplete without discussing the most recent approach called MAP Reduce. It can process large sized data in highly parallel manner [8]. Map Reduce was introduced by Google in 2004.…”
Section: Parallel Approches For Data Miningmentioning
confidence: 99%
“…Very few works address issues related to concept drift in a P2P network. A fully distributed decision tree induction method was proposed by Bhaduri et al [10]. The proposal involves drift detection, that triggers a tree update.…”
Section: Handling Concept Drift In Fully Distributed Environmentsmentioning
confidence: 99%
“…Bawa et al [4] developed an approach based on probabilistic counting. In addition, techniques have been developed for addressing more complex data mining/data problems over large-scale dynamic networks: association rule mining [28], facility location [24], outlier detection [9], decision tree induction [7], ensemble classification [25], support vector machine-based classification [1], K-means clustering [11], top-K query processing [3]. A related line of research concerns the monitoring of various kinds of data models over large numbers of data streams.…”
Section: Data Analysis In Large Dynamic Networkmentioning
confidence: 99%