2004
DOI: 10.1007/978-3-540-24668-8_21
|View full text |Cite
|
Sign up to set email alerts
|

Flow Clustering Using Machine Learning Techniques

Abstract: Abstract. Packet header traces are widely used in network analysis. Header traces are the aggregate of traffic from many concurrent applications. We present a methodology, based on machine learning, that can break the trace down into clusters of traffic where each cluster has different traffic characteristics. Typical clusters include bulk transfer, single and multiple transactions and interactive traffic, amongst others. The paper includes a description of the methodology, a visualisation of the attribute sta… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
235
1
2

Year Published

2005
2005
2022
2022

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 390 publications
(239 citation statements)
references
References 2 publications
1
235
1
2
Order By: Relevance
“…Researchers and developers often embed an assumption of traffic symmetry in tools and analyses [8,9,10], an assumption only safe for stub access links, otherwise quite harmful [11].…”
Section: Introductionmentioning
confidence: 99%
“…Researchers and developers often embed an assumption of traffic symmetry in tools and analyses [8,9,10], an assumption only safe for stub access links, otherwise quite harmful [11].…”
Section: Introductionmentioning
confidence: 99%
“…This better models a real-world situation given that it may not be accurate to associate a given data point to one exclusive cluster based on the training set. McGregor et al [57] used the Expectation Maximization (EM) algorithm to classify flows. The authors believed Internet traffic flows can be clustered by application as they were able to see distinct applications based on the packet size and inter-arrival time of packets.…”
Section: Clusteringmentioning
confidence: 99%
“…Also, it is unclear how good the discrimination of flows is because in [3] the sets of attributes are averaged over all flows of certain applications in 24-hour periods. In [4] the authors use the Expectation Maximization (EM) algorithm to cluster flows into different application types using a fixed set of attributes. From their evaluation it is not clear what influence different attributes have and how good the clustering actually is.…”
Section: Related Workmentioning
confidence: 99%
“…EM is an unsupervised Bayesian classifier that automatically learns the 'natural' classes (also called clustering) inherent in a training dataset with unclassified cases. The resulting classifier can then be used to classify new cases (see [4], [9]). …”
Section: Ml-based Flow Classification Approach and Evaluationmentioning
confidence: 99%
See 1 more Smart Citation