2018 IEEE Wireless Communications and Networking Conference (WCNC) 2018
DOI: 10.1109/wcnc.2018.8377027
|View full text |Cite
|
Sign up to set email alerts
|

Clustering-based separation of media transfers in DPI-classified cellular video and VoIP traffic

Abstract: Identifying VoIP and video traffic is often useful in the context of managing a cellular network, and to perform such traffic classification deep packet inspection (DPI) approaches are often used. Commercial DPI classifiers do not necessarily differentiate between, for example, YouTube traffic that arises from browsing inside the YouTube app, and traffic arising from the actual viewing of a YouTube video. Here we apply unsupervised clustering methods on such cellular DPI-labeled VoIP and video traffic to ident… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 16 publications
0
2
0
Order By: Relevance
“…Both these actions would be labeled as YouTube by the DPI, but if the objective is to detect flows that perform actual video transfer such ground truth does not provide the optimal training. Utilizing unsupervised machine learning to improve the quality of the DPI training labels has been initially explored for a smaller data set in [17], and further elaborated for the current data set in [18].…”
Section: Discussionmentioning
confidence: 99%
“…Both these actions would be labeled as YouTube by the DPI, but if the objective is to detect flows that perform actual video transfer such ground truth does not provide the optimal training. Utilizing unsupervised machine learning to improve the quality of the DPI training labels has been initially explored for a smaller data set in [17], and further elaborated for the current data set in [18].…”
Section: Discussionmentioning
confidence: 99%
“…At this point, the dissimilarity associated with two sessions is defined as the Euclidean distance between the two related points in the aforementioned hyperplane. Indeed, the optimal value of K is calculated in order to ensure that the intra-cluster distances are minimized and the inter-cluster distances are maximized [14] (note that, according to K-means terminology, this means that the silhouette [15] is maximized). Finally, the clustering process provides in output the sessions of each cluster and a special point of the hyperplane, namely centroid, that identifies the cluster itself.…”
Section: The Proposed Approachmentioning
confidence: 99%