2021
DOI: 10.22266/ijies2021.0430.48
|View full text |Cite
|
Sign up to set email alerts
|

ANOVA-SVM for Selecting Subset Features in Encrypted Internet Traffic Classification

Abstract: Encryption technique is widely used in the internet network for protecting user privacy, maintaining the confidentiality of the data, avoiding firewall detection, and administrating the system. To prevent encryption techniques in malicious activities such as encrypting data that contains malware or viruses, illegal transactions like selling drugs, illegal weapons and fake documents, a company or institution uses encrypted internet traffic classification to analyze and identify the activity. A challenging probl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(2 citation statements)
references
References 17 publications
0
2
0
Order By: Relevance
“…We evaluated the feature importance through three different types of models, intended to check which features are consistently important between models. The pipeline ANOVA-SVM (Megantara and Ahmad, 2021 ) calculates the average F-score for the selected features. Linear models calculate the coefficient of each feature to determine feature importance.…”
Section: Methodsmentioning
confidence: 99%
“…We evaluated the feature importance through three different types of models, intended to check which features are consistently important between models. The pipeline ANOVA-SVM (Megantara and Ahmad, 2021 ) calculates the average F-score for the selected features. Linear models calculate the coefficient of each feature to determine feature importance.…”
Section: Methodsmentioning
confidence: 99%
“…ANOVA-f test. Analysis of variance, or ANOVA, is a method used to determine any statistically significant differences among the means of two or more groups into the target variable [33]. The high value of F-statistic and low p-value suggests that there are significant differences of the groups, suggesting that the features are relevant to the targeted variable [34].…”
Section: (Contingency Table) Contingency Table or Also Known As Cross...mentioning
confidence: 99%