2021
DOI: 10.1109/access.2021.3082755
|View full text |Cite
|
Sign up to set email alerts
|

Towards an Unsupervised Feature Selection Method for Effective Dynamic Features

Abstract: Dynamic features applications present new obstacles for the selection of streaming features. The dynamic features applications have various characteristics: a) features are processed sequentially while the number of instances is fixed; and b) the feature space does not exist in advance. For example, in a text classification task for spam detection, new features (e.g. words) are dynamically generated and therefore need to be mined to filter out the spams rather than waiting for all features to be collected in o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(3 citation statements)
references
References 21 publications
0
3
0
Order By: Relevance
“…Embedded methods 11 13 such as LASSO and Decision Trees, incorporate feature selection within the algorithm itself for different scenarios. These methods usually provide a good trade-off between performance and speed but are limited by the biases of the algorithms they are integrated with for different use cases 14 16 .…”
Section: In-depth Review Of Existing Machine Learning Models Used For...mentioning
confidence: 99%
“…Embedded methods 11 13 such as LASSO and Decision Trees, incorporate feature selection within the algorithm itself for different scenarios. These methods usually provide a good trade-off between performance and speed but are limited by the biases of the algorithms they are integrated with for different use cases 14 16 .…”
Section: In-depth Review Of Existing Machine Learning Models Used For...mentioning
confidence: 99%
“…In a text mining assignment for spam filtering, for example, additional features (e.g., words) are dynamically created and must therefore be exploited to filter out the spam instead of waiting for every characteristic to be collected. Traditional methodologies, which have not been developed for streaming information applications, cannot be employed in this situation since they demand that the whole extracted feature set be known beforehand to evaluate the effective attributes effectively and scientifically [2,3]. Parkinson's disease is a widespread neurological disorder.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Since the datasets have large amounts of information and all data are not required for processing. When the dataset dimension expands then the classification accuracy of the system diminishes also it takes extra time for processing [2]. To avoid this problem feature selection helps to select only the necessary information from the datasets.…”
Section: Introductionmentioning
confidence: 99%