2019
DOI: 10.1016/j.ins.2018.12.060
|View full text |Cite
|
Sign up to set email alerts
|

Variable selection in classification for multivariate functional data

Abstract: When classification methods are applied to high-dimensional data, selecting a subset of the predictors may lead to an improvement in the predictive ability of the estimated model, in addition to reducing the model complexity. In Functional Data Analysis (FDA), i.e., when data are functions, selecting a subset of predictors corresponds to selecting a subset of individual time instants in the time interval in which the functional data are measured. In this paper, we address the problem of selecting the most info… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

1
15
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 22 publications
(16 citation statements)
references
References 40 publications
1
15
0
Order By: Relevance
“…Instead, we propose to maximize the Pearson correlation, R, between the class label Y i of the functional data X i and the score,Ŷ (X i , θ, α) in (1), where all the variables, including the time instants, are treated as continuous variables. Other references in the literature, such as [7,34], have previously used with excellent results the Pearson correlation coefficient. Despite the fact that, when using the Pearson correlation coefficient as a surrogate of accuracy, a linear relationship between the binary label, Y ∈ {−1, 1}, and the real-valued score,Ŷ ∈ R, is implicitly assumed, this coefficient is very fast to compute, and even more important, it also allows us to use gradient-based methodologies since its optimization amounts to solving a continuous optimization problem.…”
Section: Problem Formulation and Optimizationmentioning
confidence: 99%
See 2 more Smart Citations
“…Instead, we propose to maximize the Pearson correlation, R, between the class label Y i of the functional data X i and the score,Ŷ (X i , θ, α) in (1), where all the variables, including the time instants, are treated as continuous variables. Other references in the literature, such as [7,34], have previously used with excellent results the Pearson correlation coefficient. Despite the fact that, when using the Pearson correlation coefficient as a surrogate of accuracy, a linear relationship between the binary label, Y ∈ {−1, 1}, and the real-valued score,Ŷ ∈ R, is implicitly assumed, this coefficient is very fast to compute, and even more important, it also allows us to use gradient-based methodologies since its optimization amounts to solving a continuous optimization problem.…”
Section: Problem Formulation and Optimizationmentioning
confidence: 99%
“…Support Vector Machine (SVM) [13,14,19,20,39,41,42,56,61] is one of the most used tools in multivariate classification, and it has also been widely applied for functional data. See [7,34,43,44,49,50] among others.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…In this article, we consider the problem of the optimal selection of time intervals (including time instants as a degenerate case) for functional Support Vector Regression. We extend the work done recently for selecting time instants, but not intervals, for the classification, instead of regression, problem (Blanquero et al, 2019b). The use of Support Vector Regression allows us to capture nonlinearities.…”
Section: Introductionmentioning
confidence: 99%
“…In particular, the feature selection problem in SVR with functional data has not yet been studied in the literature. For the classification problem, an SVM-based method to select time instants has been proposed in Blanquero et al (2019b).…”
Section: Introductionmentioning
confidence: 99%