Structural Attention Enhanced Continual Meta-Learning for Graph Edge Labeling Based Few-Shot Remote Sensing Scene Classification

Li, Feimo; Li, Shuaibo; Fan, Xinxin; Li, Xiong; Chang, Hongxing

doi:10.3390/rs14030485

Cited by 11 publications

(3 citation statements)

References 54 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…5(b) shows that the higher the number of parallel attention heads, the higher the F1-scores, due to the extended capacity to encode distinct relationships between time steps. This seems to agree with the results of [40], where, although the optimal number of heads depended on the dataset, the datasets with more samples and variability in each class were able to achieve better performances when the number of heads increased.…”

Section: A Hyperparameter Optimization Resultssupporting

confidence: 90%

Seeking Optimal and Explainable Deep Learning Models for Inertial-based Posture Recognition

Martins,

Cerqueira,

Santos

2024

Preprint

View full text Add to dashboard Cite

Deep Learning (DL) models, widely used in several domains, are currently often used for posture recognition. This work researches four DL architectures for posture recognition: Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), hybrid CNN-LSTM, and transformer. Agriculture and construction working postures were addressed as use cases, by acquiring an inertial dataset during the simulation of their typical tasks in circuits. Since model performance greatly depends on the choice of the hyperparameters, a grid search was conducted to find the optimal hyperparameters. An extensive analysis of the hyperparameter combinations' effects is presented, identifying some general tendencies. Moreover, to unveil the black-box DL models, we applied the Gradient-weighted Class Activation Mapping (Grad-CAM) explainability method on CNN's outputs to better understand the model's decision-making, in terms of the most important sensors and time steps for each window output. An innovative combination of CNN and LSTM was implemented for the hybrid architecture, by using the convolution feature maps as LSTM inputs and fusing both subnetworks' outputs with weights, which are learned during the training. All architectures were successful in recognizing the eight posture classes, with the best model of each architecture exceeding 91.5% F1-score in the test. A top F1-score of 94.31%, with an inference time of just 2.96 ms, was achieved by a hybrid CNN-LSTM.Impact Statement-Work-related musculoskeletal disorders (WRMSDs) are the most reported work-related health problem in the European Union. To get a wider picture of the working tasks with a higher WRMSD risk, we aim to decompose them by determining the postures necessary for their accomplishment. However, a manual record by the worker is neither practical nor feasible. Accordingly, automated human posture recognition and, particularly, DL models have shown huge potential, reaching high accuracies, but they suffer from a lack of transparency. Thus, in addition to their classification performance, we are concerned about these models' capacity to support their decisions with explanations. Therefore, we implemented Grad-CAM-based explainable CNNs, which also demonstrated their capacity to reach high F1-scores. Besides, we combined CNN with LSTM in a hybrid model, and we let the network learn which subnetwork it should give more importance to decide the output.

show abstract

Section: A Hyperparameter Optimization Resultssupporting

confidence: 90%

Seeking Optimal and Explainable Deep Learning Models for Inertial-based Posture Recognition

Martins,

Cerqueira,

Santos

2024

Preprint

View full text Add to dashboard Cite

show abstract

“…For example, Li et al. (2022) utilize Gated Recurrent Unit as the core of a meta‐learning module to continuously iterate and optimize based on the input sequence data, providing a few‐shot method for applications such as agricultural evaluation. This type of meta‐learning method is also suitable for data‐efficient few‐shot learning and has wider applicability.…”

Section: Methods Of Meta‐learning In Plant Disease Recognitionmentioning

confidence: 99%

Meta‐learning shows great potential in plant disease recognition under few available samples

Deng

Wang

et al. 2023

The Plant Journal

View full text Add to dashboard Cite

Plant diseases worsen the threat of food shortage with the growing global population, and disease recognition is the basis for the effective prevention and control of plant diseases. Deep learning has made significant breakthroughs in the field of plant disease recognition. Compared with traditional deep learning, metalearning can still maintain more than 90% accuracy in disease recognition with small samples. However, there is no comprehensive review on the application of meta-learning in plant disease recognition. Here, we mainly summarize the functions, advantages, and limitations of meta-learning research methods and their applications for plant disease recognition with a few data scenarios. Finally, we outline several research avenues for utilizing current and future meta-learning in plant science. This review may help plant science researchers obtain faster, more accurate, and more credible solutions through deep learning with fewer labeled samples.

show abstract

“…In high-resolution sat ellite image scene classification, Zhai and colleagues introduced a lifelong few-shot learn ing approach [19], enabling easy adaptation to new datasets. Li et al [20] improved inter task relevance by integrating more historical prior knowledge from partial intratask se quences. They also introduced a graph transformer to optimize the distribution of sample features in the embedding space.…”

Section: Introductionmentioning

confidence: 99%

Optimizing Few-Shot Remote Sensing Scene Classification Based on an Improved Data Augmentation Approach

Dong,

Lin,

Xie

2024

Remote Sensing

View full text Add to dashboard Cite

In the realm of few-shot classification learning, the judicious application of data augmentation methods has a significantly positive impact on classification performance. In the context of few-shot classification tasks for remote sensing images, the augmentation of features and the efficient utilization of limited features are of paramount importance. To address the performance degradation caused by challenges such as high interclass overlap and large intraclass variance in remote sensing image features, we present a data augmentation-based classification optimization method for few-shot remote sensing image scene classification. First, we construct a distortion magnitude space using different types of features, and we perform distortion adjustments on the support set samples while introducing an optimal search for the distortion magnitude (ODS) method. Then, the augmented support set offers a wide array of feature distortions in terms of types and degrees, significantly enhancing the generalization of intrasample features. Subsequently, we devise a dual-path classification (DC) decision strategy, effectively leveraging the discriminative information provided by the postdistortion features to further reduce the likelihood of classification errors. Finally, we evaluate the proposed method using a widely used remote sensing dataset. Our experimental results demonstrate that our approach outperforms benchmark methods, achieving improved classification accuracy.

show abstract

Structural Attention Enhanced Continual Meta-Learning for Graph Edge Labeling Based Few-Shot Remote Sensing Scene Classification

Cited by 11 publications

References 54 publications

Seeking Optimal and Explainable Deep Learning Models for Inertial-based Posture Recognition

Seeking Optimal and Explainable Deep Learning Models for Inertial-based Posture Recognition

Meta‐learning shows great potential in plant disease recognition under few available samples

Optimizing Few-Shot Remote Sensing Scene Classification Based on an Improved Data Augmentation Approach

Contact Info

Product

Resources

About