CSI-IANet: An Inception Attention Network for Human-Human Interaction Recognition Based on CSI Signal

Rahman, Mizanur; Shin, Wonjae

doi:10.1109/access.2021.3134794

Cited by 13 publications

(10 citation statements)

References 46 publications


“…The proposed model is trained and evaluated using a 10-fold CV technique. The fold-wise performance results in Table 3 assumes that the highest accuracy is achieved for the To make a performance comparison with other deep learning based approaches, we use four different models: two pre-trained CNNs (ResNet-50 [29], DenseNet-121 [30]), an end-to-end deep learning framework (E2EDLF) [31], and CSI-IANet [32]. Two pre-trained CNNs are tuned via the transfer learning concept using the collected CSI gesture dataset.…”

Section: Results (mentioning)

confidence: 99%

CSI-DeepNet: A Lightweight Deep Convolutional Neural Network Based Hand Gesture Recognition System Using Wi-Fi CSI Signal

Hasan

Shin

2022

IEEE Access

Self Cite


Hand gesture is a visual input of human-computer interaction for providing different applications in smart homes, healthcare, and eldercare. Most deep learning-based techniques adopt standard convolution neural networks (CNNs) which require a large number of model parameters with high computational complexity; thus, it is not suitable for application in devices with limited computational resources. However, fewer model parameters can reduce the system accuracy. To address this challenge, we propose a lightweight heterogeneous deep learning-based gesture recognition system, coined CSI-DeepNet. The CSI-DeepNet comprises four steps: i) data collection, ii) data processing, iii) feature extraction, and iv) classification. We utilize a low-power system-on-chip (SoC), ESP-32, for the first time to collect alphanumeric hand gesture datasets using channel state information (CSI) with 1,800 trials of 20 gestures, including the steady-state data of ten people. A Butterworth low-pass filter with Gaussian smoothing is applied to remove noise; subsequently, the data is split into windows with sufficient dimensions in the data processing step before feeding to the model. The feature extraction section utilizes a depthwise separable convolutional neural network (DS-Conv) with a feature attention (FA) block and residual block (RB) to extract fine-grained features while reducing the complexity using fewer model parameters. Finally, the extracted refined features are classified in the classification section. The proposed system achieves an average accuracy of 96.31% with much less computational complexity, which is better than the results obtained using state-of-the-art pre-trained CNNs and two deep learning models using CSI data.

INDEX TERMS: Hand gesture recognition, channel state information (CSI), deep learning, depthwise separable convolutional neural network (DS-Conv), feature attention, residual block, system-on-chip (SoC).
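The abstract above attributes the reduced model size to depthwise separable convolution (DS-Conv). As a rough illustration of why that helps, the sketch below counts weights for a standard convolution and for a depthwise separable one. The layer sizes (64 input channels, 128 output channels, 3x3 kernel) are hypothetical and are not taken from CSI-DeepNet; bias terms are ignored.

```python
def standard_conv_params(c_in, c_out, k):
    """Weight count of a standard k x k convolution (bias ignored)."""
    return k * k * c_in * c_out


def ds_conv_params(c_in, c_out, k):
    """Weight count of a depthwise separable convolution (bias ignored).

    Depthwise step: one k x k filter per input channel.
    Pointwise step: a 1 x 1 convolution mixing c_in channels into c_out.
    """
    depthwise = k * k * c_in
    pointwise = c_in * c_out
    return depthwise + pointwise


# Hypothetical layer sizes, chosen only for illustration.
c_in, c_out, k = 64, 128, 3

std = standard_conv_params(c_in, c_out, k)
ds = ds_conv_params(c_in, c_out, k)
ratio = ds / std  # equals 1/c_out + 1/k**2 in exact arithmetic

print(f"standard: {std} weights, depthwise separable: {ds} weights")
print(f"ratio: {ratio:.4f}")
```

For these assumed sizes the separable layer needs roughly one ninth of the weights, since the ratio is 1/c_out + 1/k^2. Whether accuracy is preserved depends on the rest of the architecture, which this count does not address.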


“…However, the presence of unsynchronized transmitters and receivers can cause random phase offsets in CSI and change it chaotically. In addition, the phase can be influenced by the sampling frequency offset, while CSI usually has an almost fixed range [24]. Therefore, the CSI amplitude is usually used.…”

Section: System Methods (mentioning)

confidence: 99%

CSI-Based Human Activity Recognition Using Multi-Input Multi-Output Autoencoder and Fine-Tuning

Chahoushi

Nabati

Asvadi

et al. 2023

Sensors


Wi-Fi-based human activity recognition (HAR) has gained considerable attention recently due to its ease of use and the availability of its infrastructures and sensors. Channel state information (CSI) captures how Wi-Fi signals are transmitted through the environment. Using channel state information of the received signals transmitted from Wi-Fi access points, human activity can be recognized with more accuracy compared with the received signal strength indicator (RSSI). However, in many scenarios and applications, there is a serious limit in the volume of training data because of cost, time, or resource constraints. In this study, multiple deep learning models have been trained for HAR to achieve an acceptable accuracy level while using less training data compared to other machine learning techniques. To do so, a pretrained encoder which is trained using only a limited number of data samples, is utilized for feature extraction. Then, by using fine-tuning, this encoder is utilized in the classifier, which is trained by a fraction of the rest of the data, and the training is continued alongside the rest of the classifier’s layers. Simulation results show that by using only 50% of the training data, there is a 20% improvement compared with the case where the encoder is not used. We also showed that by using an untrainable encoder, an accuracy improvement of 11% using 50% of the training data is achievable with a lower complexity level.
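The statement quoted for this entry notes that unsynchronized hardware adds random phase offsets to CSI, which is why the amplitude is usually used. The sketch below illustrates that point under a simplified, assumed error model: each subcarrier is multiplied by a unit-modulus phase term with a constant part and a part that grows linearly with the subcarrier index. The CSI values, `beta`, and `slope` are hypothetical, and real hardware impairments may be more complex.

```python
import cmath

# Hypothetical complex CSI values for four subcarriers (illustrative only).
csi = [complex(0.8, 0.3), complex(-0.2, 1.1), complex(0.5, -0.7), complex(1.0, 0.0)]

# Assumed phase-error model: constant offset plus a per-subcarrier slope.
beta = 1.234   # hypothetical common phase offset in radians
slope = 0.15   # hypothetical phase slope per subcarrier index


def add_phase_error(values, beta, slope):
    """Rotate each subcarrier by exp(j * (beta + slope * k))."""
    return [h * cmath.exp(1j * (beta + slope * k)) for k, h in enumerate(values)]


def amplitude(values):
    """CSI amplitude per subcarrier."""
    return [abs(h) for h in values]


corrupted = add_phase_error(csi, beta, slope)

# The phase changes, but the amplitude does not (up to floating-point rounding).
max_amp_diff = max(abs(a - b) for a, b in zip(amplitude(csi), amplitude(corrupted)))
phase_shift_0 = cmath.phase(corrupted[0]) - cmath.phase(csi[0])

print(f"largest amplitude change: {max_amp_diff:.2e}")
print(f"phase change on subcarrier 0: {phase_shift_0:.3f} rad")
```

Because every error term has modulus one, the amplitude is unaffected in this model, while the measured phase shifts by an amount unrelated to the activity being sensed.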


“…With respect to deep learning-based methods, Kabir et al [15] introduced CSI-IANet, which utilized a Butterworth lowpass filter to denoise the CSI signal, employed three layers of CNN which is an inception module providing the model with varying receptive fields, and spatial-attention to first achieved an accuracy over 90% (91.3%). Kabir and Shin [28] presented DCNN, which employed only three layers of CNNs and achieved an accuracy of 88.66%.…”

Section: B Compare To State-of-the-art-methods (mentioning)

confidence: 99%

“…Kabir et al [15] developed the CSI-based Inception Attention Network (CSI-IANet) incorporating CNNs and spatial-attention and evaluated it using a dataset of Wi-Fi-based human-to-human interactions (HHI), which is the same dataset used in this paper [1]. The HHI dataset includes 12 different human-to-human interactions performed by two subjects and will be described in detail in section III.…”

Section: A CNN-based Approaches (mentioning)

confidence: 99%

A Lightweight Mobile Temporal Convolution Network for Multi-Location Human Activity Recognition based on Wi-Fi

Jiang

et al. 2021

2021 IEEE/CIC International Conference on Communications in China (ICCC Workshops)


The utilization of Wi-Fi-based human activity recognition (HAR) has gained considerable interest in recent times, primarily owing to its applications in various domains such as healthcare for monitoring breath and heart rate, security, elderly care, and others. These Wi-Fi-based methods exhibit several advantages over conventional state-of-the-art techniques that rely on cameras and sensors, including lower costs and ease of deployment. However, a significant challenge associated with Wi-Fi-based HAR is the significant decline in performance when the scene or subject changes. To mitigate this issue, it is imperative to train the model using an extensive dataset. In recent studies, the utilization of CNN-based models or sequence-to-sequence models such as LSTM, GRU, or Transformer has become prevalent. While sequence-to-sequence models can be more precise, they are also more computationally intensive and require a larger amount of training data. To tackle these limitations, we propose a novel approach that leverages a temporal convolution network with augmentations and attention, referred to as TCN-AA. Our proposed method is computationally efficient and exhibits improved accuracy even when the data size is increased threefold through our augmentation techniques. Our experiments on a publicly available dataset indicate that our approach outperforms existing state-of-the-art methods, with a final accuracy of 99.42%.
