Human action recognition is an active topic in computer vision, driven by recent advances in deep learning. Current models achieve high accuracy on public datasets, but training them requires significant computational resources. Because transfer learning techniques make it possible to reuse what other models have already learned and to train new models with fewer computational resources, in this work we propose a transfer-learning-based approach for action recognition in videos. We describe a methodology for human action recognition using transfer learning on a custom dataset. The proposed method consists of four stages: 1) human detection and tracking, 2) video preprocessing, 3) feature extraction (using models pretrained on ImageNet), and 4) action recognition with a two-stream model composed of TCN, LSTM, and CNN layers. The custom dataset is imbalanced, containing 189, 390, 490, 854, and 890 videos per class. For feature extraction, we analyzed the performance of seven pretrained models: Inception-v3, MobileNet-v2, MobileNet-v3-L, VGG-16, VGG-19, Xception, and ConvNeXt-L, with ConvNeXt-L yielding the best results. Finally, using pretrained models for feature extraction allowed training on a PC with a single GPU, reaching an accuracy of 94.9%.
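As a minimal illustrative sketch (not the exact pipeline described above), the feature-extraction stage can be approximated with a frozen ImageNet-pretrained backbone applied frame by frame. The example below assumes TensorFlow/Keras and uses VGG-16 purely for illustration; the `extract_features` helper and the frame dimensions are hypothetical choices, not details from the paper.

```python
# Sketch: per-frame feature extraction with an ImageNet-pretrained backbone
# (frozen, used only as a feature extractor). Assumes TensorFlow/Keras.
import numpy as np
import tensorflow as tf

# VGG-16 without its classification head; global average pooling yields
# one 512-dimensional feature vector per input frame.
backbone = tf.keras.applications.VGG16(
    weights="imagenet", include_top=False, pooling="avg"
)
backbone.trainable = False  # weights stay fixed during downstream training

def extract_features(frames: np.ndarray) -> np.ndarray:
    """frames: (num_frames, 224, 224, 3) RGB array -> (num_frames, 512) features."""
    x = tf.keras.applications.vgg16.preprocess_input(frames.astype("float32"))
    return backbone.predict(x, verbose=0)

# Usage example with dummy data standing in for one preprocessed video clip.
dummy_clip = np.random.randint(0, 255, size=(16, 224, 224, 3), dtype=np.uint8)
features = extract_features(dummy_clip)
print(features.shape)  # (16, 512): one feature vector per frame
```

The resulting per-frame feature sequences would then feed a downstream temporal classifier (e.g., the TCN/LSTM/CNN two-stream model mentioned above), which is the part that is actually trained on the custom dataset.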