Stair Recognition for Robotic Exoskeleton Control using Computer Vision and Deep Learning

Kurbis, Andrew Garrett; Laschowski, Brock; Mihailidis, Alex

doi:10.1101/2022.04.11.487925

Cited by 2 publications

(12 citation statements)

References 20 publications


“…To train and test our image classification models, we used the open-source StairNet dataset [21] with four environment classes: level-ground terrain (LG), level-ground transition to incline stairs (LG-IS), incline stairs (IS), and incline stairs transition to level-ground (IS-LG). The StairNet dataset includes videos recorded in urban environments using a wearable camera.…”

Section: A Image Dataset (mentioning)

confidence: 99%

“…Recent studies [12]-[19] have focused on using deep learning and large-scale datasets, such as ExoNet [20] and StairNet [21], to develop systems that can generalize to diverse walking environments. These systems use convolutional neural networks (CNNs) and transfer learning for image classification such that the model weights are trained on large datasets like ImageNet [22] and fine-tuned on downstream tasks with mod-…”

Section: Introduction (mentioning)

confidence: 99%

“…Cameras have been used for environment state estimation in combination with inertial sensors, used for locomotor state estimation [2]. Early systems for environment recognition were limited to statistical pattern recognition and machine learning algorithms that required manual feature engineering and/or were developed using relatively small image datasets [3]-[11]. Recent studies [12]-[19] have focused on using deep learning and large-scale datasets, such as ExoNet [20] and StairNet [21], to develop systems that can generalize to diverse walking environments. These systems use convolutional neural networks (CNNs) and transfer learning for image classification such that the model weights are trained on large datasets like ImageNet [22] and fine-tuned on downstream tasks with mod-…”

mentioning

confidence: 99%


Sequential Image Classification of Human-Robot Walking Environments using Temporal Neural Networks

Ivanyuk-Skulskiy, Kurbis, Mihailidis, et al. 2023

Preprint

Self Cite


Robotic prosthetic legs and exoskeletons require real-time and accurate estimation of the walking environment for smooth transitions between different locomotion mode controllers. However, previous studies have mainly been limited to static image classification, therein ignoring the temporal dynamics of human-robot locomotion. Motivated by these limitations, here we developed several state-of-the-art temporal convolutional neural networks (CNNs) to compare the performances between static vs. sequential image classification of real-world walking environments (i.e., level-ground terrain, incline stairs, and transitions to and from stairs). Using our large-scale image dataset, we trained a number of encoder networks such as VGG, MobileNetV2, ViT, and MobileViT, each coupled with a temporal long short-term memory (LSTM) backbone. We also trained MoViNet, a new video classification model designed for mobile and embedded devices, to further compare the performances between 2D and 3D temporal deep learning models. Our 3D network outperformed all the hybrid 2D encoders with LSTM backbones and the 2D CNN baseline model in terms of classification accuracy, suggesting that network architecture can play an important role in performance. However, although our 3D neural network achieved the highest classification accuracy, it had disproportionally higher computational and memory storage requirements, which can be disadvantageous for real-time control of robotic leg prostheses and exoskeletons with limited onboard resources.
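The difference between static and sequential classification described in the abstract above can be illustrated with a minimal sketch. It does not reproduce the authors' LSTM or MoViNet models; it swaps in a simple moving average over per-frame class probabilities to show how temporal context can change a prediction. The class labels follow the StairNet classes quoted earlier (LG, LG-IS, IS, IS-LG); the probability values, the function names, and the window size are hypothetical.

```python
from collections import deque

# Class labels as quoted from the StairNet dataset description.
CLASSES = ["LG", "LG-IS", "IS", "IS-LG"]


def argmax(values):
    """Return the index of the largest value (first one on ties)."""
    return max(range(len(values)), key=lambda i: values[i])


def static_predictions(frame_probs):
    """Classify every frame independently of its neighbours."""
    return [CLASSES[argmax(probs)] for probs in frame_probs]


def sequential_predictions(frame_probs, window=3):
    """Classify each frame from the mean probabilities of the last
    `window` frames (a stand-in for a learned temporal model)."""
    recent = deque(maxlen=window)
    labels = []
    for probs in frame_probs:
        recent.append(probs)
        mean = [sum(column) / len(recent) for column in zip(*recent)]
        labels.append(CLASSES[argmax(mean)])
    return labels


# Hypothetical per-frame outputs with a one-frame spike towards "IS".
frames = [
    [0.90, 0.05, 0.03, 0.02],
    [0.80, 0.10, 0.05, 0.05],
    [0.30, 0.10, 0.55, 0.05],
    [0.85, 0.05, 0.05, 0.05],
]

static_labels = static_predictions(frames)
sequential_labels = sequential_predictions(frames)
```

In this toy input the static classifier follows the one-frame spike, whereas the windowed version keeps predicting level-ground terrain. A learned temporal model would weigh past frames in a data-driven way rather than uniformly.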


“…However, the dataset has many environmental classes that overlap with one another with varying degrees, resulting in poor generalization. Kurbis and colleagues [8]-[9] recently developed a 4-class image dataset called StairNet based on ExoNet by combining similar classes and focusing on stair recognition. The system allowed for more consistent and robust training of a convolutional neural network.…”

Section: Introduction (mentioning)

confidence: 99%

“…Our goal is to support the development of new computer vision systems for environment-adaptive control of robotic prosthetic legs and exoskeletons. This work focuses on improving training efficiency and making computer vision systems more accessible to researchers in wearable robotics by minimizing the number of required labelled images while maintaining high prediction accuracy similar to the previous state-of-the-art [8]-[9].…”

Section: Introduction (mentioning)

confidence: 99%

Efficient Visual Perception of Human-Robot Walking Environments using Semi-Supervised Learning

Kuzmenko, Tsepa, Kurbis, et al. 2023

Preprint


Convolutional neural networks trained using supervised learning can improve visual perception for human-robot walking. These advances have been possible due to large-scale datasets like ExoNet and StairNet - the largest open-source image datasets of real-world walking environments. However, these datasets require vast amounts of manually annotated data, the development of which is time consuming and labor intensive. Here we present a novel semi-supervised learning system (ExoNet-SSL) that uses over 1.2 million unlabelled images from ExoNet to improve training efficiency. We developed a deep learning model based on mobile vision transformers and trained the model using semi-supervised learning for image classification. Compared to standard supervised learning (98.4%), our ExoNet-SSL system was able to maintain high prediction accuracy (98.8%) when tested on previously unseen environments, while requiring 35% fewer labelled images during training. These results show that semi-supervised learning can improve training efficiency by leveraging large amounts of unlabelled data and minimize the size requirements for manually annotated images. Future research will focus on model deployment for onboard real-time inference and control of human-robot walking.
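The abstract above does not state which semi-supervised algorithm ExoNet-SSL uses. As a generic illustration of how unlabelled images can contribute to training, the sketch below shows confidence-thresholded pseudo-labelling, a common ingredient of semi-supervised image classification. The function name, the threshold of 0.95, and the probability values are assumptions made for this example and are not taken from the paper.

```python
def select_pseudo_labels(unlabelled_probs, threshold=0.95):
    """Keep only unlabelled samples whose top predicted probability
    reaches the threshold, and return (sample index, class index) pairs
    that could then be treated as labels in a later training round."""
    selected = []
    for index, probs in enumerate(unlabelled_probs):
        confidence = max(probs)
        if confidence >= threshold:
            selected.append((index, probs.index(confidence)))
    return selected


# Hypothetical model outputs for three unlabelled images, four classes.
predictions = [
    [0.97, 0.01, 0.01, 0.01],  # confident: kept
    [0.40, 0.35, 0.15, 0.10],  # ambiguous: discarded
    [0.02, 0.02, 0.96, 0.00],  # confident: kept
]

pseudo_labels = select_pseudo_labels(predictions)
```

Raising the threshold admits fewer but cleaner pseudo-labels, while lowering it uses more of the unlabelled pool at the cost of noisier supervision.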
