HANDS: a multimodal dataset for modeling toward human grasp intent inference in prosthetic hands

Han, Mo; Gunay, Sezen Yagmur; Schirner, Gunar; Padır, Taşkın; Erdoğmuş, Deni̇z

doi:10.1007/s11370-019-00293-8

Cited by 18 publications

(10 citation statements)

References 25 publications

(51 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…HCI enables users to communicate their physiological information with machines for help with manipulating external devices in a more reliable, robust and safe manner. Traditionally, the assessment of physiological activity (e.g., human stress level and mental status) was implemented by monitoring signals such as electroencephalography (EEG) [2] and electromyography (EMG) [3]. However, these measurements require either surface (non-invasive) or implanted (invasive) electrodes and frequent calibration, which increase system cost and decrease user comfort.…”

Section: Introductionmentioning

confidence: 99%

Disentangled Adversarial Autoencoder for Subject-Invariant Physiological Feature Extraction

Han

Özdenizci

Wang

et al. 2020

IEEE Signal Process. Lett.

Self Cite

View full text Add to dashboard Cite

Recent developments in biosignal processing have enabled users to exploit their physiological status for manipulating devices in a reliable and safe manner. One major challenge of physiological sensing lies in the variability of biosignals across different users and tasks. To address this issue, we propose an adversarial feature extractor for transfer learning to exploit disentangled universal representations. We consider the trade-off between task-relevant features and user-discriminative information by introducing additional adversary and nuisance networks in order to manipulate the latent representations such that the learned feature extractor is applicable to unknown users and various tasks. Results on cross-subject transfer evaluations exhibit the benefits of the proposed framework, with up to 8.8% improvement in average accuracy of classification, and demonstrate adaptability to a broader range of subjects.

show abstract

Section: Introductionmentioning

confidence: 99%

Disentangled Adversarial Autoencoder for Subject-Invariant Physiological Feature Extraction

Han

Özdenizci

Wang

et al. 2020

IEEE Signal Process. Lett.

Self Cite

View full text Add to dashboard Cite

show abstract

“…The module predicts the class of the frame. The frame-level score ( ŷ( f , c)) is calculated as shown in Equation (3).…”

Section: Gaze-driven Object Recognition Cnnmentioning

confidence: 99%

“…To overcome the limitations of traditional control solely based on the electromyographic (EMG) activity of the remaining muscles, promising alternatives consider hybrid systems combining noninvasive motion capture and vision control [1,2]. They include camera vision modules that allow for recognition of the subject's intention to grasp an object and assist visual control of prosthetic arms for object reaching and grasping [3].…”

Section: Introduction and State-of-the Artmentioning

confidence: 99%

Hybrid FPGA–CPU-Based Architecture for Object Recognition in Visual Servoing of Arm Prosthesis

et al. 2022

View full text Add to dashboard Cite

The present paper proposes an implementation of a hybrid hardware–software system for the visual servoing of prosthetic arms. We focus on the most critical vision analysis part of the system. The prosthetic system comprises a glass-worn eye tracker and a video camera, and the task is to recognize the object to grasp. The lightweight architecture for gaze-driven object recognition has to be implemented as a wearable device with low power consumption (less than 5.6 W). The algorithmic chain comprises gaze fixations estimation and filtering, generation of candidates, and recognition, with two backbone convolutional neural networks (CNN). The time-consuming parts of the system, such as SIFT (Scale Invariant Feature Transform) detector and the backbone CNN feature extractor, are implemented in FPGA, and a new reduction layer is introduced in the object-recognition CNN to reduce the computational burden. The proposed implementation is compatible with the real-time control of the prosthetic arm.

show abstract

“…2) Dataset: The HANDS dataset [19] is a collection of images from graspable objects used in daily life including office supplies, utensils and complex-shaped objects like toys, from the hand camera perspective and different orientations. The labels are probabilistic as opposed to the common onehot encoding because of the feasibility of grasping objects in multiple ways with different preferences.…”

Section: B Visual Classifiermentioning

confidence: 99%

NetCut: Real-Time DNN Inference Using Layer Removal

Zandigohar,

Erdogmus,

Schirner

2021

Preprint

Self Cite

View full text Add to dashboard Cite

Deep Learning plays a significant role in assisting humans in many aspects of their lives. As these networks tend to get deeper over time, they extract more features to increase accuracy at the cost of additional inference latency. This accuracyperformance trade-off makes it more challenging for Embedded Systems, as resource-constrained processors with strict deadlines, to deploy them efficiently. This can lead to selection of networks that can prematurely meet a specified deadline with excess slack time that could have potentially contributed to increased accuracy.In this work, we propose: (i) the concept of layer removal as a means of constructing TRimmed Networks (TRNs) that are based on removing problem-specific features of a pretrained network used in transfer learning, and (ii) NetCut, a methodology based on an empirical or an analytical latency estimator, which only proposes and retrains TRNs that can meet the application's deadline, hence reducing the exploration time significantly.We demonstrate that TRNs can expand the Pareto frontier that trades off latency and accuracy to provide networks that can meet arbitrary deadlines with potential accuracy improvement over off-the-shelf networks. Our experimental results show that such utilization of TRNs, while transferring to a simpler dataset, in combination with NetCut, can lead to the proposal of networks that can achieve relative accuracy improvement of up to 10.43% among existing off-the-shelf neural architectures while meeting a specific deadline, and 27x speedup in exploration time.

show abstract

HANDS: a multimodal dataset for modeling toward human grasp intent inference in prosthetic hands

Cited by 18 publications

References 25 publications

Disentangled Adversarial Autoencoder for Subject-Invariant Physiological Feature Extraction

Disentangled Adversarial Autoencoder for Subject-Invariant Physiological Feature Extraction

Hybrid FPGA–CPU-Based Architecture for Object Recognition in Visual Servoing of Arm Prosthesis

NetCut: Real-Time DNN Inference Using Layer Removal

Contact Info

Product

Resources

About