On hierarchical modelling of motion for workflow analysis from overhead view

Arbab-Zavar, Banafshe; Carter, John N.; Nixon, Mark S.

doi:10.1007/s00138-013-0528-7

Cited by 6 publications

(4 citation statements)

References 19 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In the context of manufacturing, a limited number of papers were found, with notable works being [6,7,[33][34][35][36][37] The application of classic machine learning models was predominant in these papers, where models like Hidden Markov Models [6,34,35] or Support Vector Machines [7] were employed following the manual extraction of features. Makantasis et al [33] applied a deep learning model based on a 2D convolutional neural network and multi-layer perceptron, using manually created features with the Motion History Image algorithm.…”

Section: Related Workmentioning

confidence: 99%

“…In paper [37], a system comprising three stages was developed: spatial feature extraction using a Vectors Assembly Graph (VAG) and graph networks from RGB-D video frames; contact force feature extraction via a sliding window technique; and action segmentation through a multi-stage temporal convolution network (MS-TCN) that combines these features. Jiang et al [7] collected data in laboratory conditions, while studies [33][34][35] were conducted using a dataset described in [38], which is no longer publicly available. Rude et al [6] used a dataset from [39], collected with a depth sensor for painting manufactured parts, recording data over a single workday with two different workers.…”

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

Improving Time Study Methods Using Deep Learning-Based Action Segmentation Models

Gudlin,

Hegedić,

Golec

et al. 2024

Applied Sciences

View full text Add to dashboard Cite

In the quest for industrial efficiency, human performance within manufacturing systems remains pivotal. Traditional time study methods, reliant on direct observation and manual video analysis, are increasingly inadequate, given technological advancements. This research explores the automation of time study methods by deploying deep learning models for action segmentation, scrutinizing the efficacy of various architectural strategies. A dataset, featuring nine work activities performed by four subjects on three product types, was collected from a real manufacturing assembly process. Our methodology hinged on a two-step video processing framework, capturing activities from two perspectives: overhead and hand-focused. Through experimentation with 27 distinctive models varying in viewpoint, feature extraction method, and the architecture of the segmentation model, we identified improvements in temporal segmentation precision measured with the F1@IoU metric. Our findings highlight the limitations of basic Transformer models in action segmentation tasks, due to their lack of inductive bias and the limitations of a smaller dataset scale. Conversely, the 1D CNN and biLSTM architectures demonstrated proficiency in temporal data modeling, advocating for architectural adaptability over mere scale. The results contribute to the field by underscoring the interplay between model architecture, feature extraction method, and viewpoint integration in refining time study methodologies.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Improving Time Study Methods Using Deep Learning-Based Action Segmentation Models

Gudlin,

Hegedić,

Golec

et al. 2024

Applied Sciences

View full text Add to dashboard Cite

show abstract

“…The more general problem of workflow monitoring is already being addressed in more constrained industrial environments such as car manufacturing (Voulodimos et al, 2011;Veres et al, 2011;Arbab-Zavar et al, 2014). In 2014, Arbab-Zavar et al (Arbab-Zavar et al, 2014) exploited shape and motion features extracted from an overhead video in order to identify highly structured tasks and activities within a car manufacturing plant. A Markov temporal structure based decision system has been proposed in (Behera et al, 2014) to model spatio-temporal relationships during object manipulations tasks and has been tested for continuous activity recognition in assembling a pump system.…”

Section: Related Workmentioning

confidence: 99%

Modeling Human Motion for Predicting Usage of Hospital Operating Room

Sghir¹,

Shah

2016

Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications

View full text Add to dashboard Cite

In this paper, we present a system that exploits existing video streams from a hospital operating room (OR) to infer OR usage states. We define OR states that are relevant for assessing OR usage efficiency. We adopt a holistic approach that involves the combination of two meaningful human motion features: gestures or upper body movements computed using optical flow and whole body movements computed through motion trajectories. The two features are independently modeled for each of the defined OR usage states and eventually fused to obtain a final decision. Our approach is tested on a large collection of videos and the results show that the combination of both human motion features provide significant discriminative power in understanding usage of an OR.

show abstract

“…State event models were described in [8][9][10]. Bayesian network event model utilizes probability as a mechanism for handling the uncertainty of observations and interpreting existing events in a video.…”

Section: Introductionmentioning

confidence: 99%

Joint Abnormal Blob Detection and Localization Under Complex Scenes

Lai

Snoussi

2015

Intelligent Computing Theories and Methodologies

View full text Add to dashboard Cite

Abstract. In this paper, an algorithm is proposed to detect the abnormal event in the form of rectangular blob in global images. Observing the status of the varying blobs, unusual behavior can be monitored and alarmed. A method extracting blobs from crowded video scenes is proposed, the covariance matrix descriptor fuses the image intensity and the optical flow to encode moving information and image characteristics of a blob. After characterizing normal behaviors of blobs or frames in a learning period, the nonlinear one-class SVM algorithm locates the abnormal blobs intra frame. The method is applied to detect abnormal events on several video surveillance datasets, and get promising results.

show abstract

On hierarchical modelling of motion for workflow analysis from overhead view

Cited by 6 publications

References 19 publications

Improving Time Study Methods Using Deep Learning-Based Action Segmentation Models

Improving Time Study Methods Using Deep Learning-Based Action Segmentation Models

Modeling Human Motion for Predicting Usage of Hospital Operating Room

Joint Abnormal Blob Detection and Localization Under Complex Scenes

Contact Info

Product

Resources

About