Low-back musculoskeletal disorders (MSDs) are major cause of work-related injury among workers in manual material handling (MMH). Epidemiology studies show that excessive repetition is one of major risk factors of low-back MSDs. Thus, it is essential to monitor the frequency of lifting tasks for an ergonomics intervention. In the current field practice, safety practitioners need to manually observe workers to identify their lifting frequency, which is time consuming and labor intensive. In this study, we propose a method that can recognize lifting actions from videos using computer vision and deep neural networks. An open-source package OpenPose was first adopted to detect bony landmarks of human body in real time. Interpolation and scaling techniques were then applied to prevent missing points and offset different recording environments. Spatial and temporal kinematic features of human motion were then derived. These features were fed into long short-term memory networks for lifting action recognition. The results show that the F1-score of the lifting action recognition is 0.88. The proposed method has potential to monitor lifting frequency in an automated way and thus could lead to a more practical ergonomics intervention.