“…The system consists of four main steps: 1) Object detection, 2) Region proposal, 3) Activity recognition, and 4) generation of Activity timelines. The first step is based on a deep learning system to detect objects such as bag-mask ventilator, suction devices, and health care hands [86]. Regions around these objects are proposed and used in a new network to recognize stimulation, ventilation, suction, and if the newborn is covered or uncovered [87].…”