In this paper, a novel framework for acceleration of 3D model-based, markerless visual tracking in multi-camera videos is proposed. The objective function being the most computationally demanding part of model-based 3D motion reconstruction is calculated on a GPU. The proposed framework effectively utilizes the rendering power of OpenGL to render the 3D models in the predicted poses, whereas the CUDA threads are used to match such rendered models with the image observations and to perform particle swarm optimization-based tracking. We demonstrate effective parallelization of the particle swarm optimization on GPU. Execution of time-consuming parts of the algorithm on GPU using CUDA-OpenGL significantly accelerates the 3D motion reconstruction, making our method capable of tracking full-body movements with a maximum speed of 15 fps. Qualitative and quantitative experimental results on various four-camera benchmark datasets demonstrate the efficiency and accuracy of our method for real-time motion tracking.
This paper describes how to achieve real-time tracking of 3D human motion using multiview images and graphics processing unit (GPU)-accelerated particle swarm optimization. The tracking involves configuring the 3D human model in the pose described by each particle and then rasterizing it in each 2D plane. The Compute Unified Device Architecture threads rasterize the columns of the triangles and perform the summing of the fitness values of pixels belonging to the processed columns. Such a parallel particle swarm optimization (PSO) exhibits the level of parallelism that allows us to effectively utilize the GPU resources. Image acquisition and image processing are multithreaded and run on CPU in parallel with PSO-based searching for the best pose. Owing to such task decomposition, the tracking of the full human body can be performed at rates of 12 frames per second. For a PSO consisting of 1000 particles and executing 10 iterations, the GPU achieves an average speedup of 12 over the CPU. Using marker-less motion capture system consisting of four calibrated and synchronized cameras, the efficiency comparisons were conducted on four CPU cores and four GTX GPUs on two cards. 1552 B. RYMUT AND B. KWOLEK icant speedups in comparison with homogeneous multicore systems in the same price range. These papers initiated a passionate debate on the limits of GPU acceleration for various classes of applications [2]. A comparison of 14 various implementations showed speedups from 0:5 to 15 (GPU over CPU). There is a common agreement that in order to achieve such speedups, the algorithms to be executed on GPU should be carefully designed. Even though it is easier to program GPUs than ever, efficiently taking advantages of GPU resources still requires unique techniques.The CPUs are still the most frequently used hardware for image processing, given their versatility and tremendous speed. On the other hand, image processing algorithms are good candidates for GPU implementation, because many image processing operations have high inherent parallelism, which is frequently achieved through per-pixel operations. Many research reports confirmed this by showing GPU acceleration of many image processing algorithms [3]. A recent study [1] reports a 30-fold speedup for low-level algorithms and up to 10-fold speedup for high-level functions, which contain more overhead and many steps that are not easy to parallelize.Non-intrusive human pose tracking is a key issue in computer vision because of many possible applications in surveillance, human activity recognition, virtual reality, motion capture, and so on. Because of variations in individual body shapes, this is one of the most challenging problems in computer vision being at the same time one of the most computationally demanding tasks. Particle filtering is widely used in articulated motion tracking because of its ability to represent multimodal probability distributions and to maintain multiple pose hypotheses. Several improvements of ordinary particle filter (PF) were proposed to achieve f...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.