This paper proposes a low-complexity design for 3D hand tracking, which can provide depth information and is able to work under critical backgrounds. This paper also proposes an effective way to segment hands out of entire image and also facilitates depth estimation of tracked hands in real-time by dualcamera systems. Multithreading and several techniques are applied to reduce computational complexity in proposed design. The final algorithm has been implemented both on PCs (with Intel Core i7 processor) and an embedded system (with ARM Cortex A9 processor). On PCs, it reaches 24 frames per second at VGA video. On the other hand, after reducing image size (i.e. QVGA video), it achieves the performance about 8 frames per second on PandaBoard embedded system.
I.
This paper proposes a low complexity multi-view video encoder which includes mode decision and early termination based on B-frame characteristics. According to the statistics of coding mode distribution in different B-frame types, we classify all the coding modes into several classes and propose an early terminated mode decision algorithm that can largely reduce the computing complexity. On the other hand, MVDbased adaptive search range scheme is also included in the proposed encoding strategy. In our experimental results, the encoding time is saved up to 91% -93% but the quality loss is controlled within 0.1 dB PSNR drop.I.
This paper presents a view scalable multi-view video decoder system that integrates multiple decoder cores into the proposed system to decode multi-view video and achieve parallel decoding with high view scalability. We manage the firmware for video bit-stream partition and design an arbitration mechanism to balance the work load among decoder cores with a 4KB twolevel cache architecture for inter-view/inter-frame prediction data reusing. With such a flexible architecture, the proposed system can reach 1.8 times performance improvement with two decoding cores and 3.5 times with four decoding cores. Based on the proposed system, users only need to adjust the number of decoder cores and set the firmware parameters for different system applications. This feature also benefits to adopting 3D IC packaging or implementation to exploit high bandwidth DRAM access. The proposed view scalable multi-view video decoder system is able to decode multiple-view HD video in real time.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.