By introducing an interactive 3D photo/video browsing and exploration system, we propose novel approaches to the limitations of current 2D mobile technology from two perspectives: interaction design and visualization. Our contributions feature an effective interaction that takes place in the 3D space behind the mobile device's camera. The device's camera captures the user's gestures, and 3D motion analysis of these gestures facilitates the interaction between users and multimedia collections in various applications. This approach addresses a wide range of problems with current input facilities such as miniature keyboards, tiny joysticks and 2D touchscreens. The suggested interactive technology enables users to control, manipulate, organize, and re-arrange their photo/video collections in 3D space using bare-hand, marker-less gestures. Moreover, with the proposed techniques we aim to visualize 2D photo collections in 3D on normal 2D displays. This process is performed automatically by retrieving the 3D structure from single images, by finding stereo/multiple views of a scene, or by using the geo-tagged metadata of huge photo collections. Through the design and implementation of the contributions of this work, we aim to achieve the following goals: overcoming the limitations of current 2D interaction facilities through 3D gestural interaction; increasing the usability of multimedia applications on mobile devices; and enhancing the quality of the user experience with digital collections.

Mobile devices have proven extremely useful in the modern world. In addition to ordinary tasks, people use them for various purposes in science, entertainment, education, medical applications, gaming, etc. New mobile devices feature advanced capabilities in computing, communication, data storage and visualization.
Furthermore, embedded sensors such as high-quality cameras, accelerometers, gyroscopes, proximity sensors, compasses and GPS receivers have turned them into powerful portable devices. However, the rapid growth of the mobile device market suggests that in the near future we will face a substantial amount of digital data such as photos, videos and the corresponding metadata obtained from other sensors. The major issue is how to organize this enormous amount of data efficiently and effectively so that it is accessible, searchable and useful. How should users interact with the sea of images and videos, and how should the visualization happen? Currently, people interact with their mobile devices through touchscreen displays. The latest and probably the best available technology, such as that found in the iPhone or iPad, offers single- or multi-touch gestural interaction on 2D touchscreens. This approach is designed to give users a more natural feeling when they operate their mobile devices. Although this technology has solved many limitations in human-mobile device interaction, the recent trend in the digital world reveals that people always prefer to have natural experiences with ...