Geographic Information System (GIS) applications now increasingly make use of geo-located multimedia data such as images and videos. Furthermore, the widespread use of smartphones (and increasingly tablets) and the rapid improvement of hardware has enabled the acquisition of high-definition user-generated videos that are annotated with geo-properties. The sensor meta-data (e.g., GPS and digital compass values), which associate a continuous stream of location and viewing direction information with the collected videos, are considerably smaller in size than the visual content and are helpful in effectively and efficiently manage and search through large repositories of videos. In this paper, we proposed the Double-Bounding R-tree (DBR-tree), which utilizes the inner-bounding rectangle and outer bounding rectangle to manage the camera's field of view. Experimental results show that this structure can prune the unnecessary overlap calculation and improve the spatial query performance.