Techniques for the fusion of real-world videos with virtual scenes are key to the augmentation of three-dimensional (3-D) virtual geographic scenes, which greatly enhances the immersive visual experience. When a 3-D scene is updated dynamically, the existing video projection-based method for real-virtual fusion is generally slow and inefficient, as all rendered objects must be traversed in the new scene to identify the objects to be fused in the user's new field of view (FOV). To address this issue, a fast, topology-accounting method for multi-video fusion with 3-D geographic information system (GIS) scenes is proposed. First, the topological models for video object and rendered object are constructed, respectively. Second, by using the topological models, a method that considering topological relationships is proposed to realize rapid identification of rendered objects during the dynamic update of 3-D scenes. Finally, real video and 3-D scene data in Tengzhou City were used to validate the method proposed in this paper. The experiments demonstrated that the method is fast and efficient in the fusion of videos with 3-D GIS scenes, and the computational cost of the proposed method is significantly lower than that of the current method. The proposed method is highly viable and robust, facilitating the fusion of videos with virtual environments.