The advancement of smartphones has been a prerequisite for multi-sensor fusion-based indoor mapping due to their high penetration and low cost. This paper constructs a sparse single-track semantic map by fusing semantic landmarks, Wi-Fi landmarks, and waypoints, which are scene-representable and navigational in unknown indoor environments. The pedestrian dead reckoning (PDR)-aided visual-inertial simultaneous localization and mapping (PDR-aided VI-SLAM) method uses the PDR velocity as an external observation to constrain the preintegration of inertial measurement unit (IMU) measurements. The semantic objects detected by the YOLO V4 are prefiltered by the proposed semantic object filtering algorithm before semantics matching. The centroid of feature points clustered by the density-based spatial clustering of applications with noise (DBSCAN) algorithm characterizes the location of a semantic landmark relative to the waypoints estimated by the PDR-aided VI-SLAM. To enhance the stability of Wi-Fi landmarks, Wi-Fi signals are processed by the proposed sliding window-based Wi-Fi fusion algorithm. By using waypoints as an intermediate quantity, the association relationships between waypoints, semantic landmarks, and Wi-Fi landmarks are established. After that, a lightweight single-track semantic map is constructed. A landmark matching-based localization method is proposed to evaluate the similarity between the local map and the constructed scene map to infer the location of a pedestrian in the scene map. Experiments are conducted in the office building and mall scenes under various illumination conditions and semantic density. Results demonstrate the high quality of the single-track semantic map and high-precision localization of a local map in the constructed scene map.
The advancement of smartphones with multiple built-in sensors facilitates the development of crowdsourcing-based indoor map construction and localization. This paper proposes a crowdsourcing-based indoor semantic map construction and localization method using graph optimization. Using waypoints, semantic landmarks, and Wi-Fi landmarks as nodes and the relevance between waypoints and landmarks (i.e., waypoint–waypoint, waypoint–semantic, waypoint–Wi-Fi, semantic–semantic, and Wi-Fi–Wi-Fi) as edges, the optimization graph is constructed. Initializing the venue map is the single-track semantic map with the highest quality, as determined by a proposed map quality evaluation function. The aligned venue and candidate maps are optimized while satisfying the constraints, with the candidate map exhibiting the highest degree of similarity to the venue map. The lightweight venue map is then updated in terms of waypoint and landmark attributes, as well as the relationship between waypoints and landmarks. To determine a pedestrian’s location on a venue map, similarities between a local map and a venue map are evaluated. Experiments conducted in an office building and shopping mall scenes demonstrate that crowdsourcing-based venue maps are superior to single-track semantic maps. Additionally, the landmark matching-based localization method can achieve a mean localization error of less than 0.5 m on the venue map, compared to 0.6 m in a single-track semantic map.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.