When faced with navigating back somewhere we have been before we might either retrace our steps or seek a shorter path. Both choices have costs. In this study, we ask whether it is possible to characterize formally the choice of navigational plans in terms of a bounded rational process that trades off the quality of the plan (e.g., its length) and the cognitive cost required to find and implement it. We analyze the navigation strategies of two groups of people that are firstly trained to follow a "default policy" taking a route in a virtual maze and then asked to navigate to various known goal destinations, either in the way they want ("Go To Goal") or by taking novel shortcuts ("Take Shortcut"). We address these wayfinding problems using InfoRL: an information-theoretic approach that formalizes the cognitive cost of devising a navigational plan, as the informational cost to deviate from a well-learned route (the "default policy"). In InfoRL, optimality does not refer to finding the shortest route, but to the best trade-off between route length and the amount of control information required to find it. We report four main findings. First, the navigational strategies automatically identified by InfoRL correspond closely to different routes (optimal or suboptimal) in the virtual reality map, which were annotated by hand in previous research. Second, people deliberate more in places where the value of investing cognitive resources (i.e., relevant goal information) is greater. Third, the group of people who receive the "Take Shortcut" find shorter solutions compared to those who receive the "Go To Goal" instruction. However, while the former appear to be naively more efficient if one only considers plan length, our bounded rational approach reveals that they invest more cognitive resources and find overall less optimal solutions, reflecting the intrinsic difficulty of finding optimal shortcuts. While prior analysis indicates greater male optimality in short-cut finding, we find that both male and female participants similarly modulate their cognitive resources in a task-dependent way, as formally expressed through our information-theoretic approach. Finally, we found a surprising amount of variability in the choice of navigational strategies and resource investment across trials. Participants do not simply adopt one strategy and maintain it. Taken together, these results illustrate the benefits of using InfoRL to address navigational planning problems from a bounded rational perspective.