Humans are strategic animals. We constantly make prospective choices, allocating limited resources in situations of uncertain, future outcomes. The management of our finite monthly budget, financial investments, or the allocation of time to the different questions in an exam are just a few examples. In these scenarios, both decision-making and resource allocation tend to fluctuate over time even under invariable set of constraints. However, it is unclear whether these fluctuations affect performance and whether they underlie additional objectives beyond pure reward maximisation. We address these questions using the breadth-depth dilemma, a novel ecological protocol where participants engage in sequential multiple-choice scenarios characterised by limited capacity. We designed two experimental environments. In one environment, optimal performance, formalised with an ideal allocator model, is associated with homogeneous resource allocation across consecutive choices. In contrast, the other environment entails that fluctuating resource allocation leads to greater expected rewards. Our study evaluates participants' adherence to these scenarios and measures fluctuations as deviation from homogeneous allocations. The results revealed that participants’ behaviour fluctuates more than optimal, but critically, behavioural fluctuations adapt to the available capacity and the environmental context. Moreover, our findings unveil pronounced sequential strategies, such as save-for-later and reward history-dependent choice, further implying that these strategies contribute to decision variability. An extension of the optimal allocator model showed that the characteristic excess fluctuation is driven by entropy seeking, the pursuit of information-gain and risk avoidance. Although having a modest impact on performance, these strategies may reflect advantageous behaviours in the long run under ever changing real-world environments.