Industrial waste heat recovery is attractive because it reduces both energy costs and carbon emissions. In this context, thermal energy storage combined with an optimal operation strategy such as model predictive control (MPC) can realize significant energy savings. However, conventional control methods offer little robustness against uncertainty in daily operation, where the supply and demand of energy in the cluster can deviate significantly from their predicted profiles. Ignoring these uncertainties may lead the system to violate critical constraints that affect the end-product quality of the participating processes. To this end, we present a method for making optimal energy storage and discharge decisions while rigorously handling this uncertainty. We apply multivariate data analysis to historical industrial data to implement a multistage nonlinear MPC scheme based on a scenario-tree formulation, where the economic objective is to minimize energy costs. Principal component analysis (PCA) is used to detect outliers in the industrial heat-profile data and to select appropriate scenarios for building the scenario tree in the multistage MPC formulation. The results show that this data-driven robust MPC approach keeps the system from violating any operating constraints. The solutions obtained are not overly conservative, even in the presence of significant deviations between the predicted and actual heat profiles. This leads to energy-efficient utilization of the storage unit, benefiting all stakeholders involved in heat exchange in the cluster.
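
As a rough illustration of the PCA step, the sketch below screens daily heat profiles for outliers and then picks a small set of scenarios for a scenario tree. The abstract does not state which outlier statistic or selection rule is used, so the choices here are assumptions made only for illustration: Hotelling's T² on the retained scores with an empirical quantile cutoff, and a low, nominal, and high scenario taken along the first principal component. The function names `pca_fit` and `select_scenarios` are hypothetical.

```python
import numpy as np


def pca_fit(X, n_components=2):
    """PCA via SVD on mean-centered data (rows = days, columns = time steps)."""
    mean = X.mean(axis=0)
    Xc = X - mean
    _, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    components = Vt[:n_components].copy()
    # Fix the arbitrary SVD sign so each loading vector has a non-negative sum.
    signs = np.where(components.sum(axis=1) < 0, -1.0, 1.0)
    components = components * signs[:, None]
    variances = S[:n_components] ** 2 / (X.shape[0] - 1)
    scores = Xc @ components.T
    return mean, components, variances, scores


def select_scenarios(X, n_components=2, t2_quantile=0.95):
    """Flag outliers and choose three scenarios (assumed selection rule)."""
    _, _, variances, scores = pca_fit(X, n_components)
    # Hotelling's T^2: squared scores scaled by the variance of each component.
    t2 = np.sum(scores**2 / variances, axis=1)
    # Empirical cutoff (assumption): profiles above this quantile are outliers.
    inliers = t2 <= np.quantile(t2, t2_quantile)
    idx = np.flatnonzero(inliers)
    pc1 = scores[idx, 0]
    low = X[idx[np.argmin(pc1)]]    # inlier with the smallest PC1 score
    high = X[idx[np.argmax(pc1)]]   # inlier with the largest PC1 score
    nominal = X[idx].mean(axis=0)   # average of all inlier profiles
    return {
        "outliers": np.flatnonzero(~inliers),
        "scenarios": np.stack([low, nominal, high]),
    }
```

The three returned profiles could serve as the branches at each node of a scenario tree. How many scenarios to use, and how far the tree branches into the prediction horizon, are design choices this sketch does not address.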