Five-axis flank milling is prevalent in complex surfaces manufacturing, and it typically consumes high electricity energy. To save energy and improve energy efficiency, this paper proposes a tool path optimisation of five-axis flank milling by meta-reinforcement learning. Firstly, considering flank milling features, a feed angle is defined that guides tool spatial motion and identifies an ideal principal path. Then, machining energy consumption and time are modelled by tool path variables, i.e., feed angle, cutting strip width and path length. Secondly, an energy-efficient tool path dynamic optimisation model is constructed, which is then described by multiple Markov Decision Processes (MDPs). Thirdly, meta-learning integrating with the Soft Actor-Critic (MSAC) framework is utilised to address the MDPs. In an MDP with one principal path randomly generated by a feed angle, cutting strip width is dynamically optimised under a maximum scallop height limit to realise energy-efficient multi-expansions. By quick traversal of MDPs with various feed angles, MSAC enables an energy-efficient path generation and expansion integrated scheme. Experiments show that, regarding machining energy consumption and time, the proposed method achieves a reduction of 69.96% and 68.44% over the end milling with an iso-scallop height, and of 41.50% and 39.80% over the flank milling with an iso-scallop height, with a minimum amount of machining carbon emission, which highlights its contribution to the arena of energy-oriented and sustainable intelligent manufacturing.