Automatic treatment planning of radiation therapy (RT) is desired to ensure plan quality, improve planning efficiency, and reduce human errors. We have proposed an Intelligent Automatic Treatment Planning framework with a virtual treatment planner (VTP), an artificial intelligence robot built using deep reinforcement learning (DRL), autonomously operating a treatment planning system (TPS). This study extends our previous successes in relatively simple prostate cancer RT planning to head-and-neck (H&N) cancer, a more challenging context even for human planners due to multiple prescription levels, proximity of targets to critical organs, and tight dosimetric constraints.

We integrated VTP with a real clinical TPS to establish a fully automated planning workflow guided by VTP. This integration allowed direct model training and evaluation using the clinical TPS. We designed the VTP network structure to approach the decision-making process in RT planning in a hierarchical manner that mirrors human planners. The VTP network was trained via the Q-learning framework. To assess the effectiveness of VTP, we conducted a prospective evaluation in the 2023 Planning Challenge organized by the American Association of Medical Dosimetrists (AAMD). We extended our evaluation to include 20 clinical H&N cancer patients, comparing the plans generated by VTP against their clinical plans.

In the prospective evaluation for the AAMD Planning Challenge, VTP achieved a plan score of 139.08 in the initial phase evaluating plan quality, and 15 min of planning time with the first place ranking in the adaptive phase competing for planning efficiency while meeting all plan quality requirements. For clinical cases, VTP-generated plans achieved an average VTP score of 125.33±11.12, which outperformed the corresponding clinical plans with an average score of117.76±13.56.

We successfully integrated VTP with the clinical TPS to achieve a fully automated treatment planning workflow. The compelling performance of VTP demonstrated its potential in automating H&N RT planning.