Background Prior studies evaluating health-related quality of life (HR-QoL) outcomes in patients undergoing surgery for benign non-toxic goiter have used different instruments and time points, leading to conflicting results. We sought to systematically review the differences in HR-QoL among patients with BNTG at baseline and 6 months after surgery, using exclusively the ThyPRO questionnaire. Methods A systematic search was performed using PubMed, Embase, Web of Science, Cochrane Central, CINAHL, and PsycINFO for papers reporting the assessment of HR-QoL utilizing ThyPRO. Data were meta-analyzed using a random-effects model, and pooled estimates were calculated using weighted mean differences (WMD) between baseline and 6 months after surgery. We assessed the quality and risk of bias of the studies using the Robins-I tool and previously published minimally important change (MIC) values to assess clinical significance. Results Six papers met the predefined inclusion criteria, describing a total of 496 patients. Meta-analysis demonstrated improved QoL in all thirteen domains of ThyPRO six months post-surgical intervention compared to baseline. Specifically, the largest improvement in QoL was seen in the domains of overall QoL, WMD -25.84 (95% CI -29.70, -21.98, p \ 0.001, I 2 = 23%), goiter symptoms, 23.96 (95% CI -30.29, -17.64, p \ 0.001, I 2 = 91%), and tiredness, -16.20 (95% , p \ 0.001, I 2 = 3%). The differences in scores 9 of 13 domains were clinically significant based on MIC. Conclusions Disease-specific HR-QoL improved in all ThyPRO domains after surgery in patients with BNTG. Future studies of QoL in thyroid surgery patients will benefit from a standard questionnaire and improved reporting of covariates including complications to ensure comparability across studies.