As advances in fabrication technology continue and the power density of microelectronic devices increases, efficient and highly integrated through-silicon via (TSV) inductors have emerged as a viable solution for addressing challenges in power and thermal stress management in on-chip direct currentto-direct current (DC/DC) converters. This study proposes a method for modeling and optimizing TSV inductors with magnetic cores based on thermal stress and lumped circuit. CoZrTa is selected as the magnetic core material, and RLCG circuit modeling is performed for multiple factors, including resistance, inductance value, and parasitic capacitance. To validate the accuracy of the model, simulation tests were conducted using ANSYS HFSS and SPICE tools. The results show that under low-frequency conditions (< 2GHz), the error between the model and the 3D full-wave simulation for L and Q parameters is less than 3.3%, proving the model's accuracy. Furthermore, a deep reinforcement learning optimization design model based on the deep deterministic policy gradients (DDPG) algorithm is proposed, which comprehensively considers factors such as current ripple, power loss, parasitic parameters, and thermal stress. With almost no change in electrical performance, the thermal stress in the optimized TSV inductor array decreased by 11.91% and its distribution was improved, reducing the likelihood of fatigue damage.