This study focuses on improving short-term power load forecasting, a critical aspect of power system planning, control, and operation, especially within the context of China’s "dual-carbon" policy. The integration of renewable energy under this policy has introduced complexities such as nonlinearity and instability. To enhance forecasting accuracy, the VMD-SE-BiSATCN prediction model is proposed. This model improves computational efficiency and reduces prediction errors by analyzing and reconstructing sequence component complexity using sample entropy (SE) following variational mode decomposition (VMD). Additionally, a self-attention mechanism is integrated into the temporal convolutional network (TCN) to overcome the traditional TCN’s limitations in capturing long-term dependencies. The model was evaluated using data from the China Ninth Electrical Attribute Modeling Competition and validated with real-world data from a specific county in Shijiazhuang City, Hebei Province, China. Results indicate that the VMD-SE-BiSATCN model outperforms other models, achieving a mean absolute error (MAE) of 92.87, a root mean square error (RMSE) of 126.906, and a mean absolute percentage error (MAPE) of 0.81%.