Accurate dynamic response prediction is a challenging and crucial aspect for the fatigue or ultimate analysis of floating offshore wind turbines (FOWTs), which are increasingly recognized for their potential to harness wind energy in deep-water environments. However, traditional numerical modeling approaches like the finite element method are time-consuming, making them inefficient for generating the extensive datasets required. This paper presents an efficient deep learning-based approach, referred to as the CNN-GRU model, considering multiple external environments. This model integrates convolutional neural networks (CNNs) and gated recurrent units (GRUs), effectively extracting the coupling relationships among various input features and capturing the temporal dependencies to enhance predictive accuracy. The proposed model is applied to two distinct types of FOWTs under three sea states, and the results demonstrate its satisfactory accuracy, with an average correlation coefficient (CC) of 0.9962 and an average coefficient of determination (R²) of 0.9864. The high accuracy across all cases proves the model’s robustness and reliability. Furthermore, the model’s optimal configurations, including memory lengths, sample sizes, and optimizer, are identified through parametric studies. Moreover, the Shapley additive explanations (SHAP) interpretation is utilized to reveal the most significant features influencing structural responses. In addition, a comparative analysis with two other ensemble models, namely random forest and gradient boosting, is conducted. The proposed approach achieves superior accuracy, with computational time approximately half that of the other two models, thereby highlighting its efficiency and effectiveness. The comprehensive framework, which encompasses feature selection, data processing, deep learning model construction, and interpretation, demonstrates significant potential for addressing a broad range of engineering problems through deep learning methodologies.