Traditional fluid–structure interaction (FSI) simulation is computationally demanding, especially for bi-directional FSI problems. To address this, a masked deep neural network (MDNN) is developed to quickly and accurately predict the unsteady flow field. By integrating the MDNN with a structural dynamic solver, an FSI system is proposed to perform simulation of a flexible vertical plate oscillation in fluid with large deformation. The results show that both the flow field prediction and structure response are consistent with the traditional FSI system. Furthermore, the masked method is highly effective in mitigating error accumulation during temporal flow field predictions, making it applicable to various deformation problems. Notably, the proposed model reduces the computational time to a millisecond scale for each step regarding the fluid part, resulting in an increase in nearly two orders of magnitude in computational speed, which greatly enhances the computational speed of the FSI system.