To achieve fast imaging and large field of view (FOV), we improved our multimodal imaging system, which integrated optical resolution photoacoustic microscopy, optical coherence tomography (OCT), and confocal fluorescence microscopy in one platform, by combining optical scanning with mechanical scanning. To ensure good focusing of the objective lens over all the imaged area, we employed OCT-guided dynamic focusing. Different from our previous point-by-point dynamic focusing, we employed an area-by-area focusing adjustment strategy, in which each fast optical scanning area has a fixed focusing depth. We have demonstrated the performance of the system by imaging biological samples ex vivo (plant leaf) and in vivo (mouse ear). The system has achieved uniform resolution in an FOV of 10 mm × 10 mm with an imaging time of about 5 min.