Face detection and recognition systems have recently achieved encouraging results using deep learning, especially Convolutional Neural Network (CNN). Face Recognition (FR) systems have many challenges in unconstrained environments that decrease the accuracy; for overcoming these challenges, a deep learning-based features combination has been proposed. The scheme performs feature-level combination by applying two pre-trained GoogLeNet and VggNet-16 models as deep feature extractors. First, faces are detected and aligned using the Multi-Task Convolutional Neural Networks (MTCNN) face detector. The deep features are extracted from a face image using each individually pre-trained CNN. Second, features obtained from GoogLeNet and VggNet-16 models are combined using the serial-feature combination method. Finally, a classification task is performed using a multiclass Support Vector Machine (SVM) classifier. Experiments on the following datasets: VggFace2, LFW, Essex, and ORL, indicate the efficacy of the proposed system as the combination of the two pre-trained CNN models improves performance. The combination strategy, in particular, yields an accuracy of 95.33% to 99.29% on all datasets. The proposed system was compared to existing models in terms of the LFW, and ORL datasets, the findings showed that the proposed system outperformed most current models in terms of accuracy.