Developing an effective feature representation for face recognition in an un-constrained environment is still a challenging research field. However, in that environment, face images suffer from several issues that could affect the achieved results such as facial expression, occlusion, low resolution, noise, illumination and pose variation. In this paper, an end-to-end multi-input deep convolu-tional neural network architecture was designed for face recognition. The newly designed CNN structure can take full advantage of multi-modal 2D/3D features extracted from face such as: 3DMM-Mesh LBP image, LBPC image, and the face image itself. Through experiments conducted on the LFW, YTF, Multi-PIE, and Bosphorus databases, the obtained results showed a significant improvement compared to traditional state-of-the-art methods which demonstrated the strength of introducing a multi-input convolutional neural network for face recognition in uncontrolled environments.