The digital transformation of enterprises is driving the need for digital identities that recognize people in order to deliver services to them. Implementing digital identity is complex, requiring several technological solutions and considerable coordination. Capturing and processing biometric data is challenging because imaging errors can cause biometric issues. This article addresses this issue and proposes a computer vision-based framework for a contactless recognition process, developed using a focus group discussion approach to gather input from experts. The proposed framework enhances the image-capturing process, the extraction of high-quality features from captured images, image processing, contactless face detection, and authentication. The study also derives lessons for other biometric identity projects based on image analysis. The proposed framework can serve as a reference for understanding the multidimensional perspectives, scalability, and adoption of technological solutions in similar future projects in developing countries.