As luck would have its recent innovations in computer vision grant us to make considerable pace abate the data classification and analysis of enormous data in different organizations like training material, policy guides, and project documents that can be used internally. In addition, cloud service providers are rising in text detection techniques and offer many computer vision contributions included Google Vision, A WS Textract, and Azure OCR. In this paper image classification is achieved by CNN algorithm which is the top choice for image classification. Then using CNN, we extract text data from images. CNN has better performance rate on large datasets as it controls the problem of overfitting. Hence, the accuracy of these algorithms can be enhanced by increasing the epochs and integrating a large dataset. This approach used a methodology to resolve the number of convolution and pooling layers with the number of nodes in network. Lastly, CNN algorithms are better when data is appropriate if it has mortal features that can be analyzed and utilized by algorithm.