Abstract.Effective text region extraction and binarization of
image embedded text documents on mobile devices having limited computational
resources is an open research problem. In this paper, we present one
such technique for preprocessing images captured with built-in cameras of
handheld devices with an aim of developing an efficient Business Card Reader.
At first, the card image is processed for isolating foreground
components. These foreground components are classified as either text or
non-text using different feature descriptors of texts and images. The
non-text components are removed and the textual ones are binarized with a
fast adaptive algorithm. Specifically, we propose new techniques
(targeted to mobile devices) for (i) foreground component isolation, (ii)
text extraction and (iii) binarization of text regions from camera captured
business card images. Experiments with business card images of various
resolutions show that the present technique yields better accuracy and
involves low computational overhead in comparison with the state-of-the-art.
We achieve optimum text/non-text separation performance with images of
resolution 800×600 pixels with an average recall rate of 93.90% and
a precision rate of 96.84%. It involves a peak memory
consumption of 0.68 MB and processing time of 0.102 seconds on a moderately
powerful notebook, and 4 seconds of processing time on a PDA.