At present scenario, there is growing demand for the software system to recognize characters in a computer system when information is scanned through paper documents. This paper presents detailed review in the field of Optical Character Recognition. Various techniques are determined that have been proposed to realize the center of character recognition in an optical character recognition system. OCR (Optical Character Recognition) translates images of typewritten or handwritten characters into the electronically editable format and it preserves font properties. Different techniques for preprocessing and segmentation have been surveyed and discussed in this paper.
General TermsPattern Matching.
Association rule mining and classification are two major task of data mining. They are attracted wide attention in both research and application area recently. I propose a method for classification rules from multi-label dataset using association rule analysis. Multi label dataset contains multiple class label attribute for predict target variable. We classify that attribute using different approaches like naviye-baies, decision tree, Back propagation, Neural based classification and association rule based classification. Finding association rule from dataset we have to apply various algorithms like Apriori, Fp-Growth, etc. I proposed Fp-Growth algorithm for finding association rule from dataset because of Fp-Growth is an improved algorithm of Apriori and Fp-Growth is more efficient than Apriori. The number of associations present in even moderate sized databases can be, however, very large-usually too large to be applied directly for classification purposes. Therefore, any classification learner using association rules has to perform three major steps: Mining a set of potentially accurate rules, evaluating and pruning rules, and classifying future instances using the found rule set. Implementation of improved Fp-Growth algorithm gives accurate and classify rule. This approach is more effective, accurate and efficient than other tradition algorithms.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.