In this paper, we present a strategy of Arabic words recognition by combining two levels which are based on global and analytical approaches according to the topological properties of Arabic handwriting. In the first level (global), we consider the visual indices which can be generated by: diacritics and strokes (denoted tracing) that form the main shapes of the word. Each word is described as a sequence of visual indices which is treated by a "global" classifier based on Hidden Markov Model (HMM). In the second level, the word is segmented into graphemes, then each grapheme is transformed into a HMM observation by a vector quantization phase. An analytical HMM is developed in order to manage the observation sequences. At this level the diacritics are not taken in consideration which allows to reduce the number of estimated character models. Finally we combine the two approaches to decide on the class of an unknown word. In fact, the global model serves as a filter. It produces a set of hypotheses to the analytical model, which in turns, defines and outputs the final decision.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.