This paper reports a new address interpretation system which allows both non-use of postal codes and omission of address elements, such as the omission of county or state when a city name has been given. While lexicon-driven recognizers show good recognition performance when they are input with adequate word images and a lexicon containing correct word strings, it is difficult to design one which would be of practical use when postal codes are not in use and elements of addresses may have been omitted. That is to say, an inadequate design in this area is likely to result in an impractically high erroneous recognition rate. In response to this problem, we propose here an advanced address interpretation system that utilizes both an improved address interpretation method and improved word recognition methods. The improved address interpretation method has been designed to satisfy as completely as possible the need to accommodate non-use of postal codes and the omission of address elements, while the improved word recognition methods have been designed to achieve low erroneous recognition rates in cases in which that need has not been fully satisfied. When we applied our new system to approximately 2000 actual address images for which that need would be relevant, we achieved a ¾± rate of correct outward sorting with only a ¼ ± rate of erroneous outward sorting. These rates are good enough for practical applications.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.