A major problem in form reading applications is that form fields cannot be located exactly because of nonlinear distortions in the form images. Such nonlinear distortions appear, for example, on photocopied forms or on forms transmitted by fax. One way to solve this problem is to determine the form fields from the positions of the form lines. This paper describes a new method for finding pairs of corresponding form lines on a reference form and a filled form. The advantage of this method is that the corresponding line pairs can be used to map any pixel between the filled form and the reference form without any assumption about the kind of distortion. The core of the method is an algorithm based on the A* search algorithm. Given two sets of horizontal or vertical lines, one from the reference form and one from the filled form, the algorithm searches for pairs of corresponding lines. Its runtime remains low, and nonlinear distortions of the form images hardly influence its results. With increasing complexity (i.e., an increasing number of lines or decreasing image quality), the number of rejected form lines grows, but the error rate stays low.
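The abstract does not give the cost function, the search state, or the heuristic, so the following is only a minimal sketch of how an A*-based line matcher might look, not the authors' algorithm. It assumes each line is reduced to a single coordinate (y for horizontal lines, x for vertical ones), that matches must preserve line order, that a match costs the absolute position difference, and that rejecting a line costs a fixed, hypothetical `skip_penalty`. The function name `match_lines` is likewise an illustration.

```python
import heapq


def match_lines(ref, fil, skip_penalty=10.0):
    """Order-preserving matching of two sorted lists of line positions.

    Returns (pairs, cost), where pairs holds (ref_index, fil_index) tuples.
    The cost model is an assumption made for illustration only.
    """
    n, m = len(ref), len(fil)

    def heuristic(i, j):
        # Each match consumes one line from each set, so at least
        # |remaining_ref - remaining_fil| lines must still be rejected.
        # This lower bound keeps the heuristic admissible and consistent.
        return skip_penalty * abs((n - i) - (m - j))

    # Heap entries: (f = g + h, g, i, j, pairs matched so far)
    frontier = [(heuristic(0, 0), 0.0, 0, 0, ())]
    best_g = {}
    while frontier:
        _, g, i, j, pairs = heapq.heappop(frontier)
        if i == n and j == m:
            return list(pairs), g
        if best_g.get((i, j), float("inf")) <= g:
            continue  # state already expanded at equal or lower cost
        best_g[(i, j)] = g

        successors = []
        if i < n and j < m:  # accept ref[i] and fil[j] as a pair
            successors.append(
                (abs(ref[i] - fil[j]), i + 1, j + 1, pairs + ((i, j),))
            )
        if i < n:  # reject the reference line
            successors.append((skip_penalty, i + 1, j, pairs))
        if j < m:  # reject the filled-form line
            successors.append((skip_penalty, i, j + 1, pairs))

        for cost, ni, nj, npairs in successors:
            ng = g + cost
            heapq.heappush(
                frontier, (ng + heuristic(ni, nj), ng, ni, nj, npairs)
            )
    return [], float("inf")


if __name__ == "__main__":
    reference = [100, 200, 300]
    filled = [102, 150, 203, 298]  # 150 is a spurious line
    print(match_lines(reference, filled))
```

In this toy cost model, a larger `skip_penalty` forces more lines into pairs, while a smaller one rejects doubtful lines sooner. That is one way to picture the trade-off the abstract reports between the number of rejected lines and the error rate.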
We present a new tool for gathering textual information according to a query (texts) on arbitrary web sites specified by an information-seeking user. This tool is helpful in any knowledge-intensive area. Its technology is based on the vector space model with an optimized feature definition.
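The abstract does not say what its "optimized feature definition" is, so the sketch below shows only the plain vector space model it builds on: texts become term-frequency vectors and are ranked by cosine similarity to the query. The tokenizer, the raw term-frequency weighting, and the names `term_vector`, `cosine`, and `rank` are assumptions made for illustration, not the tool's actual design.

```python
import math
import re
from collections import Counter


def term_vector(text):
    """Bag-of-words term-frequency vector (assumed feature definition)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank(query, documents):
    """Return (score, document) pairs sorted by descending similarity."""
    q = term_vector(query)
    scored = [(cosine(q, term_vector(d)), d) for d in documents]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


if __name__ == "__main__":
    pages = [
        "Form reading with line matching on scanned forms",
        "Cooking pasta at home",
    ]
    for score, page in rank("matching form lines", pages):
        print(f"{score:.3f}  {page}")
```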