A prototype document image analysis system for technical journals

Nagy, George; Seth, Sharad; Viswanathan, Mahesh

doi:10.1109/2.144436

Cited by 314 publications

(139 citation statements)

References 1 publication

Supporting

Mentioning

134

Contrasting

Unclassified

Order By: Relevance

“…Studies and research for document image analysis systems have been reported [23]- [31]. As related works to ruled line extraction, the detection methods using the Hough transform technique are reported by literature [23]- [27].…”

Section: Related Workmentioning

confidence: 99%

Document Image Processing for Hospital Information Systems

Kawanaka¹,

Yamamoto²,

Takase³

et al. 2012

Modern Information Systems

View full text Add to dashboard Cite

Section: Related Workmentioning

confidence: 99%

Document Image Processing for Hospital Information Systems

Kawanaka¹,

Yamamoto²,

Takase³

et al. 2012

Modern Information Systems

View full text Add to dashboard Cite

“…Numerous methods using one of these strategies have been proposed for the analysis of machine printed documents. Among the most popular we can cite Kise's method [13] based on area Voronoi diagram, O'Gorman's Docstrum method [14] based on neighbor clustering and Nagy's X-Y cut [15] based on the analysis of projection profiles. These methods provide good results on printed documents, but are not directly adapted to handwritten documents, because they generally take only into account global features of the page, and are thus dedicated to well structured documents.…”

Section: General Problem Of Document Analysismentioning

confidence: 99%

Enriching Historical Manuscripts: The Bovary Project

Nicolas¹,

Paquet²

2004

Document Analysis Systems VI

View full text Add to dashboard Cite

Abstract. In this paper we describe the Bovary Project, a manuscripts digitization project of the famous French writer Gustave FLAUBERT first great work, which should end in 2006 by providing an online access to an hypertextual edition of "Madame Bovary" drafts set. We first develop the global context of this project, the main objectives, and then focus particularly on the document analysis problem. Finally we propose a new approach for the segmentation of handwritten documents.

show abstract

“…In addition, we note that knowledge used in top-down approaches is typically derived from the relations between the geometric and the logical structures of specific classes of documents. This is the case of page grammars (Nagy et al 1992) and geometric trees (Dengel and Barth 1988), which are used to segment document images and simultaneously associate some layout components with the logical structure. In WISDOM++ this class-specific knowledge is solely required in the document classification and understanding steps and it is automatically learned from examples of documents, as explained in the next section.…”

Section: Knowledge-based Detection Of the Layout Structurementioning

confidence: 99%

“…Typically such rules are handcoded for particular classes of documents (Nagy et al 1992), requiring fine-tuning and great human effort. In WISDOM++ rules are automatically generated by means of machine learning algorithms that induce them from a set of training examples, for which the final user has already defined the correct class and has specified the layout components with a logical meaning (logical components) (Esposito et al 1999).…”

Section: Document Classification and Understandingmentioning

confidence: 99%