Proceedings of the 2003 ACM Symposium on Document Engineering - DocEng '03 2003
DOI: 10.1145/958232.958233
|View full text |Cite
|
Sign up to set email alerts
|

Creating reusable well-structured PDF as a sequence of component object graphic (COG) elements

Abstract: Portable Document Format (PDF) is a page-oriented, graphically rich format based on PostScript semantics and it is also the format interpreted by the Adobe Acrobat viewers. Although each of the pages in a PDF document is an independent graphic object this property does not necessarily extend to the components (headings, diagrams, paragraphs etc.) within a page. This, in turn, makes the manipulation and extraction of graphic objects on a PDF page into a very difficult and uncertain process.The work described he… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
25
0

Year Published

2004
2004
2011
2011

Publication Types

Select...
5
3

Relationship

4
4

Authors

Journals

citations
Cited by 12 publications
(25 citation statements)
references
References 0 publications
0
25
0
Order By: Relevance
“…Our sample implementation is built around our existing work in PDF and Component-Object Graphics (COGs) [1], but there is no reason why it could not be implemented in any other format capable of tightly specifying page imaging operations. It builds on existing software, principally pdfdit, in conjunction with COG Manipulator, as these tools are already capable of producing modular documents with tightly specified rendering.…”
Section: A Sample Implementationmentioning
confidence: 99%
“…Our sample implementation is built around our existing work in PDF and Component-Object Graphics (COGs) [1], but there is no reason why it could not be implemented in any other format capable of tightly specifying page imaging operations. It builds on existing software, principally pdfdit, in conjunction with COG Manipulator, as these tools are already capable of producing modular documents with tightly specified rendering.…”
Section: A Sample Implementationmentioning
confidence: 99%
“…Document Engineering research relative to document representations to scanned images includes algorithms for image thresholding [33], and relative to PDF includes approaches for the creation of reusable well-structured PDF documents [1]. Arguing that the most important document representation is the Extensible Markup Language (XML), Munson remarks its widespread use on the Web, in business as well as in non-document applications [22].…”
Section: Document Engineering: Reviewing a Few Abstractmentioning
confidence: 99%
“…n PDF PDF is the descendent of Postscript and is oriented toward presentation. Though recent research has experimented with new, more versatile image models (Bagley, Brailsford, and Hardy 2003), these have not yet been incorporated in the commercial format, which is still basically Postscript. PDF is a format of digital dissemination that has replicated paper documents for many years.…”
Section: Disabilities That Affect Reading and Assistive Technologiesmentioning
confidence: 99%