Creating reusable well-structured PDF as a sequence of component object graphic (COG) elements


Portable Document Format (PDF) is a page-oriented, graphically rich format based on PostScript semantics, and it is the format interpreted by the Adobe Acrobat viewers. Although each page in a PDF document is an independent graphic object, this property does not necessarily extend to the components (headings, diagrams, paragraphs, etc.) within a page. This, in turn, makes the manipulation and extraction of graphic objects on a PDF page a difficult and uncertain process. The work described here investigates the advantages of a model wherein PDF pages are created from assemblies of COGs (Component Object Graphics), each with a clearly defined graphic state. The relative positioning of COGs on a PDF page is determined by appropriate 'spacer' objects, and a traversal of the tree of COGs and spacers determines the rendering order. The enhanced revisability of PDF documents within the COG model is discussed, together with the application of the model in contexts that require easy revisability coupled with the ability to maintain and amend PDF document structure.
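
The tree-and-traversal idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the class names `Cog`, `Spacer` and `Group`, the `render` function, and the choice to model a spacer as a plain translation of the origin are all assumptions made for illustration. The only PDF-specific details used are the standard content-stream operators `q`/`Q` (save and restore graphics state) and `cm` (concatenate a transformation matrix), which here give each COG an isolated graphic state.

```python
from dataclasses import dataclass, field
from typing import List, Union


@dataclass
class Cog:
    """A self-contained graphic component (hypothetical representation)."""
    name: str
    content: str  # content-stream operators, drawn in the COG's own coordinates


@dataclass
class Spacer:
    """Shifts the origin for whatever follows it (assumed semantics)."""
    dx: float
    dy: float


@dataclass
class Group:
    """Interior tree node holding COGs, spacers and nested groups."""
    children: List[Union[Cog, Spacer, "Group"]] = field(default_factory=list)


def render(node, origin=(0.0, 0.0), out=None):
    """Depth-first, left-to-right traversal.

    Returns the emitted content-stream lines and the origin after the node.
    """
    if out is None:
        out = []
    x, y = origin
    if isinstance(node, Cog):
        # Wrap each COG in q ... Q so its graphic state cannot leak out.
        out.append(f"q 1 0 0 1 {x:g} {y:g} cm")
        out.append(node.content)
        out.append("Q")
    elif isinstance(node, Spacer):
        x, y = x + node.dx, y + node.dy
    else:
        for child in node.children:
            _, (x, y) = render(child, (x, y), out)
    return out, (x, y)


page = Group([
    Spacer(72, 720),
    Cog("heading", "BT /F1 18 Tf (Title) Tj ET"),
    Spacer(0, -36),
    Group([Cog("paragraph", "BT /F1 10 Tf (Body text) Tj ET")]),
])
lines, _ = render(page)
print("\n".join(lines))
```

Because every COG is bracketed by `q` and `Q` in this sketch, a component can be replaced or removed without disturbing the state seen by its neighbours; only the spacers determine where later components land.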

DOI: 10.1145/958220.958233


Cite this paper

@inproceedings{Bagley2003CreatingRW, title={Creating reusable well-structured PDF as a sequence of component object graphic (COG) elements}, author={Steven R. Bagley and David F. Brailsford and Matthew R. B. Hardy}, booktitle={ACM Symposium on Document Engineering}, year={2003} }