12.01.2015 Views

Download - Academy Publisher

Download - Academy Publisher

Download - Academy Publisher

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

language, people can design complex graphics<br />

application procedure.<br />

Head<br />

part<br />

Preface<br />

Default<br />

part<br />

Process<br />

Definiti<br />

on part<br />

Document<br />

Documen<br />

ts<br />

Setting<br />

Preface<br />

Page<br />

part<br />

Figure 1. composition of PS text structure<br />

Rear<br />

part<br />

III. OBJECTS AND STRUCTURES OF PDF<br />

PDF (Portable Document Format) is a structurized<br />

document format. The initial objective to design PDF file<br />

format by Adobe is to adapt to digital pre-press workflow<br />

and needs of the development of electronic publishing<br />

market. Although PDF file format relies on imaging<br />

model of PostScript language, and describing the text and<br />

image have nothing to do with the equipment and the<br />

resolution.<br />

PDF document is transitivity, support the interaction,<br />

support the sound and the flash, and support random<br />

access of the contents pages, which also improve the<br />

various operating speed for pages, support the alter mode<br />

of supplement unceasingly in order to reduce the revises<br />

and raise the efficiency, and support kinds of encoding<br />

methods for a more compact structure of the file. In<br />

addition, it has some other functions like fontindependent,<br />

platform-independent and security control.<br />

These characteristics of PDF mean its dual nature, that is,<br />

it continues to support prepress workflow and makes it<br />

more perfect, meanwhile, PDF also provides a basis of<br />

the spread of digital files for the field of electronic<br />

publishing and thus it becomes the major method of the<br />

cross-media publishing file.<br />

A. The major difference between PDF and PostScript<br />

PostScript language is not only a page description<br />

language, but also a common programming language,<br />

which includes the process, the variables and the control<br />

structure. Therefore, a program written by PostScript<br />

language must be translated into page description<br />

information for being accepted by the output device.<br />

Compared with these, PDF does not have the<br />

characteristic of the programming language, which causes<br />

lots of differences between PDF and PostScript.<br />

a) PDF is a document format, but not a programming<br />

language, so it adopts the method, which has nothing to<br />

do with the application software of producing documents,<br />

hardware and operating system, to describe the document.<br />

Therefore, a common programming structure is not exists<br />

in the PDF files, the result of which is the imaging is<br />

much simpler and describing pages is much easier too.<br />

b) Although PostScript describes the objects of pages<br />

having nothing to do with the equipment, it is related<br />

with the pages. Each page in the PDF file is unrelated to<br />

other pages, which eliminates the randomicity of<br />

PostScript language. Hence, PDF file provides the basic<br />

of the available consistency and the divinable result for<br />

the printing and producing system.<br />

c) PDF document in principle can not be directly<br />

downloaded to the PostScript device for outputting. If in<br />

the hope of outputting PDF document in the PostScript<br />

device, these steps below could be carried out.<br />

i. Insert the PostScript language process definition sets<br />

Prosets of the instruction characters which can realize the<br />

PDF page description.<br />

ii. Extract each page’s content and the description of<br />

each page is necessary, because the description part of the<br />

traditional PostScript language procedure uses the<br />

appropriative process.<br />

iii. Decode the compressed text, graphics and image<br />

data, without the data encoded by Flate filter, and not<br />

decode the compressed data in the document when the<br />

device is PostScript Level 2, because it can accept the<br />

compressed data in the PS.<br />

iv. Insert the resource, such as the font, into PostScript<br />

files, replace the font definition and insert according to<br />

the necessary. The basic is the font specifications of the<br />

PDF files.<br />

v. Place the information by the correct order, the result<br />

should be, in a usually sense, a PostScript program file,<br />

containing all the visible part of the document, but<br />

hyperlinks, comments, bookmarks and other PDF units<br />

will no longer be included in the documents.<br />

vi. Sent PostScript language documents to the printer.<br />

B. The object of PDF and the structure of the document<br />

PDF supports following seven kinds of basic objects.<br />

Object encoding and produce encoding are used as the<br />

unique identifier of the indirect object, and the indirect<br />

object always keeps object encoding and produce<br />

encoding of the beginning during its existence. Even if<br />

the indirect object was revised, these two encoding will<br />

still remain. Each indirect object has its unique object<br />

encoding, and the encoding of the indirect object usually<br />

starts continuously from one, but it is necessary. When<br />

the object is deleted from the document, all the produce<br />

code will become zero.<br />

The structure of PDF (i.e. physical structure) includes<br />

four parts: document head, document body, crossreference<br />

table and document rear. The method of the<br />

PDF file organization can be represented as the following<br />

structure:<br />

::= < document head ><br />

< document body ><br />

< cross-reference table ><br />

<br />

Document head specifies the PDF standard version<br />

mumble which the document complies with, and it<br />

appears in the first line of the PDF document. Document<br />

body is composed of a series of PDF indirect object<br />

(IndirectObject). These objects are the fundamental types<br />

of objects, which are used for the different components of<br />

documents, such as fonts, pages and images. Crossreference<br />

table is the address index table of the indirect<br />

object which is established for the random access of the<br />

indirect object. Document rear specifies the address of the<br />

cross-reference table, namely it not only points out the<br />

Catalog of the document body, but also holds the<br />

encryption and other security information.<br />

32

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!