06.01.2013 Views

The use of SGML and XML at the Publications Office

The use of SGML and XML at the Publications Office

The use of SGML and XML at the Publications Office

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong><br />

<strong>at</strong> <strong>the</strong> Public<strong>at</strong>ions <strong>Office</strong><br />

Dr. Holger Bagola<br />

Dir A – Cell “Form<strong>at</strong>s”<br />

Holger.Bagola@cec.eu.int


Table <strong>of</strong> contents<br />

Historical overview<br />

Formex<br />

O<strong>the</strong>r areas <strong>of</strong> <strong>XML</strong> usage<br />

Conclusion<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

2


Table <strong>of</strong> contents<br />

Historical overview<br />

Formex<br />

O<strong>the</strong>r areas <strong>of</strong> <strong>XML</strong> usage<br />

Conclusion<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

3


Historical overview<br />

Among <strong>the</strong> missions <strong>of</strong> <strong>the</strong> Public<strong>at</strong>ions<br />

<strong>Office</strong>:<br />

– Archiving <strong>of</strong> legisl<strong>at</strong>ive public<strong>at</strong>ions<br />

Choice <strong>of</strong> <strong>SGML</strong><br />

– Independent from any pl<strong>at</strong>form<br />

– Distinction between structure <strong>and</strong> present<strong>at</strong>ion<br />

– Support for synoptic document management in<br />

a multilingual environment<br />

Migr<strong>at</strong>ion to <strong>XML</strong><br />

– Basic advantage: availability <strong>of</strong> tools<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

4


Table <strong>of</strong> contents<br />

Historical overview<br />

Formex<br />

O<strong>the</strong>r areas <strong>of</strong> <strong>XML</strong> usage<br />

Conclusion<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

5


<strong>SGML</strong> versions<br />

Formex (1)<br />

– Version 1: adopted in 1984<br />

First deliveries in 1985<br />

Characteristics:<br />

– Mixture <strong>of</strong> <strong>SGML</strong> <strong>and</strong> CCF (Common<br />

Communic<strong>at</strong>ion Form<strong>at</strong>) for meta-d<strong>at</strong>a,<br />

– Markup not very detailed<br />

– Character encoding based on ISO 2022<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

6


Formex (2)<br />

– Version 2: adopted in 1989, revised in<br />

1992<br />

First deliveries in 1989<br />

Characteristics:<br />

– Mixture <strong>of</strong> <strong>SGML</strong> <strong>and</strong> CCF (Common<br />

Communic<strong>at</strong>ion Form<strong>at</strong>) for meta-d<strong>at</strong>a,<br />

– Introduction <strong>of</strong> a logical table model<br />

– Character encoding based on ISO 2022<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

7


Formex (3)<br />

– Version 3: adopted in 1999<br />

Beginning <strong>of</strong> <strong>the</strong> specific<strong>at</strong>ions 1994<br />

First deliveries in 1999<br />

Characteristics:<br />

– Markup <strong>of</strong> semantic role <strong>of</strong> a document<br />

component<br />

– Definition <strong>of</strong> text entities for 11 languages<br />

– Character encoding based on ISO 2022 (after<br />

discussion <strong>of</strong> moving to Unicode UTF-8)<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

8


<strong>XML</strong> version<br />

Formex (4)<br />

– Version 4: adopted in 2004<br />

First deliveries in 2004<br />

Characteristics:<br />

–<strong>XML</strong>,<br />

– Character encoding based on Unicode (UTF-8)<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

9


Basic principles<br />

Formex (5)<br />

– <strong>XML</strong> Schema instead <strong>of</strong> DTD<br />

– One single schema<br />

– Number <strong>of</strong> root elements 12 instead <strong>of</strong><br />

30<br />

– Number <strong>of</strong> elements about 350 instead<br />

<strong>of</strong> 1200<br />

– Distinction between semantic <strong>and</strong><br />

physical markup<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

10


Formex (6)<br />

ARTICLE (TI.ARTICLE, (PARAG+ | ALINEA+))<br />

TI.ARTICLE (#PCDATA)<br />

PARAG (NO.PARAG, ALINEA+)<br />

NO.PARAG (#PCDATA)<br />

ALINEA<br />

. . .<br />

((#PCDATA | NOTE | HT| FT)* |<br />

(P | LIST | TABLE)+)<br />

Blue: semantic markup<br />

Red: physical markup<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

11


Table model<br />

Formex (7)<br />

– Analysis <strong>of</strong> CALS, HTML, Formex v. 3<br />

– Choice:<br />

Model close to HTML (top-down approach,<br />

nested tables)<br />

Maintenance <strong>of</strong> semantic inform<strong>at</strong>ion such<br />

as in Formex v. 3<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

12


Formex (8)<br />

Footnotes<br />

– Distinction between notes in text <strong>and</strong><br />

tables for readability <strong>and</strong> production<br />

simplicity<br />

– Insertion <strong>of</strong> text notes into <strong>the</strong><br />

surrounding text<br />

– ID/IDREF to signal identical footnotes<br />

– Numbering is an object <strong>of</strong> present<strong>at</strong>ion<br />

– Table notes assembled <strong>at</strong> <strong>the</strong> top <strong>of</strong> <strong>the</strong><br />

table<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

13


Quot<strong>at</strong>ions<br />

Formex (9)<br />

– Structured quot<strong>at</strong>ions vs. ‘#PCDATA’<br />

quot<strong>at</strong>ions<br />

– Elements signaling start <strong>and</strong> end <strong>of</strong> a<br />

quot<strong>at</strong>ion (quot<strong>at</strong>ion marks)<br />

– Element with function <strong>of</strong> a container for<br />

structured quot<strong>at</strong>ions.<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

14


Example:<br />

Formex (10)<br />

Article 2<br />

In article 1(2) <strong>of</strong> regul<strong>at</strong>ion (EC) 1234/94 <strong>the</strong> word ‘car’ is replaced by ‘bus’.<br />

Article 6 <strong>of</strong> <strong>the</strong> same regul<strong>at</strong>ion is replaced by <strong>the</strong> following text:<br />

‘Article 6<br />

This is <strong>the</strong> new text <strong>of</strong> article 6.’<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

15


Example:<br />

Formex (11)<br />

<br />

Article 2<br />

In article 1(2) <strong>of</strong> regul<strong>at</strong>ion (EC) 1234/94 <strong>the</strong> car is replaced by bus.<br />

<br />

Article 6 <strong>of</strong> <strong>the</strong> same regul<strong>at</strong>ion is replaced by <strong>the</strong> following<br />

text:<br />

<br />

<br />

Article 6<br />

This is <strong>the</strong> new text <strong>of</strong> article<br />

6.<br />

<br />

<br />

<br />

<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

16


Formex (12)<br />

Splitting large documents<br />

– Fragment<strong>at</strong>ion by definition <strong>of</strong> inclusions<br />

for <strong>the</strong> main document<br />

– Secondary instances referencing <strong>the</strong><br />

inclusions by means <strong>of</strong> <strong>XML</strong> entity<br />

mechanism<br />

– Inclusions may not necessarily be valid<br />

<strong>XML</strong> instances<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

17


main.xml<br />

<br />

<br />

title<br />

<br />

<br />

<br />

<br />

Formex (13)<br />

frag-1.frg<br />

…<br />

…<br />

container.xml<br />

<br />

<br />

&cnt;<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

18


Character set<br />

Formex (14)<br />

– OJ public<strong>at</strong>ions in 20 (21) languages<br />

– Different alphabets<br />

– Intern<strong>at</strong>ional character set definition<br />

Unicode (UTF-8)<br />

– Definition <strong>of</strong> allowed character ranges<br />

– Special font ‘EU-Albertina’<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

19


Meta-d<strong>at</strong>a<br />

Formex (15)<br />

– OJ public<strong>at</strong>ions are composed <strong>of</strong><br />

different levels:<br />

Public<strong>at</strong>ion<br />

Document<br />

‘Contents’<br />

– Meta-d<strong>at</strong>a separ<strong>at</strong>ed according to <strong>the</strong>se<br />

levels<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

20


Public<strong>at</strong>ion<br />

Meta-d<strong>at</strong>a<br />

concerning <strong>the</strong><br />

public<strong>at</strong>ion<br />

Structure <strong>of</strong> <strong>the</strong><br />

public<strong>at</strong>ion with<br />

references to<br />

documents<br />

ProC<strong>at</strong><br />

Formex (16)<br />

Document<br />

Meta-d<strong>at</strong>a for<br />

document<br />

References to<br />

components<br />

Document<br />

Meta-d<strong>at</strong>a for<br />

document<br />

References to<br />

components<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

Contents<br />

main part<br />

001<br />

Contents<br />

Annex 1<br />

001.001<br />

Contents<br />

Annex 2<br />

001.002<br />

Contents<br />

main part<br />

002<br />

21


Formex (17)<br />

• Meta-d<strong>at</strong>a (continued)<br />

– Extraction <strong>of</strong> meta-d<strong>at</strong>a by means <strong>of</strong><br />

autom<strong>at</strong>ic processes (pre-notices)<br />

– Extension <strong>of</strong> pre-notices by juridical analysis<br />

– Availability <strong>of</strong> notices in ProC<strong>at</strong> for o<strong>the</strong>r<br />

productions (Celex) <strong>and</strong> projects<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

22


Formex (18)<br />

Final remark on Formex<br />

specific<strong>at</strong>ions<br />

– Only few complete production chains<br />

from <strong>the</strong> author to <strong>the</strong> printer<br />

– Concentr<strong>at</strong>ion on public<strong>at</strong>ion <strong>of</strong> Official<br />

Journal<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

23


Formex (19)<br />

Valid<strong>at</strong>ion <strong>of</strong> Formex deliveries<br />

– In-depth valid<strong>at</strong>ion necessary<br />

– Autom<strong>at</strong>ic procedures<br />

– Manual procedures<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

24


Formex (20)<br />

Valid<strong>at</strong>ion <strong>of</strong> Formex deliveries<br />

(continued)<br />

– Autom<strong>at</strong>ic procedures<br />

Control <strong>of</strong> filename conventions<br />

Parsing <strong>of</strong> various components<br />

Control <strong>of</strong> completeness<br />

Execution <strong>of</strong> additional valid<strong>at</strong>ion rules<br />

Comparison <strong>of</strong> contents between Formex<br />

<strong>and</strong> PDF<br />

⇒ Report (<strong>XML</strong> instance)<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

25


Formex (21)<br />

Valid<strong>at</strong>ion <strong>of</strong> Formex deliveries<br />

(continued)<br />

– Manual procedures<br />

Verific<strong>at</strong>ion <strong>of</strong> <strong>the</strong> report gener<strong>at</strong>ed by <strong>the</strong><br />

autom<strong>at</strong>ic valid<strong>at</strong>ion procedure<br />

Control <strong>of</strong> <strong>the</strong> <strong>use</strong> <strong>of</strong> Formex specific<strong>at</strong>ions<br />

in all language versions<br />

⇒ Report (<strong>XML</strong> instance) = basis for<br />

archiving or rejection<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

26


Formex (22)<br />

Conversion <strong>of</strong> Formex v. 3 into<br />

Formex v. 4<br />

– Conversion <strong>of</strong> character set (ISO 2020 – UTF8)<br />

– Transform<strong>at</strong>ion <strong>of</strong> <strong>SGML</strong> instances into wellformed<br />

<strong>XML</strong> instances<br />

– Extraction <strong>of</strong> tables <strong>and</strong> conversion into an<br />

intermedi<strong>at</strong>e model<br />

– Gener<strong>at</strong>ion <strong>of</strong> meta-d<strong>at</strong>a levels<br />

– Conversion <strong>of</strong> old elements <strong>and</strong> gener<strong>at</strong>ion <strong>of</strong><br />

new elements<br />

– Valid<strong>at</strong>ion <strong>of</strong> <strong>the</strong> results<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

27


Specific<strong>at</strong>ions:<br />

Formex (23)<br />

http://formex.public<strong>at</strong>ions.eu.int/<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

28


Table <strong>of</strong> contents<br />

Historical overview<br />

Formex<br />

O<strong>the</strong>r areas <strong>of</strong> <strong>XML</strong> usage<br />

Conclusion<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

29


O<strong>the</strong>r areas <strong>of</strong> <strong>XML</strong> usage<br />

(1)<br />

Index <strong>of</strong> OJ public<strong>at</strong>ions<br />

– Biannual issues<br />

– Monthly issues<br />

– Extraction from Celex/ProC<strong>at</strong><br />

– Transform<strong>at</strong>ion into PDF by means <strong>of</strong><br />

XSLT <strong>and</strong> XSL FO (biannual version<br />

only)<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

30


O<strong>the</strong>r areas <strong>of</strong> <strong>XML</strong> usage<br />

(2)<br />

Consolid<strong>at</strong>ion <strong>of</strong> legal documents<br />

– Mainly based on Formex<br />

– Additional administr<strong>at</strong>ive d<strong>at</strong>a in <strong>XML</strong><br />

– Rel<strong>at</strong>ions between historical levels<br />

Description <strong>of</strong> <strong>the</strong> composition <strong>of</strong> a given<br />

historical level<br />

Concordance <strong>of</strong> inform<strong>at</strong>ion on numbering<br />

schemes (articles, …) for each level<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

31


O<strong>the</strong>r areas <strong>of</strong> <strong>XML</strong> usage<br />

(3)<br />

Conversion to RTF<br />

– Comp<strong>at</strong>ibility with o<strong>the</strong>r EU services<br />

– Input in <strong>SGML</strong> or <strong>XML</strong><br />

– Results with LegisWrite templ<strong>at</strong>es<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

32


O<strong>the</strong>r areas <strong>of</strong> <strong>XML</strong> usage<br />

<strong>SGML</strong><br />

instance<br />

(Formex v. 3)<br />

Character<br />

conversion<br />

Transform<strong>at</strong>ion<br />

into wellformed<br />

<strong>XML</strong><br />

(4)<br />

Transform<strong>at</strong>ion<br />

into internal<br />

<strong>XML</strong> form<strong>at</strong><br />

Transform<strong>at</strong>ion<br />

into RTF<br />

(LegisWrite)<br />

Output in<br />

RTF (Legis-<br />

Write)<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

<strong>XML</strong><br />

instance<br />

(Formex v. 4)<br />

33


O<strong>the</strong>r areas <strong>of</strong> <strong>XML</strong> usage<br />

(5)<br />

Production <strong>of</strong> <strong>the</strong> EU budget<br />

– Cre<strong>at</strong>ion <strong>and</strong> maintenance <strong>of</strong> a common<br />

central repository (<strong>XML</strong>)<br />

– Markup <strong>of</strong> modified elements during <strong>the</strong><br />

decision process in working language<br />

– Transl<strong>at</strong>ion only <strong>of</strong> parts modified<br />

– Upd<strong>at</strong>e <strong>of</strong> repository after public<strong>at</strong>ion<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

34


O<strong>the</strong>r areas <strong>of</strong> <strong>XML</strong> usage<br />

Budget<br />

services<br />

Printer<br />

Budget <strong>XML</strong><br />

repository<br />

(6)<br />

Public<strong>at</strong>ions <strong>Office</strong><br />

Formex<br />

archive<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

Transl<strong>at</strong>ion<br />

service<br />

pre-printing<br />

post-printing<br />

35


O<strong>the</strong>r areas <strong>of</strong> <strong>XML</strong> usage<br />

(7)<br />

‘Secondary legisl<strong>at</strong>ion’<br />

– Public<strong>at</strong>ion <strong>of</strong> legisl<strong>at</strong>ion in force in ‘new’<br />

languages<br />

– <strong>XML</strong> production on basis <strong>of</strong> Formex<br />

archive<br />

– Transform<strong>at</strong>ion <strong>of</strong> transl<strong>at</strong>ed input<br />

– Transform<strong>at</strong>ion <strong>of</strong> <strong>SGML</strong> into <strong>XML</strong> <strong>of</strong><br />

Formex instance<br />

– Merging <strong>of</strong> <strong>XML</strong> instances<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

36


O<strong>the</strong>r areas <strong>of</strong> <strong>XML</strong> usage<br />

(8)<br />

Word<br />

document Formex<br />

archive<br />

Conversion<br />

into <strong>XML</strong><br />

Extraction<br />

<strong>of</strong> text<br />

Merging<br />

skeleton &<br />

text<br />

Conversion<br />

into <strong>XML</strong><br />

Extraction<br />

<strong>of</strong> skeleton<br />

Simplify<br />

structure<br />

Public<strong>at</strong>ion<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

Celex<br />

ProC<strong>at</strong><br />

37


O<strong>the</strong>r areas <strong>of</strong> <strong>XML</strong> usage<br />

(9)<br />

European document repository<br />

– TIFF <strong>of</strong> public<strong>at</strong>ions<br />

– PDF <strong>of</strong> public<strong>at</strong>ions<br />

– Formex instances <strong>of</strong> OJ public<strong>at</strong>ions<br />

– Exchange <strong>of</strong> inform<strong>at</strong>ion by <strong>XML</strong><br />

messages<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

38


O<strong>the</strong>r areas <strong>of</strong> <strong>XML</strong> usage<br />

(10)<br />

Public<strong>at</strong>ion <strong>of</strong> calls for tender (OJ-S)<br />

– Input in different (electronic) form<strong>at</strong>s<br />

– Harmoniz<strong>at</strong>ion in <strong>XML</strong><br />

– Upd<strong>at</strong>ing d<strong>at</strong>abase TED<br />

– Production <strong>of</strong> CD-ROM version<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

39


Table <strong>of</strong> contents<br />

Historical overview<br />

Formex<br />

O<strong>the</strong>r areas <strong>of</strong> <strong>XML</strong> usage<br />

Conclusion<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

40


Conclusion<br />

Difficult start with <strong>SGML</strong><br />

Successful <strong>use</strong> <strong>of</strong> <strong>XML</strong> as well as <strong>of</strong><br />

o<strong>the</strong>r st<strong>and</strong>ards such as XSLT/XP<strong>at</strong>h,<br />

XSL FO<br />

Powerful possibilities <strong>of</strong> re-<strong>use</strong> <strong>of</strong><br />

<strong>XML</strong> instances<br />

<strong>The</strong> <strong>use</strong> <strong>of</strong> <strong>SGML</strong> <strong>and</strong> <strong>XML</strong> <strong>at</strong> <strong>the</strong><br />

Public<strong>at</strong>ions <strong>Office</strong><br />

41

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!