08.01.2014 Views

cen workshop agreement cwa 15778 - Higher Education Academy

cen workshop agreement cwa 15778 - Higher Education Academy

cen workshop agreement cwa 15778 - Higher Education Academy

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

CEN<br />

WORKSHOP<br />

CWA <strong>15778</strong><br />

February 2008<br />

AGREEMENT<br />

ICS 35.240.30<br />

English version<br />

Document Processing for Accessibility<br />

This CEN Workshop Agreement has been drafted and approved by a Workshop of representatives of interested parties, the constitution of<br />

which is indicated in the foreword of this Workshop Agreement.<br />

The formal process followed by the Workshop in the development of this Workshop Agreement has been endorsed by the National<br />

Members of CEN but neither the National Members of CEN nor the CEN Management Centre can be held accountable for the technical<br />

content of this CEN Workshop Agreement or possible conflicts with standards or legislation.<br />

This CEN Workshop Agreement can in no way be held as being an official standard developed by CEN and its Members.<br />

This CEN Workshop Agreement is publicly available as a reference document from the CEN Members National Standard Bodies.<br />

CEN members are the national standards bodies of Austria, Belgium, Bulgaria, Cyprus, Czech Republic, Denmark, Estonia, Finland,<br />

France, Germany, Greece, Hungary, Iceland, Ireland, Italy, Latvia, Lithuania, Luxembourg, Malta, Netherlands, Norway, Poland, Portugal,<br />

Romania, Slovakia, Slovenia, Spain, Sweden, Switzerland and United Kingdom.<br />

EUROPEAN COMMITTEE FOR STANDARDIZATION<br />

COMITÉ EUROPÉEN DE NORMALISATION<br />

EUROPÄISCHES KOMITEE FÜR NORMUNG<br />

Management Centre: rue de Stassart, 36<br />

B-1050 Brussels<br />

© 2008 CEN All rights of exploitation in any form and by any means reserved worldwide for CEN national Members.<br />

Ref. No.:CWA <strong>15778</strong>:2008 D/E/F


CWA <strong>15778</strong>:2008 (E)<br />

Contents<br />

Page<br />

Foreword .............................................................................................................................................................4<br />

1 Introduction............................................................................................................................................5<br />

1.1 Scope ......................................................................................................................................................5<br />

1.2 Purpose ..................................................................................................................................................5<br />

1.3 Formal liaisons ......................................................................................................................................6<br />

1.4 Target audience .....................................................................................................................................6<br />

1.5 Opening new markets ...........................................................................................................................7<br />

2 Standards for document processing for accessibility ......................................................................8<br />

2.1 Problem Statement................................................................................................................................8<br />

2.2 Standards and structures .....................................................................................................................9<br />

2.3 Current situation..................................................................................................................................10<br />

2.4 Current state of the art........................................................................................................................11<br />

2.5 Current problems.................................................................................................................................14<br />

2.6 Influencing factors in Document Processing for Accessibility ......................................................15<br />

3 Formats for document processing ....................................................................................................18<br />

3.1 Printed paper........................................................................................................................................18<br />

3.2 Printed Braille ......................................................................................................................................19<br />

3.3 Audio.....................................................................................................................................................20<br />

3.4 ASCII Text.............................................................................................................................................21<br />

3.5 HTML documents.................................................................................................................................21<br />

3.6 XML .......................................................................................................................................................22<br />

3.7 Multi-type composite formats ............................................................................................................24<br />

4 Considerations for structuring documents ......................................................................................27<br />

4.1 Define and use document style guidelines.......................................................................................27<br />

4.2 Define and use structure guidelines..................................................................................................27<br />

4.3 Edit / add structure where needed.....................................................................................................28<br />

4.4 Edit DRM settings................................................................................................................................28<br />

4.5 Adaptation ............................................................................................................................................28<br />

5 Conversion processes ........................................................................................................................29<br />

5.1 Convert Multimedia Material to structured Multimedia Material.....................................................29<br />

5.2 Convert structured Multimedia Material to XML...............................................................................30<br />

5.3 Convert Multimedia Material to XML..................................................................................................30<br />

5.4 Convert traditional print to XML.........................................................................................................30<br />

5.5 Convert DTP to XML............................................................................................................................31<br />

5.6 Convert XML to Print...........................................................................................................................31<br />

5.7 Convert XML to Braille ........................................................................................................................31<br />

5.8 Convert XML to Large Print ................................................................................................................32<br />

5.9 Convert XML to HTML .........................................................................................................................32<br />

5.10 Convert XML to structured Multimedia Material...............................................................................32<br />

5.11 Convert DTP to Multimedia Material..................................................................................................33<br />

5.12 Convert Audio to structured Audio ...................................................................................................33<br />

5.13 Convert XML to XML ...........................................................................................................................33<br />

6 S<strong>cen</strong>arios introducing accessibility within publishing workflows.................................................35<br />

6.1 S<strong>cen</strong>ario 1 - Delivering XML files.......................................................................................................36<br />

6.2 S<strong>cen</strong>ario 2 - Accessibility enhancement in general.........................................................................38<br />

6.3 S<strong>cen</strong>ario 3 - Increasing web accessibility.........................................................................................40<br />

6.4 S<strong>cen</strong>ario 4 - Accessibility policy........................................................................................................42<br />

6.5 S<strong>cen</strong>ario 5 - Spoken documents for everyone .................................................................................43<br />

6.6 S<strong>cen</strong>ario 6 - Accessible and protected PDFs ...................................................................................45<br />

2


CWA <strong>15778</strong>:2008 (E)<br />

6.7 S<strong>cen</strong>ario 7 - Working hand in hand ...................................................................................................47<br />

6.8 S<strong>cen</strong>ario 8 - Accessible design..........................................................................................................49<br />

6.9 S<strong>cen</strong>ario 9 - Accessibility on a large scale .......................................................................................51<br />

6.10 S<strong>cen</strong>ario 10 - What authors can do....................................................................................................52<br />

6.11 S<strong>cen</strong>ario 11 - Repair and adaptation .................................................................................................54<br />

6.12 Common s<strong>cen</strong>ario requirements .......................................................................................................56<br />

6.13 Specific s<strong>cen</strong>ario requirements .........................................................................................................56<br />

7 Application-oriented s<strong>cen</strong>ario implementation ................................................................................56<br />

7.1 Harry Potter and the RNIB ..................................................................................................................56<br />

7.2 Magazine and Newspaper distribution in the Netherlands .............................................................57<br />

7.3 Time Warner and Dolphin Audio Publishing ....................................................................................58<br />

7.4 <strong>Education</strong>al publishing in Austria .....................................................................................................59<br />

7.5 Best practice for distributing accessible content............................................................................61<br />

8 Identified gaps and areas for further research.................................................................................66<br />

8.1 Descriptions & Requirements ............................................................................................................66<br />

8.2 Process & Content Modelling.............................................................................................................66<br />

8.3 Introducing and using metadata for accessibility purposes ..........................................................67<br />

8.4 Standards and personalisation of content .......................................................................................68<br />

8.5 Li<strong>cen</strong>sing and technical protection measures .................................................................................69<br />

9 Conclusion and future work ...............................................................................................................70<br />

Appendix A – Relevant standards ..................................................................................................................72<br />

Appendix B – Relevant European organisations ..........................................................................................79<br />

Appendix C – Sustainability: network of interested parties for ongoing support and further<br />

development.........................................................................................................................................82<br />

Appendix D – Abbreviations List ....................................................................................................................88<br />

3


CWA <strong>15778</strong>:2008 (E)<br />

Foreword<br />

The decision for this CEN Workshop Agreement (CWA) was taken by the DPA Workshop at its kick-off meeting on 13<br />

May 2005.<br />

This CWA provides a first elaboration on how the accessibility of publishing content can be enhanced by altering existing<br />

publishing workflows and introducing accessibility considerations where appropriate. For reaching this goal, in each step<br />

where accessibility is introduced, relevant formats and conversions are detailed out, as well new workflow items<br />

described.<br />

The document has been developed through the collaboration of a number of contributing partners such as publishers,<br />

blind organizations and universities. The names of the individuals and their affiliations that have expressed support for<br />

this CWA are available from the CEN/ISSS Secretariat.<br />

The final draft of this CWA was put on CEN’s web site for a 60 days period of external comments between 06 July and 10<br />

September 2007.<br />

The formal process followed by the Workshop in the development of the CEN Workshop Agreement has been endorsed<br />

by the National Members of CEN but neither the National Members of CEN nor the CEN Management Centre can be<br />

held accountable for the technical content of the CEN Workshop Agreement or possible conflict with standards or<br />

legislation. This CEN Workshop Agreement can in no way be held as being an official standard developed by CEN and<br />

its members.<br />

The final review/endorsement round for this CWA was successfully closed on 2007-09-18.The final text of this CWA was<br />

submitted to CEN for publication on 2007-11-30.<br />

This CEN Workshop Agreement is publicly available as a reference document from the National Members of CEN:<br />

AENOR, AFNOR, ASRO, BDS, BSI, CSNI, CYS, DIN, DS, ELOT, EVS, IBN, IPQ, IST, LVS, LST, MSA, MSZT, NEN,<br />

NSAI, ON, PKN, SEE, SIS, SIST, SFS, SN, SNV, SUTN and UNI.<br />

Comments or suggestions from the users of the CEN Workshop Agreement are welcome and should be addressed to<br />

the CEN Management Centre.<br />

4


CWA <strong>15778</strong>:2008 (E)<br />

1 Introduction<br />

1.1 Scope<br />

Given the widespread adoption of ICT within the publishing industries, there is a general interest in the creation and<br />

provision of well-formatted digital documents. For those people who are dependent on accessible information, this<br />

interest is of <strong>cen</strong>tral importance, and it is this convergence of interests that stimulated the creation of this Workshop. The<br />

WS/DPA has examined some of the ways in which this convergence is helping to build consensus and create new<br />

strategies and technologies for the provision of information in formats that are more accessible for everyone.<br />

In the real world, publishers rely on accessibility experts and consider accessible information only at the end of the<br />

content production chain. This requires considerable amount of efforts to make information accessible for everyone and it<br />

is a very hard problem to tackle. This <strong>workshop</strong> introduces accessibility as a design element in the content production<br />

and provides guidelines and best practices how more accessible documents can be produced. Another important issue is<br />

that the user requirements for accessible information are not well defined. In this work, we therefore base the elaboration<br />

on publishing use cases and s<strong>cen</strong>arios that have been derived together with publishers in order to analyse at least partly<br />

the user requirements. Additionally those s<strong>cen</strong>arios provide specific examples of accessible information provision as an<br />

entry point to publishing stakeholders.<br />

Sustaining the provision of useful services and meaningful accessible content can be considered vital to the growth of<br />

the Information Society as a whole. When designing, specifying and building applications and infrastructures to store<br />

accessible content, several apparently unrelated issues arise. How do we describe the knowledge and capabilities we<br />

possess and capture the repository of resources we can use to implement these capabilities? How do we describe the<br />

questions and problems of end users and content providers? How do we marry both within manageable and consistent<br />

frameworks? How do we re-apply this knowledge and combine these resources with new insights to solve new<br />

problems? How can we accelerate the processes described above and provide solutions to enable accessible<br />

information processing?<br />

The DPA Workshop (CEN Workshop on Document Processing for Accessibility) brought together some of the key players<br />

working in the fields of publishing and accessibility. The topics addressed ranged from generic document and knowledge<br />

structures, through all aspects of accessible document processing, to Digital Rights Management and copyright issues.<br />

Perhaps the most striking aspect was the level of convergence between the needs of accessibility communities and<br />

those of content creators and providers. Indeed, with the introduction of accessibility from scratch, the information needs<br />

of all consumers are better served, particularly as content providers seek innovative solutions for re-aggregating their<br />

content for new marketplaces.<br />

1.2 Purpose<br />

The DPA Workshop as detailed in its business plan has the following objectives:<br />

<br />

<br />

<br />

To bring together all the players in the information provision and e-publishing chain in order to<br />

achieve the critical mass significantly to enhance the provision of accessible information at a<br />

European level.<br />

To provide guidelines needed on integrating accessibility approaches and workflows within the<br />

document management and publishing process rather than as just a specialised additional service.<br />

To raise awareness and stimulate the adoption at local, regional, national and European levels of the<br />

emerging formats and standards for the provision of accessible information and to find ways of<br />

ensuring that technological protection measures do not inadvertently impede legitimate access to<br />

information by people with print impairments<br />

Based on those objectives this document:<br />

<br />

<br />

<br />

describes the outcomes from the DPA Workshop activities<br />

provides an elaboration of relevant standards and their possible use in the publishing sector<br />

examines the different formats required for accessible information provision<br />

5


CWA <strong>15778</strong>:2008 (E)<br />

<br />

<br />

<br />

<br />

provides a systematic overview of relevant conversion processes and related structured information<br />

activities<br />

examines possible s<strong>cen</strong>arios of use within the publishing sector<br />

provides real-life case studies and instances of best practice<br />

identifies areas for further research and systematisation<br />

1.3 Formal liaisons<br />

The Workshop has been initiated and supported by the EUAIN Project. The EUAIN network is a co-ordination action 1 cofunded<br />

by the INFSO DG of the European Commission within the RTD activities of the Thematic Priority Information<br />

Society Technologies of the 6th Framework Programme. The EUAIN project, co-ordinated by DEDICON Amsterdam,<br />

aims to promote e-Inclusion as a core horizontal building block in the establishment of the Information Society by<br />

creating a network to bring together the different actors in the content creation and publishing industries around a<br />

common set of objectives relating to the provision of accessible information. Accessibility for print impaired people can be<br />

an increasingly integrated component of the document management and publishing process and should not be a<br />

specialised or additional service.<br />

The DPA Workshop has established the following liaisons with relevant activities to its work programme. Liaisons<br />

established:<br />

AIIM PDF Access Working Group<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

CEN/ISSS Data Protection & Privacy Workshop<br />

CEN/ISSS Learning Technologies Workshop<br />

CEN/ISSS Dublin Core Metadata Workshop<br />

COST 219 TER<br />

EDeAN European Design for All e-Accessibility Network<br />

ETSI TC/HF ETSI Technical committee Human Factors<br />

ICTSB/DATSCG Design for All and Assistive Technologies Standardisation Co-ordination Group<br />

OASIS TCs concerning ODF and DITA<br />

W3C Web Accessibility Initiative (WAI)<br />

DAISY (NISO z39.86)<br />

ISO/IEC JTC 1 Special Working Group on Accessibility<br />

ISO/IEC JTC 1/SC 29 Coding of audio, picture, multimedia and hypermedia information<br />

ISO/IEC JTC 1/SC 7/WG 2 Software and Systems Documentation<br />

1.4 Target audience<br />

This CWA addresses accessibility in the publishing value chain and examines ways to introduce and enhance<br />

accessibility of publishing content inside publishing workflows. The intended audience includes actors and stakeholders<br />

within the publishing value chain (publishers, authors, content producers and distributors, publishing system developers<br />

and vendors) and the content accessibility area (specialised libraries, accessibility consultants, and accessible system<br />

developers and vendors)<br />

The CWA aims to provide a first elaboration on how the accessibility of publishing content can be enhanced by altering<br />

existing publishing workflows and introducing accessibility considerations where appropriate. For reaching this goal, in<br />

1 Contract number 511497, DG INFSO, EC, FP6. See http://www.euain.org<br />

6


CWA <strong>15778</strong>:2008 (E)<br />

each step where accessibility is introduced, relevant formats and conversions are detailed out, as well new workflow<br />

items described.<br />

The primary target group of this CWA are actors in the publishing value chain. The document is structured in a form that<br />

enables publishers to get quick insights on what they need to do in order to produce accessible content. For that reason<br />

several specific s<strong>cen</strong>arios are provided that, although not exhaustive, can serve as an entry point for publishers in their<br />

accessible content implementations.<br />

1.5 Opening new markets<br />

Accessible information is not a special type of information aimed at a specific group of a certain population. Accessible<br />

information is information that can be accessed by anyone, with or without a disability, aimed at a general market where<br />

anyone interested is a possible customer. Structured information is the first step in the accessible information process. A<br />

document whose internal structure can be defined and its elements isolated and classified, without losing sight of the<br />

overall structure of the information, is a document that can be navigated.<br />

Most adaptive technology allows the user to access a document, and to read it following the "outer" structure of the<br />

original. But if the same information has also an "inner" structure that allows the adaptive device to distinguish between a<br />

phrase and a measure, between a paragraph and a sentence, highlighting particular annotations, then the level of<br />

accessibility (and therefore usability) of the whole document will be greatly enhanced, allowing the user to move through<br />

it in the same way as those without impairments do when looking at a printed document, and following the same integral<br />

logic.<br />

In an ideal world, all documents made available in electronic formats should contain that internal structure that benefits<br />

everyone. Highly-structured documents are becoming more and more popular due to reasons that very seldom pertain to<br />

making it accessible to persons with disabilities. The move to XML related formats and associated standards for<br />

metadata have provided an impetus for far greater document structuring than before. Whatever the reasons behind those<br />

decisions are, the use of highly-structured information is of great benefit to anybody accessing them for any purpose.<br />

In re<strong>cen</strong>t years, the market for accessibility and assistive technologies has started to gain recognition. It is clear that the<br />

integration of accessibility notions into mainstream technologies would provide previously unavailable opportunities in the<br />

provision of accessible multimedia information systems. It would open up modern information services and provide them<br />

to all types and levels of users, in both the software and the hardware domain. Additionally, new consumption and<br />

production devices and environments can be addressed from such platforms and this would provide very useful<br />

information provision opportunities indeed, such as information on mobile devices with additional speech assistance.<br />

It is equally clear that we remain at the very beginning of the move to incorporate accessibility within mainstream content<br />

processing environments.<br />

7


CWA <strong>15778</strong>:2008 (E)<br />

2 Standards for document processing for accessibility<br />

2.1 Problem Statement<br />

Accessible solutions are required for anyone who requires assistance in using the mainstream solution. This<br />

could be because a user is blind, visually impaired, or impaired in some other way, and the term printimpaired<br />

is often used in this context. Accessible solutions range from small assistive applications, (such as<br />

screen magnifiers) to full scale operating systems and screen reading environments. The traditional problem<br />

with accessible solutions is that they are normally implemented as an afterthought or a piggy-back solution.<br />

This results in solutions that are not fully integrated (or not well integrated) with the mainstream solutions.<br />

These independent applications are then at a disadvantage whenever software versions or operating systems<br />

are updated. In order to make this integration process easier, and provide more intuitive designs for the future,<br />

it is essential that “design for all” and accessible design methodologies are widespread. Standard, policy and<br />

legislation also helps ensure that accessible designers have a solid standard to meet to ensure futureproofing<br />

2 .<br />

Notions of “accessibility” are normally equated with the adaptation and conversion of digital content, where this content<br />

can be made available. On a European level, and indeed often on a national level, much of the existing expertise on<br />

creating accessible adaptations of digital content is of a highly distributed nature. Within specialist organisations<br />

supporting print impaired people; or within university research laboratories; or indeed within publishing houses, many<br />

automated tools have been designed and implemented at least partially to execute the necessary adaptation procedures.<br />

However, each automated tool has its own, highly specific, field of application. Furthermore, the knowledge required to<br />

build these very specific tools is equally distributed, so that there is currently very little re-use of either tools or knowledge.<br />

The content provider’s perspective on digitisation is further complicated by security issues. In the modern environment<br />

driven by the internet for content dissemination, security is a vital issue for publishers. DRM is a complex problem for all<br />

content holders. Every publisher’s content, client base and requirements are different, which often results in a<br />

personalised set of requirements for each case. As a result, <strong>agreement</strong>s on accessibility are often negotiated on a caseby-case<br />

basis. Naturally, publishers have to be confident that any digital format is being delivered through secure<br />

gateways to only the people who are intended to receive it.<br />

Accessibility can also be viewed from a wider angle. Being able to see content in whatever modality; perceive its context;<br />

and attach a useful meaning to it requires that the user be able to access this content, its context and relevant software<br />

application in a way that meets that particular user's consumption preferences. These preferences may become<br />

requirements over time - we all get older. Being able to attach useful meanings to content is what lies at the very basis of<br />

the preservation and education of thought. Attaching useful meanings to content underpins the basis of culture,<br />

commerce and civilisation. Being able to access software and the content and the potential for understanding it<br />

unleashes, requires us to be able to gain access to software and not be hindered by huge costs, complexity, lack of<br />

support and additional barriers.<br />

Given the differences between the traditional approach to accessibility and the wider view outlined above, we are in<br />

something of a transitional phase at this time. From the software producer, business community and the Open Source<br />

System community we see a move towards the inclusion of accessibility features into systems, tools and the<br />

programming languages themselves as system wide core functionalities (examples being KDE 3 , GNOME 4 , and Java<br />

Accessibility 5 . From the accessibility community we see a move towards more advanced and abstract descriptions of the<br />

procedures involved in moving from 'common' content towards content that is processed to be granted accessible<br />

certification. A good example of such a move is the Web Content Accessibility Guidelines 1.0 6 and 2.0 7 , which provide<br />

2 See, for example, the work of EIDD-Design For All Europe, http://www.design-for-all.org<br />

3 KDE Accessibility Project, http://accessibility.kde.org/<br />

4 GNOME Accessibility Project, http://developer.gnome.org/projects/gap/GNOME-Accessibility.html<br />

5 JAVA Accessibility, http://java.suncom/j2se/1.4.2/docs/guide/access/<br />

6 http://www.w3.org/TR/WCAG10/<br />

8


CWA <strong>15778</strong>:2008 (E)<br />

detailed guidelines on how to (re)structure and enhance websites and their content to ensure a sufficient level of<br />

accessibility.<br />

The transitional stage described above involves relatively slow change when compared with general exhilarating<br />

technological developments. However, this relatively slow pace also creates an opportunity to take a step back and<br />

observe all the individual processes that touch upon the notion of accessibility. This allows us to explicate similarities and<br />

possible complementarities, a process of convergent gradualism if you like. The opportunity then arises to synchronise<br />

various efforts in the accessibility arena and offer them to end users and business as a ‘package’. Such a package<br />

contains scientific knowledge about accessibility, as well as technological knowledge about how to implement such<br />

notions. This package also contains detailed descriptions of the requirements of the end users, producers and<br />

distributors of content, as well as tools aiming towards market segments that rely on these requirements. Such an<br />

approach that aims to unify 'common' content, system, service and tool provision and the more 'specialised' content,<br />

system, service and tool provision, can be called Accessible Information Processing (AIP).<br />

2.2 Standards and structures<br />

Structured information is the first big step towards high-quality accessible information. A document whose internal<br />

structure can be defined and its elements isolated and classified, without losing sight of the overall structure of the<br />

document, is a document that can be navigated.<br />

As noted above, most adaptive technology allows the user to access a document, and to read it following the "outer"<br />

structure of the original. If that structure is left to a range of visual cues, like bold capital letters for the title of a chapter<br />

and bold italics for the heading of a subchapter, the adaptive device will surely flatten that visual structure, leaving a<br />

document with no structure at all. But if the same document has also an "inner" structure that makes possible for the<br />

adaptive device to distinguish between a paragraph and a footnote, between a chapter and a subchapter, then the level<br />

of accessibility of the whole document will be greatly enhanced, allowing the user to move through it in the same way<br />

those without disabilities do when looking at the printed document, following the same "logic".<br />

In an ideal world, any document made available in electronic format should contain that inside structure that benefits to<br />

everyone. Highly-structured documents are becoming more and more popular due to reasons that very seldom have to<br />

do with making it accessible to persons with disabilities.<br />

Some of the largest publishers are converting their old electronic texts into full XML documents so that it will allow them<br />

to look for certain bits of text that they can re-use in further editions, as well as to help them to avoid double-production of<br />

the same text. Whatever the reasons behind those decisions are, the use of highly-structured information is of great<br />

benefit to anybody accessing them for whatever purpose. And equally important is to structure the following information<br />

standards as this allows for:<br />

Consistency in the description of structural elements<br />

Understanding and predictability of structures<br />

Interaction with other standards<br />

Technical compliance with different devices<br />

Interchangeability of materials<br />

Flexibility and evolution.<br />

Standards are needed for many reasons, but probably the most relevant one is that they help manufacturers to make<br />

their products accessible in a detailed, coherent way. The existence of standards though does not imply that accessibility<br />

will be implemented in the same way or with the same results in all products. The existence of a number of standards for<br />

producing the same product (a document) may occasionally lead to two different levels of accessibility for the same<br />

"accessible" final product. Even within the same standard it sometimes happen that some features are considered<br />

essential while others may be considered expendable – as a result of this, the application of the same standard with<br />

different views about what is needed and what is not to make a document accessible may provide a wide range of<br />

accessibility levels for the same product, making it fully accessible for some users and just slightly accessible for others.<br />

7 http://www.w3.org/WAI/intro/wcag20<br />

9


CWA <strong>15778</strong>:2008 (E)<br />

It is therefore important to decide beforehand the level of accessibility to apply to a certain document according to<br />

different variables, such as:<br />

<br />

<br />

<br />

The depth of the structure that the document allows/needs to make it sufficiently navigable<br />

The level of navigability actually needed by the potential users<br />

The resources available to make a document accessible<br />

It may also happen that different standards are developed for the same purpose and though they deliver the same level<br />

of accessibility they are not compatible. This usually leads to confusion for manufacturers and service providers while it<br />

also "divides" users between the different existing standards (we all remember the Beta vs. VHS example). We can<br />

distinguish between formal (or de jure) standards and de facto standards. The former are those who have been<br />

"formalised" by standards organisations, while the latter are technical solutions that have been adopted informally by<br />

users due to its usefulness and/or reliability. Among these de facto standards, we also have two categories – proprietary<br />

standards (those developed by a commercial company) and open standards (outside vendor’s control, freely developed<br />

and updated by independent programmers).<br />

A list of relevant standards for accessible information processing has been collated and is included in Appendix A. It<br />

quickly becomes clear that no single standard or set of standards can help fully to implement accessible information<br />

processing within mainstream workflows. We rely on the existence and the promotion of accessibility standards to prove<br />

that accessibility can be built from the first stage of production: that Design For All can be applied to emerging standards<br />

so that all the features needed to grant accessibility to the final product are built into the system right from the beginning,<br />

instead of the traditional approach of adding those features later. There are many different standardisation agencies<br />

around the world, but what is considered to be fully accessible in the United Kingdom may not be seen as accessible in<br />

the same degree in Australia.<br />

2.3 Current situation<br />

A publisher can create a product that can be marketed for everyone, that includes accessibility features right from the<br />

first stages of the creative process and that looks just like its non-accessible version. This does not apply to all types of<br />

information (printed information, for instance), but it certainly applies to electronic publishing, audio publishing, the movie<br />

industry, etc. Printed information relies too heavily on its visual appearance and cannot be distributed "as is" to people<br />

with a print disability, but printed information is usually the last presentational output stage. Many other formats<br />

containing the same information are likely to have been created. Those files are sometimes documented, structured,<br />

catalogued and stored for future use, and those files can be created in a way in which the text and images they contain,<br />

the structure they rely on, may be of use for people with disabilities.<br />

One clear example of how accessible information need not necessarily look different or be marketed differently is the<br />

PDF format as demonstrated by Adobe PDF version 8 onwards. 8 A PDF document can be created and distributed either<br />

tagged or untagged. The former will allow print impaired people to read the document; the latter will be of use only to<br />

those who can see it. They both look the same, and they are equally expensive or inexpensive to create. Likewise, web<br />

pages can be designed in a way that can be accessed by assistive technology or not. They have the same look and, for<br />

a programmer who knows which elements to use, making an accessible webpage is not a much bigger task than making<br />

an inaccessible one.<br />

Therefore, what would make information providers decide to create their documents in a way that can be accessible to<br />

everyone? It could be to open up their market to anybody interested in the product they sell; or to comply with the<br />

legislation 9 that requires that particular information is made public in a format that is accessible to all possible; or simply<br />

to reach a much wider audience.<br />

The publishing value chain may change significantly if publishers begin to ascribe greater value the electronic files they<br />

use for printing. Tagged and structured electronic files not only have an intrinsic value (in the e-publishing sector) but<br />

may also be used to create other formats that people with or without disabilities may use or prefer to paper. For example,<br />

8 Adobe Reader from http://www.adobe.com and associated authoring/processing tools from<br />

http://www.adobe.com/products/acrobat/<br />

9 Sullivan, J., (2007) Study on Copyright Limitations and Exceptions for the Visually Impaired, SCCR/15/7,<br />

WIPO, Geneva.<br />

10


CWA <strong>15778</strong>:2008 (E)<br />

the cost of producing a structured talking book using these files together with good quality synthetic speech is minimal<br />

and would increase their potential market.<br />

The "accessibility features" needed to make an electronic document usable for people with print disabilities are not<br />

"exclusive features". This expression has a two-fold meaning – a fully structured XML document will not exclude people<br />

without disabilities from using it just because it is structured and in a readable format. Besides, we do not need to create<br />

an exclusive document to be used only with special equipment (except when copyright exceptions may apply).<br />

What is sometimes considered "accessible" is simply "usable" – structures, navigation, are features that make a<br />

document simply easier to use and more friendly to ANY user. When we add to a printed book a CD-ROM with the text or<br />

the audio (or both) we are not making it available only to people with disabilities - we are also helping students to carry<br />

the full text of the book anywhere in any portable device, and we are also giving them the opportunity to cite parts of the<br />

book in their papers without typing them. We are allowing buyers of the book to listen to it while driving home after work,<br />

as well as giving a blind person the possibility of hearing its contents.<br />

In some cases, electronic documents offer much more possibilities and a better reading experience than the printed<br />

edition. Books full of references to other books and authors, with words that can be found in a glossary or notes that are<br />

explained at the end of the book, are better used and enjoyed when they are published in an electronic format. When<br />

properly edited, entries can be created to any chapter, subchapter or any other significant part of the structure from the<br />

table of contents; an entry to the words in the glossary every time those words are used in the book; or links to the<br />

bibliography every time another author is mentioned. All this can be accomplished with a simple mouse click or by<br />

pressing the space bar. This enhances the value of certain books, and it is a format that has already been used by some<br />

publishers of reference material. These are not books for people with print disabilities, but they are (in most cases)<br />

perfectly accessible.<br />

2.4 Current state of the art<br />

Document processing and accessibility both have a very wide focus, it has therefore been decided to structure the<br />

elaboration of standards in this <strong>workshop</strong> according to the value chain in publishing, namely content creation, content<br />

production and content distribution. Following this workflow the elaboration will be structured in a comprehensive way as<br />

each step in the publishing process will be examined. A general publishing chain can be defined in simple terms as<br />

“economic activities that facilitate the creation, production, circulation, delivery and consumption of information-based<br />

products” 10<br />

In Figure 1 the above elements are visualised to exhibit the intention of how the elaboration is going to be demonstrated.<br />

The content processing chain requires on one side actors that create content, a framework and infrastructure for content<br />

production, storage and management, and at the end user side, consumption and delivery interfaces. In each of these<br />

building blocks of the content processing chain, several actors interact in order to create, produce and consume content.<br />

Those interactions form processing workflows along publishing channels and products, leading to single or multiple<br />

channel publishing. Dealing with each content processing building block requires modelling content, user and their<br />

interactions in a way that implicit steps and knowledge become explicit. Then, in each of these steps, accessibility<br />

requirements and enhancements need to be analysed in order to introduce accessibility right in the beginning rather than<br />

at the end of the content processing chain. This is exactly the methodology followed in the current CWA: in each of those<br />

workflow steps identified, actors and processes are analysed and accessibility requirements elaborated, links to useful<br />

and related standards made explicit, and possible content accessibility enhancement steps described in more detail. For<br />

feasibility reasons we limit the potential workflows in order to present at least in those cases a thorough analysis.<br />

10 South Africa Department of Arts, Culture, Science and Technology (1998), Final Report: The cultural<br />

industries growth strategy (CIGS): The South African publishing industry report<br />

11


CWA <strong>15778</strong>:2008 (E)<br />

Figure 1 - Generic Publishing Process<br />

2.4.1 Current practices in accessible publishing<br />

2.4.1.1 Document Conversion<br />

Much of the current practice in document conversion <strong>cen</strong>tres on the use of Word Processor files and conversion to<br />

formats such as Adobe PDF. There are various methods for achieving this by use of plug-in type software 11 for a host<br />

application or larger scale document conversion services 12 . There are methods which can improve the accessibility and<br />

meta enhanced aspects of these formats, but in order to be successful there is a responsibility on the author to oversee<br />

and ensure the quality of this process.<br />

Other methods are using desktop publishing applications such as Adobe InDesign 13 or QuarkXpress 14 but the latter<br />

cannot generate accessible PDF documents.<br />

There is a lot of industry wide interest in using XML for publishing. There are many practical advantages to using XML<br />

based languages, these include the dynamic reusability and repurposing of content and the development of XML-<strong>cen</strong>tric<br />

workflows by publishers which promises the potential to create structured content that can be outputted in accessible<br />

formats required by people with disabilities 15 .<br />

In certain countries this is due, in part, to changes in legislation where accessible content is a requirement 16 but also to a<br />

growing recognition on the part of the publishing industry of the changes in the ways end users will access content due to<br />

technological advances 17 and the need for users to be able to access published content in the medium of their choice.<br />

11 http://www.adobe.com/support/downloads/detail.jsp?ftpID=1161<br />

12 http://createpdf.adobe.com/<br />

13 http://www.adobe.com/de/products/indesign/<br />

14 http://www.quark.com/<br />

15 http://www.webaim.org/intro/<br />

16 http://www.ncd.gov/newsroom/publications/2004/inclusion_whitepaper.htm<br />

17 http://www.audible.co.uk/<br />

12


CWA <strong>15778</strong>:2008 (E)<br />

While the referenced examples relate primarily to Web accessibility, the issues encountered by users and subsequent<br />

methodologies used to overcome barriers to access, are relevant to the publishing industry 18.<br />

2.4.1.2 Trusted Intermediaries<br />

Trusted intermediaries establish a personalised relationship between content holders and specialist organisations<br />

whereby publishers and agencies serving blind and partially sighted people work together in a secure and trusting<br />

environment to increase the quantity and timeliness of titles available in an accessible format. Within trusted intermediary<br />

frameworks, DRM is an enabler of controlled access. A number of different security methods are being developed or are<br />

already in use for making content available in this way.<br />

As far as security is concerned, the higher the level the more likely publishers are to allow content to be made available<br />

in accessible digital formats. At present, the security systems used are simple, they use basic encryption technologies<br />

with key exchange mechanisms. The potential for the release of content is considerable – although there are few<br />

recorded instances of such occurring. Once decrypted, content is available to anyone, authorised or not. The ability to<br />

attach content to particular devices, or better to provide access only to authorised users, requires a level of DRM<br />

sophistication that is not yet generally in place in services catering to the needs of visually impaired people 19 . Current<br />

examples of such practices include countries like France 20 and Austria 21.<br />

2.4.1.3 Enterprise content management (ECM)<br />

ECM plays an important role because this will likely be the basis for future workflows in publishing environments as the<br />

tools and processes converge. In mid 2002, 20% of EU media and publishing enterprises with a website had content<br />

management systems in place, though only 3% had supply chain CMS 22 . As illustrated in Figure 1 the content production<br />

process consists of many different steps and there are also different actors involved in those processes. ECM plays an<br />

important role in being the <strong>cen</strong>tral element of these processes since all the data can be captured, managed, stored,<br />

preserved, and delivered within the organisation and thus support the publishing process. It is obvious that integrating<br />

the accessibility requirements is an important issue related to ECM and publishing environments. Accessibility<br />

considerations should be taken into account during the whole publishing process rather than at the end as an add-on<br />

feature and so correspond with the design for all principles 23 .<br />

2.4.1.4 Electronic Publishing<br />

The landscape of electronic publishing is constantly evolving but as technology changes ever faster there is an ever<br />

increasing need for publishers to be able to archive their content, easily retrieve it and export it in a wide variety of<br />

necessary output formats. This is a complex and multi layered process and as with any workflow there are certain points<br />

in the digital publishing chain that require specific knowledge and processing methodologies in order to manage them<br />

18 http://www.afb.org/Section.asp?SectionID=4&TopicID=222&DocumentID=1224<br />

19 http://www.indicare.org/tiki-read_article.php?articleId=169<br />

20 du Bourguet, Guillaume; Guillon, Benoit; Burger, Dominique (2003): Helene: a collaborative server to<br />

create and securely deliver documents for the blind. Proceedings AAATE 2003<br />

21 Miesenberger, K.; Ruemer, R.: Schulbuch Barrierefrei (Accessible School Books) - Co-operation Between<br />

Publishers and Service Providers in Austria. In: Proceedings of the 10th International Conference on<br />

Computers Helping People with Special Needs(ICCHP '06 ), Linz, Springer, pp32-39<br />

22 OECD report on scientific publishing, DSTI/ICCP/IE(2004)11/FINAL<br />

23 Darzentas, J., Miesenberger, K.: Design for All in Information Technology: a Universal Concern (Keynote),<br />

in: Andersen, K., V., Debenham, J., Wagner, R. (eds.): Database and Expert Systems Applications, 16th<br />

Internaitonal Conference, DEXA 2005, Copenhagen, Denmark, August 2005, Proceedings, Springer LCNS<br />

3588, Berlin/Heidelberg 2005, pp. 406 – 420<br />

13


CWA <strong>15778</strong>:2008 (E)<br />

successfully. These include archiving data and legacy document usage 24 , data exchange, intellectual property rights and<br />

Digital Rights Management 25 , and finding the best practice models that work for the widest range of publishers 26 .<br />

Though each publisher may have specific needs that differ in the broadest sense, there are common issues for all. These<br />

include, but are not limited to, the need for logical document structure that not only makes sense to readers but also<br />

makes it possible to retrieve, index and update a document by using metadata to identify the various parts of an<br />

electronic document. Processes such as these are essential in order to modify the document format or extract relevant<br />

parts of a document for insertion in another. Much technical work has therefore to be done to create methodologies and<br />

mechanisms to realise and develop these processes.<br />

2.4.1.5 XML-Publishing<br />

There is a transition underway from the limited tag set and functionality of SGML to XML based tools and procedures and<br />

there are various XML based standards, specifications and initiatives for the printing and publishing industry 27 , one of the<br />

most powerful being DITA 28 .<br />

DITA is an XML-based specification for modular and extensible topic-based information. DITA provides a model for<br />

defining and processing new information types as specialisations of existing types. DITA populates the model with an<br />

extensible hierarchy of standard types. DITA encourages re-use by reference either of topics or of fragments of topics.<br />

DITA topics:<br />

<br />

<br />

<br />

can be assembled in different combinations for many deliverables or output formats;<br />

are optimised for navigation and search;<br />

are well suited for concurrent authoring and content management.<br />

DITA is customisable, which allows for the introduction of specific semantics for specific purposes without increasing the<br />

size of other DTDs, and which allows the inheritance of shared design and behaviour and interchangeability with<br />

unspecialised content 29 .<br />

2.5 Current problems<br />

Although initiatives and projects for incorporating accessibility in publishing are ongoing 30 , the current situation relating to<br />

the publishing process is the following:<br />

<br />

<br />

There is a lack of knowledge of accessibility related standards and formats within the publishing<br />

industry. Experiences in the work with publishers also show, that there are different understandings<br />

of “what is structure” between publishers and the accessible content producing communities.<br />

There is also a lack of knowledge of publishing workflows and standards within specialist accessible<br />

content producing communities. This lack of understanding makes it hard for consulters and experts<br />

to give comprehensive suggestions to publishers.<br />

24 http://www.dlib.org/dlib/january00/01hodge.html<br />

25 http://www.publishers.org/press/pdf/DRMWhitePaper.pdf<br />

26 http://www.tbs-sct.gc.ca/im-gi/references/pub/epub-topic_e.asp<br />

27 http://publishing.xml.org/standards/index.shtml#ice<br />

28 http://dita.xml.org/<br />

29 http://xml.coverpages.org/DITA-OASIS-CFP.html<br />

30 BMSG-533048/0001-V/10/2004: Project funded by the Austrian government, for making educational<br />

material accessible using TEI-XML (creation), produce those in formats like large print Braille, PDF and HTML<br />

(production) and distribute them electronically in a variety of formats and taking into account DRM (Digital<br />

Rights Management)<br />

14


CWA <strong>15778</strong>:2008 (E)<br />

<br />

<br />

<br />

<br />

There is a perception that the provision of digital files in alternative formats will compromise technical<br />

protection measures. In the digital world, where copyright infringements cost companies millions of<br />

Euro each year 31 , publishers are understandably afraid, to lose the control over the usage and<br />

distribution of their content.<br />

There is a perception that the provision of accessible format materials is expensive and timeconsuming.<br />

This perception might also be a result of the fact that there is less knowledge on<br />

Accessible Information Processing outside of the accessible content producing communities.<br />

There is a lack of knowledge of the scale of the accessible format market. Its is hard for publishers to<br />

estimate the monetary cost-benefit ratio, since there are less comprehensive statistics and market<br />

estimations for accessible content products. One should also take into account, that accessible<br />

content is also usable for mainstream users and this means also an increase of the target market.<br />

There is no <strong>agreement</strong> on a single unified accessible format which meets the stakeholders<br />

requirements. The file formats used by publishing companies is very specific to the requirements in<br />

the publishing environment. But when it comes to produce an accessible document of the same<br />

content , the requirements of the formats are entirely different to those in the publishing field. The<br />

problem is to unify all the requirements under one common file format.<br />

2.6 Influencing factors in Document Processing for Accessibility<br />

This section describes the different factors that affect on one hand the publishing process and on the other hand the<br />

results of the process – the Documents and the way they can be accessed.<br />

2.6.1 Structure of digital source file<br />

By speaking of structure we have to distinguish between the implicit structure of a document (structure that is only<br />

visually recognisable) and the explicit structure of a document (semantic structure or structure that is describing the<br />

document on a META-Level). Examples for implicit structures are the visual formatting of a text document through line<br />

breaks etc. Examples for explicit structures are the usage of structure elements to describe the content e.g. in HTML this<br />

would mean to use for a Heading or in a Microsoft Word document this would mean to use the “Heading 1”-Format<br />

option to mark a text as heading. If a document has such an explicit structure this makes it possible to transform the<br />

information without loosing the structure of the document. Structure is also the basic requirement and an fundamental<br />

prerequisite for accessible content.<br />

But since publishing is a very diverse process there are many different formats involved. For example, one author might<br />

hand his script to the publisher in Microsoft Word format while another author might do the same but uses a TeX-based<br />

format. Within one file format, for example Microsoft Word, there is also wide range of possibilities for structuring the<br />

document 32 (e.g. by using Headings, Lists, etc.). In Desktop Publishing Tools (DTP), the structure is usually only made<br />

visible through visual formatting of the content. Other file formats do not or only minimally support content structuring (e.g.<br />

plain text format). It can therefore be said that there is not one typical method of making content accessible but that each<br />

case must be looked at individually.<br />

2.6.2 Different accessibility requirements of target user groups<br />

The different target groups (blind 33 , visually impaired 34 , 35<br />

or cognitive 36 ) have different needs in terms of how they<br />

access the information. These requirements differ even within the groups. For example, one visually impaired person<br />

31<br />

http://www.ipi.org/ipi%5CIPIPublications.nsf/PublicationLookupFullText/E274F77ADF58BD08862571F8001BA<br />

6BF<br />

32 http://www.webaim.org/techniques/word/<br />

33 http://www.webaim.org/articles/visual/blind.php<br />

34 http://www.webaim.org/articles/visual/lowvision.php<br />

35 http://www.webaim.org/articles/visual/colorblind.php<br />

15


CWA <strong>15778</strong>:2008 (E)<br />

needs the material in a large print form, while another visually impaired person with a form of colour blindness might<br />

need the information with a different foreground-background colour setting. Besides that, the requirements for a deaf<br />

person are again totally different from those of the groups mentioned before. This variety of needs for different types of<br />

output creates a challenge for the producer of accessible information.<br />

So it is important, particularly for the content authoring process, to consider the diverse range of user’s needs to ensure<br />

that they are met in the content authoring process. In order to better understand how the publishing industry can meet<br />

the accessibility needs of their users, a consultation took place between the members and the stakeholders of the<br />

<strong>workshop</strong>. Based on the results of this consultation, the next chapter elaborates on possible s<strong>cen</strong>arios for publishers<br />

introducing accessibility in their workflow.<br />

2.6.3 Different types of content<br />

Different modalities (text, image, video and audio) have different characteristics, purposes and uses. Every type of<br />

content raises also different issues in terms of accessibility. Images need a descriptive text to be accessible 37 and video<br />

or audio sequences should have a textual transcript 38 .<br />

When modalities are wrapped in particular aggregations or bundles, as they must be in publishing workflows (such as<br />

PDF containing text and images) we hit a huge increase in complexity and confusion because there are so many<br />

different aggregations, no universal <strong>agreement</strong> about terms used and what they mean in each aggregation. This CWA<br />

begins the process of simplifying this specific source of confusion by establishing a number of common s<strong>cen</strong>arios usable<br />

to support accessibility of different types of modality and aggregation.<br />

2.6.4 Different publishing domains<br />

Publishing processes in each domain (general, educational, scientific, etc.) have their own requirements. Some<br />

segments may be consumer-driven and some more producer-driven. For scientific content, stability and content review<br />

are important (as in scientific journals) and particular domains can present strong notational requirements that need tricky<br />

notational representation issues such as representing mathematics in device-independent formats.<br />

Many general domains may require only content distribution with little consumer feedback – this is useful in that it<br />

provides easy ways for publishers to control the processes and apply resources but it is difficult in that any feedback from<br />

customers about what is needed is less direct.<br />

The educational domain requires more flexibility so as to be able to meet the requirements of specific contexts and in<br />

many institutions adaptation for particular learners needs to be carried out. In many parts education also is rapidly<br />

shifting from a <strong>cen</strong>tral distribution (i.e. lecture or book) mode to one where learner participation and learner authoring<br />

needs to be provided for. This can create a serious challenge for publishing.<br />

Also the content itself varies among the different domains. A novel usually has a very flat structure while a biology book<br />

for 10 th graders has a lot of figures, tables and other elements.<br />

2.6.5 Target audience of content<br />

Cultural and religious issues notwithstanding, it is probably desirable that public content (governmental content, news,<br />

etc.) is accessible to the broadest range of people and audiences possible. Commercial content on the other hand may<br />

target particular niches markets (e.g. Product brochure of a machine manufacturer). Still, there are commercial<br />

advantages in expanding the range of a product to the widest market possible. The hidden demand for a product or<br />

feature or some particular content may be wider than the producer would anticipate (e.g. visually impaired, blind and/or<br />

dyslexic persons can benefit also from the production of structured audio books). A distinct advantage of creating<br />

accessible content is the enhanced usability for many other users who may not have a disability. In general we can say<br />

that accessibility interest and usability in general are two converging issues 39 .<br />

36 http://www.webaim.org/articles/cognitive/<br />

37 http://www.w3.org/TR/WAI-WEBCONTENT/#gl-provide-equivalents<br />

38 http://www.washington.edu/accessit/articles?79<br />

39 http://joshuakaufman.org/articles/pdf/Accessible_and_Usable_Web_Design-Kaufman.pdf<br />

16


CWA <strong>15778</strong>:2008 (E)<br />

2.6.6 Digital Rights Management (DRM), usage restrictions and li<strong>cen</strong>ce <strong>agreement</strong>s<br />

Blind, partially sighted and other print disabled people read electronic material by modifying the way in which it is<br />

presented, without modifying the content. They may do this through magnification, transformation into synthetic audio, or<br />

the use of a temporary, or "refreshable" Braille display. In some instances the software with which to make these<br />

changes is incorporated in mainstream packages, but the most flexible and adaptable solutions are achieved via<br />

dedicated "screen reader" software. The term "assistive technology" is used in this document to refer to this form of<br />

access.<br />

Digital rights management schemes, or the technological protection measures within them, can react to assistive<br />

technology as if it was an illicit operation. Thus, the DRM systems applied to e-Books and e-documents can prevent<br />

access by people who use assistive technology to read the screen or to control the computer 40 .<br />

We see that usage restrictions and li<strong>cen</strong>ce <strong>agreement</strong>s can present a particular challenge for accessibility because<br />

making content accessible for some particular context or user may require adaptation of the content which in turn may<br />

require access to parts of the content or even modification of the content using tools other than those it is provided with.<br />

These <strong>agreement</strong>s may be good for producers and intermediate suppliers (such as educational establishments) but<br />

unless crafted very carefully may not serve the needs of actual consumers well.<br />

40 http://www.indicare.org/tiki-read_article.php?articleId=170<br />

17


CWA <strong>15778</strong>:2008 (E)<br />

3 Formats for document processing<br />

In order to systematically describe the processes needed for accessible information processing a three layer generic<br />

system architecture is used to categorise the formats. The formats used can be divided into three categories: input<br />

formats, interchange formats and output formats. Some of the formats belong to more than one category.<br />

Input formats refer to any format that is used to add information to the start of a process. In some cases this involves<br />

getting analog formats such as paper formats digitised in order to convert those to more accessible formats for computer<br />

use. In other cases, the input layer will be the output from another process and may be a purely machine readable format.<br />

For accessible information processing the following formats are often used for input to processes:<br />

Printed paper<br />

<br />

<br />

<br />

<br />

<br />

Printed Braille<br />

ASCII Text<br />

HTML<br />

XML<br />

Multi-type composite formats<br />

Interchange and storage formats (sometimes referred to as representation formats) are extremely important. They are<br />

the formats in a system upon which the main logic or intelligence acts. For accessible information processing it is<br />

essential that this stage of the system is well designed. As systems and technologies become more advanced, these<br />

formats will likely reduce to purely XML formats of well specified international standardized multi-type composite formats.<br />

For the purpose of this document, the following formats have been identified as interchange formats:<br />

ASCII Text<br />

<br />

<br />

<br />

HTML<br />

XML<br />

Multi-type composite formats<br />

Output formats describe the result returned from a completed process. These can be any of the available formats as<br />

they can deliver a format to an end user (be it an actor or another process).<br />

Printed paper<br />

<br />

<br />

<br />

<br />

<br />

<br />

Printed Braille<br />

Audio<br />

ASCII Text<br />

HTML<br />

XML<br />

Multi-type composite formats<br />

3.1 Printed paper<br />

Printed paper is an essential part to document production. It is the traditional end product of production processes, and<br />

can often be the input stage of a Digitisation process. Printed Paper is still the main format in our society for authors,<br />

magazine publishers etc. to make their content available to the public.<br />

Printed paper is an essential part of document production. It is the traditional end product of the production processes,<br />

and can often be the input stage of a digitisation process. How useful it is in terms of generating alternative versions from<br />

a printed paper material depends largely on the quality of the material.<br />

18


CWA <strong>15778</strong>:2008 (E)<br />

3.1.1 Clear print<br />

Many publishers are recognising the market need for (slightly larger format) books using 12 or 14 point text in medium or<br />

bold.<br />

The resulting text is easier to read for the growing number of people who have difficulty reading smaller sized print. This<br />

is increasingly becoming a viable alternative.<br />

3.1.2 Large Print (16 point or over)<br />

Large Print is a format based on Printed Paper. A Large Print document can be either a scaled copy of the original or, if<br />

the electronic source document exists and is available in a word processor format (Microsoft Word, OpenOffice,<br />

StarOffice,...), it is possible to scale the font size and, if necessary, change the colour of the font and background colour<br />

as well as the font itself to satisfy the reader’s needs and print the document in this special setup.<br />

Many readers need larger print than conventional publications have. The font used should be clear and simple, medium<br />

or bold and with good line spacing. For large print documents, the process of conversion depends on the complexity of<br />

the original document and also on the quality requirements for the output document. For simple text documents such as<br />

novels designed to be read in a linear form, production of large print versions can be quite easy: once the original<br />

document’s content is available in a word processing software, characters are increased to a predetermined font size.<br />

The software usually ensures that the text flows on to new pages depending on the page format chosen. The document's<br />

page numbers may be modified (adapted book's page numbers do not match the original page numbers any more). The<br />

page numbering may not be important for a novel but is likely to be relevant in a non-fiction or reference work. In this<br />

case the original page numbering is often marked in the text. Complex documents require more care in their conversion<br />

to large print. Documents containing cross references need to be thoroughly checked because of the changes made to<br />

page numbers 41 . Table of contents and indexes must be regenerated. Production of large print books may involve some<br />

of the techniques of prepress production normal to publishing.<br />

3.2 Printed Braille<br />

Braille has been in use by blind and visually impaired people since it was invented by Louis Braille in 1821. It exists as a<br />

code based on a series of 6 raised dots (scientific Braille uses 8 dots).<br />

Printed Braille is produced by physically pressing dots into paper so that the points can be read by the fingers of the blind<br />

reader. This is a mechanical process converting the digital file into the analog version. Documents can be produced on a<br />

machine like a typewriter which produces single pages or on a printing machine driven by a computer feeding it a Braille<br />

file converted from a text file. Braille is usually printed on larger pages than conventional books in order to allow enough<br />

characters on a line to make reading efficient. The paper is thicker (200 gram/m2 or more) to retain the dots through<br />

many readings. The result is that documents in Braille are much bigger than the conventional print version. With<br />

precision equipment Braille can be printed at both sides of a paper (Interpoint technique) which halves more or less the<br />

amount of paper needed (but not its volume!). For example, a Braille version of the Bible takes up the same shelf space<br />

as 24 box files. There are methods of reproducing printed Braille using the pages as moulds for thin sheets of special<br />

plastic (thermoplast technique), but the current practice is to keep the digital file and print off copies when they are<br />

required.<br />

It is possible to convert existing Braille back to a digital Braille file by scanning, but this requires expensive machinery<br />

and there are only a couple of institutions regularly involved in this practice. It is being used to recover and archive rare<br />

documents such as music Braille where there is no easy route to a printed original which can be scanned. It is a<br />

therefore a finite process and will be less relevant in the future.<br />

Printed Braille output refers to any hard copy which represents Braille. It is the only and universal medium for blind<br />

people to read books without the use of modern information technologies 42 . Despite the new alternatives which came<br />

with Information and Communication technologies it is still very relevant to Accessible Information Processing. This is<br />

especially so when it comes to complex types of content e.g. math, music. This will be described in the following<br />

paragraphs.<br />

41 Unless they are softcoded.<br />

42 One should, however, been aware that only 10 % of the blind population can read Braille. This is due to the<br />

fact that elderly persons (who form the majority of blind people) cannot distinguish the dots anymore.<br />

19


CWA <strong>15778</strong>:2008 (E)<br />

3.2.1 Braille Music<br />

Braille music is a system of converting music notation to be represented in Braille code so that music can be read by<br />

visually impaired musicians. Almost all standard print music notations can also be written in Braille music notation. Braille<br />

music notation is an independent and well-developed notation system with its own conventions and syntax. This is<br />

constantly being expanded by a small group of transcribers around the world who communicate with each other to<br />

secure <strong>agreement</strong> as new instrumentation and musical figures are transcribed for the first time. It is therefore a genuinely<br />

universal notation standard.<br />

Visually impaired musicians' gain the same benefits by becoming musically literate through learning to read Braille music<br />

as do sighted musicians who learn to read printed music. It is therefore an important format in the AIP.<br />

3.2.2 Mathematics in Braille<br />

The need to convey mathematics linearly - without the use of special typesetting and, often, with a limited character set<br />

as well - is a common one. Linearising mathematical formulas is necessary for Braille. Contrary to music, there is no<br />

universal system in place for the linearization or for the codes used. Furthermore, some systems rely on six-dot Braille,<br />

others on 8 dots.<br />

Sometimes the use of spoken mathematics is promoted. Here again there are no universal rules on how to pronounce<br />

(complex) formulas.<br />

Mathematical Braille is largely country or region dependent. Therefore producers should bear in mind the location of the<br />

users they are producing for. Some of the better known codes are:<br />

The US Nemeth code 43 , 44<br />

The German Marburg code 45<br />

The UK Braille Mathematics Notation 46<br />

W3C's MathML 47<br />

In many circumstances the use of Latex for mathematics has been promoted. Latex is a typesetting system based on a<br />

combination of text and codes (somewhat similar to XML, but exclusively layout based). Latex has its own system for<br />

linearising mathematics which can be very useful for higher education purposes and screen reading but not for printing<br />

on paper as it is extremely verbose.<br />

A complete overview of mathematics handling and conversion between "standards" has been made in the European<br />

Lambda project 48 .<br />

3.3 Audio<br />

Audio files have been an integral part of specialist formats for some time. Originally metallic tape and vinyl were used to<br />

distribute spoken materials to print impaired users. These analogue formats have mostly been replaced by digital formats<br />

such as WAV, MP3 and other MPEG formats.<br />

43 More information on http://www.brailleauthority.org/Math-science.htm<br />

44 A. Nemeth. The Nemeth Braille Code for Mathematics and Science Notation 1972 Revision. American<br />

Printing House for the Blind, 1972.<br />

45 H. Epheser, D. Pograniczna, and K. Britz, Internationale Mathematikschrift für Blinde, Deutsche<br />

Blindenstudienanstalt, Marburg (Lahn) 1992.<br />

46 http://www.bauk.org.uk/docs/bmn.pdf<br />

47 http://www.w3.org/TR/MathML2/<br />

48 http://www.lambdaproject.org<br />

20


CWA <strong>15778</strong>:2008 (E)<br />

In modern workflows and supply chains, these are mostly packaged with some kind of multimedia framework such as<br />

SMIL 49 or MPEG 50 but audio can still be used in some very specific processes.<br />

Audio books are by far the most popular medium for the distribution of alternative format publication. Not only are the<br />

specialist libraries around the world producing far more audio than all the other formats put together but commercial<br />

audio is becoming an increasingly important part of the output of many major publishing houses.<br />

Most of the currently commercially available audio books are read by a professional narrator. Often these are actors but<br />

some authors read their own books which might make them more commercially attractive.<br />

Most of the currently commercially available audio books are distributed in uncompressed audio format (PCM).<br />

Commercial production using expensive narrators and professional studios meant that adding even a rudimentary<br />

structure was only a marginal cost. Most productions have at least chapter identifiers and many have chosen to add<br />

structure to a finer level, but always using the basic "one level tracks structure" of audio CDs.<br />

Existing audio CDs can hold a maximum of 80 minutes per disc 51 which results either in a large number of discs for most<br />

of the large books, or in abridged versions of those same works, which is an unacceptable solution for print disabled<br />

users wanting to access the full content of a certain book.<br />

Mp3 versions of books can be produced from a human narration or automatically through computer generated synthetic<br />

speech. The range of existing synthetic speech engines is very wide. Many languages are available though the most<br />

common languages tend to have the best speech quality because of the level of investment in development which<br />

reflects the potential market. A further advantage is that these files are considerably smaller than uncompressed audio,<br />

which allows producers to fit up to 10-15 times the standard duration of uncompressed audio CDs on one data CD<br />

(depending on the desired audio quality).<br />

3.4 ASCII Text<br />

ASCII refers to an internationally recognised text standard. In terms of Accessible Information Processing it is used to<br />

represent either a text file or codes which represent information in Braille. It is becoming increasingly obsolete as supply<br />

chains move towards XML formats, but ASCII remains the base element of much of the information archived by national<br />

specialist providers.<br />

3.5 HTML documents<br />

HTML is a very common output format. As a mark-up language, structure is given a high importance, so when correctly<br />

formatted, an HTML document can be searched and browsed in a very logical manner, and headings, paragraphs, links,<br />

lists, and so on can be used for full easy navigation. One of the advantages of HTML is that the files can be read on any<br />

web browser in any platform. A key disadvantage of HTML is that it is not possible to edit the content while reading it in a<br />

browser, as we can do when reading documents in text-processors. This may sometimes be necessary and can be<br />

compared to adding notes in a traditional book. 52<br />

It does allow for the user, however, to open the source file of the<br />

document and add information if they know how to do so.<br />

A bigger disadvantage is that the layout of the information is mixed in the same HTML document with the information<br />

itself and its structure, which may be a handicap when accessing the content.<br />

49 SMIL: Synchronized Multimedia Integration Language, http://www.w3.org/AudioVideo/<br />

50 MPEG Moving Picture Experts Group, http://www.chiariglione.org/mpeg/<br />

51 This is due to the initial purpose of audio CDs’: high quality recording of music. On audio CDs no trade off<br />

between quality and length of recording can be made.<br />

52 Editing of HTML based texts on the web can be done through WIKI technology, a system for collaborative<br />

working on one single document, mostly known from the wikipedia encyclopedia pages but applicable to other<br />

documents too (cf. EUAIN wiki below).<br />

21


CWA <strong>15778</strong>:2008 (E)<br />

The Internet is becoming for many the dominant information location and retrieval source. Accessibility of websites and<br />

search engines (based mostly on HTML) is therefore essential to the needs of visually impaired people. Different<br />

guidelines exist on how to structure correctly HTML files, as well as guidelines on how to make them fully accessible. 53<br />

3.6 XML<br />

The Extensible Markup Language (XML) is a W3C-recommended general-purpose markup language that supports a<br />

wide variety of applications. XML is a simplified subset of Standard Generalised Markup Language (SGML). Its primary<br />

purpose is to facilitate the sharing or the re-use of data across different areas of application. Formally defined languages<br />

based on XML allow diverse software to reliably understand information formatted and passed in these languages.<br />

Based on these standardised languages it is also possible to write converters to transform the information into other<br />

formats.<br />

The format is important for accessible information processing as it provides interchange formats for converting<br />

information to accessible information. XML is the name of the general family of several describing languages. Two<br />

implementations of XML (technically called DTDs or Schemas) are the TEI standard and the already mentioned DAISY<br />

standard.<br />

The Extensible Markup Language (XML) is a mark-up language for describing data regardless of its external appearance.<br />

XML is also focussing mainly on the structure of the content. In a different process numerous different transformation<br />

style guidelines can be applied to one XML document to present the information in the most suitable way. In the context<br />

of accessible information, XML has two main advantages over other formats – the first is that the elements of books like<br />

chapters, paragraphs, headings, can be perfectly represented by XML's structural elements; the second advantage is<br />

that, being a layout-agnostic language, different transformation style guidelines may be applied to the same content to fit<br />

different user’s needs and preferences.<br />

This format is therefore of great importance for accessible information processing as it provides interchange formats for<br />

converting information to accessible outputs. 54<br />

3.6.1 XHTML<br />

XHTML is in principle a reformulation of the older HTML 4 specification. The formal specification of XHTML can be read<br />

here http://www.w3.org/TR/xhtml1/, but in the context of accessible materials a number of points could be made to help<br />

understanding the concept. XHTML is an application of XML. This means that even though XHTML has the semantics of<br />

HTML, it has the syntax and the very strict rules of XML. There are both advantages and disadvantages compared to<br />

HTML:<br />

Among the advantages are that there are a number of tools that are specifically made for working with XML, called XMLparsers.<br />

These are built into software that use or read XHTML documents, and they make the processing of the<br />

document both easier and more accurate. Ironically, this creates a disadvantage for the user. Since XHTML documents<br />

must be valid (e.g. conform to the very strict rules of XML) they are difficult to create and edit by hand and as a<br />

consequence most XHTML documents are machine-generated.<br />

Software that requires valid XHTML documents include Digital Talking Book players and some e-text readers. It must be<br />

noted however, that most Internet browsers – such as Internet Explorer 6 and 7 - are very tolerant and can render<br />

XHTML documents even if they are not valid.<br />

3.6.2 TEI XML<br />

TEI is a specific implementation of the XML standard.<br />

53 HTML standard. ISO/IEC 15445:2002 Information technology - Document description and processing<br />

languages - HyperText Markup Language (HTML). HTML 4.01 Specification (HyperText Markup Language),<br />

http://www.w3.org/TR/html401/<br />

54 XML standard Extensible Markup Language (XML) 1.0 (Fourth Edition) 29 September 2006.<br />

http://www.w3.org/TR/REC-xml/<br />

22


CWA <strong>15778</strong>:2008 (E)<br />

The Text Encoding Initiative (TEI) Guidelines are an international and interdisciplinary standard that enables libraries,<br />

museums, publishers, and individual scholars to represent a variety of literary and linguistic texts for online research,<br />

teaching, and preservation. 55<br />

3.6.3 Daisy XML<br />

The term “DAISY” covers a variety of standards and specifications that are maintained and supported by the DAISY<br />

Consortium (www.daisy.org). DAISY is an acronym that means “Digital Accessible Information System”. The term<br />

covers at least 3 different standards and specifications:<br />

DAISY 2.02: A specification that defines the DAISY Digital Talking Book (DTB) format. DAISY 2.02 is by far the most<br />

widespread and popular of the DAISY formats and has gained support from both producers of digital talking books as<br />

well as producers of players and production software. When people talk about “a DAISY book” they are probably<br />

referring to DAISY 2.02.<br />

DAISY 2.02 is based on XHTML and SMIL (Synchronised Multimedia Integration Language – also an application of XML).<br />

A number of different audio formats can be used in a DAISY 2.02 book – the most common one used being MP3. A<br />

DAISY 2.02 book can contain the full text of a printed work as well as pictures and other multimedia content, or it can be<br />

an audio only book.<br />

The heart of a DAISY 2.02 book is the NCC file (Navigation Control Centre) that both presents itself to the user as the<br />

Table of Content and works as the reading software’s main system file. The NCC is always present in a DAISY 2.02 book<br />

even if it is without text.<br />

DAISY 2.02 books can be read using either a hardware player or a computer with software for this purpose. DAISY 2.02<br />

is platform independent. A number of commercially available production tools for producing DAISY 2.02 books are<br />

presented on the DAISY Consortium website. 56<br />

Daisy/NISO Standard 2005 (aka DAISY 3/ Z39.86): This standard covers a complex of sub-specifications and Document<br />

Type Definitions (DTDs) that together constitute a Digital Talking Book. Compared to DAISY 2.02 this standard is very<br />

advanced and can be used to represent any book in text and audio format.<br />

The sub-systems in a DAISY 3 book include:<br />

The manifest, that contains a complete list of all the files that make up the DTB<br />

The Spine, that defines the reading order of the DTB<br />

Tours, that defines alternative reading orders of the DTB<br />

The heart of the DAISY 3 book is the NCX file that functions in much the same manner as the NCC file does in a DAISY<br />

2.02 book. Unlike the NCC, the NCX represents the structure of the book in a true hierarchy.<br />

Many of the features of DAISY 3 are drawn from Open eBook Publication Structure Version 1.2. DAISY 3 is a<br />

fairly new standard and books are not yet widely distributed by DTB producers. Some producers use DAISY 3<br />

as a production and interchange format where the great richness in detail makes it ideal for XML/XSLT<br />

transformations to other formats, mainly DAISY 2.02 for distribution. 57<br />

DTBook: The text component of a Z39.86 book consists of one ore more DTBook files. The DTBook<br />

specification has, however, found widespread use outside the framework of the Digital Talking Book. DTBook<br />

is an element set that represents European/American book tradition in an XML context. The element set<br />

consists of 79 elements, but can be extended with e.g. math, poetry etc. DTBook is specially suited as an<br />

55 Guidelines for Electronic Text Encoding and Interchange (2002). http://www.tei-c.org/P4X/<br />

56The full specification can be seen here: http://www.daisy.org/z3986/specifications/daisy_202.html<br />

57 The standard can be seen here: http://www.niso.org/standards/resources/Z39-86-2005.html<br />

23


CWA <strong>15778</strong>:2008 (E)<br />

interchange format and is widely used as such. Using XML tools like XSLT, a DTBook file can be transformed<br />

into PDF, RTF (Word), XHTML, HTML and virtually any other text format. 58<br />

NIMAS (National Instructional Materials Accessibility Standard): A standard that was developed 2002-2004<br />

by the National File Format Technical Panel. NIMAS is a sub-set of Z39.86.<br />

“NIMAS is a technical standard used by publishers to produce source files (in XML) that may be used to develop<br />

multiple specialised formats (such as Braille or audio books) for students with print disabilities.<br />

The source files are prepared using Extensible Markup Language (XML) to mark up the structure of the original content<br />

and provide a means for presenting the content in a variety of ways and styles. For example, once a NIMAS fileset has<br />

been produced for printed materials, the XML and image source files may be used to create Braille, large print, HTML<br />

versions, DAISY talking books using human voice or text-to-speech, audio files derived from text-to-speech<br />

transformations, and more.<br />

The separation of content from presentation is an important feature of the NIMAS approach. In most cases, a human will<br />

need to enhance the source files to provide additional features needed by diverse learners.<br />

The various specialised formats created from NIMAS file sets may then be used to support a very diverse<br />

group of learners who qualify as students with print disabilities. It is important to note that most elementary<br />

and secondary educational publishers do not own all of the electronic rights to their textbooks and related<br />

core print materials and a copyright exemption allows them to deliver the electronic content of a textbook and<br />

the related core print materials to the NIMAC, a national repository which began operations on 12/3/06, as<br />

long as the publishers possess the print rights. NIMAS applies to instructional materials published on or after<br />

7/19/06. “ 59<br />

3.6.4 MathML<br />

Mathematical Markup Language (MathML) is an application of XML for describing mathematical notation and capturing<br />

both its structure and content. It aims at integrating mathematical formulas within other XML documents. It is a<br />

recommendation of the W3C math working group.<br />

3.7 Multi-type composite formats<br />

Those formats are especially important to the publishing industry, because they are widely used and part of the<br />

publishing process, In general we consider here:<br />

<strong>Education</strong>al multimedia documents<br />

<strong>Education</strong>al publishing may require a greater degree of flexibility than other forms of publishing<br />

because of the direct engagement with the persons using the publication. Typically it is more<br />

interactive (as it must engage the learner’s attention in a more demanding context). <strong>Education</strong> often<br />

involves direct learner participation, authoring, and other processes such as assessment. Content<br />

and processes must match the learner’s particular context and requirements as closely as possible<br />

because the closer the match the better the learning outcome. This is very demanding. Given the<br />

huge variety and individual nature of assistive technology and personal adaptations (such as<br />

cognitive adaptations) it is unrealistic to expect that content producers can tailor content for each<br />

circumstance or even that they can have the accessibility knowledge necessary to do so.<br />

<br />

Scientific, technical and medical documents (STM)<br />

The global market for English-language STM (scientific, technical and medical) journals is about $5<br />

billion. The industry employs 90,000 people globally, of which 40% or 36,000 are employed in the EU.<br />

Another 20– 30,000 full time employees are indirectly supported 60 One of the main typesetting<br />

58 A comprehensive set of guidelines for applying DTBook mark-up is available from the Daisy Consortium.<br />

(http://www.daisy.org/z3986/guidelines/sg-daisy3/structguide.htm). Other sources: Theory behind the DTBook DTD<br />

(http://www.daisy.org/publications/docs/theory_dtbook/theory_dtbook.html)<br />

59 From the NIMAS web site (http://nimas.cast.org/about/nimas/index.html):<br />

60 http://www.stm-assoc.org/storage/Scientific_Publishing_in_Transition_White_Paper.pdf<br />

24


CWA <strong>15778</strong>:2008 (E)<br />

3.7.1 PDF<br />

systems used is TeX, and the main document mark-up language and document preparation system<br />

is LaTex. It is widely used by mathematicians, scientists, philosophers, engineers, and scholars in<br />

academia and the commercial world, and by others as a primary or intermediate format (e.g.<br />

translating DocBook and other XML-based formats to PDF) because of the quality of typesetting<br />

achievable by TeX. It offers programmable desktop publishing features and extensive facilities for<br />

automating most aspects of typesetting and desktop publishing, including numbering and crossreferencing,<br />

tables and figures, page layout and bibliographies 61 .<br />

In the past, Adobe PDF files could be very inaccessible, especially to people using screen readers. When the PDF is<br />

made by attaching an image of a page to another it is still completely inaccessible as screen readers have no text to read.<br />

This began to change with Acrobat 5, when Adobe introduced the ability to tag PDF files for accessibility. Although PDF<br />

tags could not be manipulated as easily as HTML tags, they made the content more accessible to some users with<br />

screen readers. Adobe Reader 7 continues to improve the user's access to PDF files offering the possibility to customise<br />

preferences extensively. Additionally, Adobe has included a DRM (Digital Rights Management) mechanism into Acrobat<br />

Reader, but there are also several other DRM plug-ins that work with Acrobat Reader.<br />

Due to the major characteristic of PDF, the fact that it is rendered the same no matter on which viewer or operating<br />

system it is viewed on, PDF has become the most popular format used by publishers and increasingly among other<br />

content creators. With PDF the difference between an accessible and an inaccessible document depends on proper<br />

usage of the programmes used to create PDF files. Much more education and training in creating accessible PDF is still<br />

needed.<br />

3.7.2 QuarkXpress® file format<br />

Quark®, provides a layout software, called QuarkXpress®, that used for writing, editing, and typography with colour and<br />

pictures to produce rich final outputs for print and Web 62 . It is used by more than three million users worldwide. The<br />

software is used mainly for creative design and page layout. QuarkXpress Version 7, provides a Voluntary Product<br />

Accessibility Template 63 (VPAT) that details the accessibility features of Quark's product in order to help customers to<br />

determine its compliance with Section 508.<br />

With QuarkXPress users can import and export XML Documents. With Quark Digital Media Server content can be stored<br />

in a <strong>cen</strong>tral database. It then can be used in multiple forms according to the principles of multi-channel publishing. Quark<br />

XTensions software, which are plugins, can automate functions and eliminate repetitive steps with palettes, commands,<br />

tools, and menus. Tests with QuarkXPress 6.5 Passport (international Edition) showed that QuarkXPress was not able to<br />

import the TEI-DTD. To tag the text of the book, a new, flat DTD had to be written. With the new DTD the mapping from<br />

layout formats to XML tags was possible. The content then is exported into a XML file. This is the basic version for the<br />

accessibility work.<br />

3.7.3 In Design file format<br />

Adobe® markets InDesign® to produce professional page layouts 64 . Riche and complex documents and outputs for<br />

multiple media. Adobe InDesign CS3 software version, supports accessible cross-media publication, allowing export of<br />

InDesign documents into PDF, XHTML, and XML. Users can add tags and alternative text attributes to InDesign<br />

documents that support the production of accessible content in these exported formats. 65<br />

InDesign from Adobe Inc. is a desktop publishing application (DTP) which can work with XML files. It is possible to import<br />

XML into InDesign and then prepare the document for output e.g. printed book. This feature is an important step toward<br />

61 http://en.wikipedia.org/wiki/LaTeX<br />

62 http://www.quark.com/products/xpress/<br />

63 http://www.quark.com/products/xpress/pdf/VoluntaryProductAccessibilityTemplate.pdf<br />

64 http://www.adobe.com/products/indesign/<br />

65 http://www.adobe.com/accessibility/products/indesign/<br />

25


CWA <strong>15778</strong>:2008 (E)<br />

multi-channel and cross-media publishing. Tests with Adobe InDesign CS2 showed that it is possible to tag the text of the<br />

layout document. Further investigations are done to efficiently map layout to the structure. InDesign supports the<br />

mapping of text-formats to XML-Tags but the structure had to be added afterwards. The mapping feature can be used, if<br />

the text is in a proper layout. Otherwise the user has to mark the specific text area (e.g. one chapter) and then to assert<br />

the XML tag to the text.<br />

3.7.4 Digital Talking Book (DTB) documents<br />

For many years, "talking books" have been made available to print-disabled readers on analog media such as<br />

phonograph records and audiocassettes. These media served their users well in providing human-speech recordings of<br />

a wide array of print material in increasingly robust and cost-effective formats. However, analog media are limited in<br />

several respects when compared to a printed book. Firstly, they are by their nature linear presentations, which leave<br />

much to be desired when reading reference works, textbooks, magazines, and other materials that are often accessed<br />

randomly. In contrast, digital media offer readers the ability to move around in a book or magazine as freely as (and more<br />

efficiently than) a sighted reader flips through a print book. Secondly, analog recordings do not allow users to interact<br />

with the book by placing bookmarks or highlighting material. A DTB offers this capability, storing the bookmarks and<br />

highlights separate from, but associated with, the DTB itself. Thirdly, talking book users have since long complained that<br />

they do not have access to the spelling of the words they hear. As will be explained below, some DTBs will include a file<br />

containing the full text of the work, synchronised with the audio presentation, thereby allowing readers to locate specific<br />

words and hear them spelled. Finally, analog audio offers readers only one version of the document. If, for example, a<br />

book contains footnotes, they are either read where referenced, which burdens the casual reader with unwanted<br />

interruptions, or grouped at a location out of the flow of the text, making them difficult for interested readers to access. A<br />

DTB allows the user to easily skip over or read footnotes. The Digital Talking Book offers the print-disabled user a<br />

significantly enhanced reading experience -- one that is much closer to that of the sighted reader using a print book.<br />

The DTB goes far beyond the limits imposed on analog audio books because it can include not just the audio rendition of<br />

the work, but the full textual content and images as well. Because the textual content file is synchronised with the audio<br />

file, a DTB offers multiple sensory inputs to readers, a great benefit to, for example, learning-disabled readers or people<br />

with dyslexia. Some visually impaired readers may choose to listen to most of the book, but find that inspecting the<br />

images provides information not available in the narrative flow. Others may opt to skip the audio presentation altogether<br />

and instead view the text file via screen-enlarging software. Braille readers may prefer to read some parts or the entire<br />

document via a refreshable Braille display device connected to their DTB player and accessing the textual content file.<br />

Digital Talking Books are not tied to a single distribution medium. CD-ROMs will be used first but DTBs are portable to<br />

any digital distribution medium capable of handling the large files associated with digital audio recordings. Regardless of<br />

how a DTB is distributed, however, it will normally be in the context of a digital rights management system.<br />

One implementation of DTBs is based on the Daisy Standard (DAISY 2.02 / ANSI/NISO Z39.86-2005). Daisy books are<br />

described above in the sections 3.6.3.<br />

This standard for creating digital content in structured multimedia is developed and maintained by the DAISY Consortium.<br />

Using XML text files and MP3 audio files, the DAISY format can create a range of text only, fully synchronised text and<br />

audio and audio-only books that are fully accessible and navigable for blind and visually impaired users as well as people<br />

with other disabilities such as dyslexia. It allows up to 6 levels of structure (chapter, subchapter, paragraph, and so on) as<br />

opposed to the one-level structure of commercial audio CDs, which makes it suitable for complex books like educational<br />

materials.<br />

The DAISY standard has been adopted as the standard to be used by publishers in the United States of America to<br />

comply with the Instructional Materials Accessibility Act (2002). In Europe DAISY is also used by a wide range of<br />

alternative media publishers to create accessible material.<br />

26


CWA <strong>15778</strong>:2008 (E)<br />

4 Considerations for structuring documents<br />

Structured information is the first step in the accessible information process. A document whose internal structure can be<br />

defined and its elements isolated and classified, without losing sight of the overall structure of the information, is a<br />

document that can be navigated.<br />

Most adaptive technology allows the user to access a document, and to read it following the "outer" structure of the<br />

original. But if the same information has also an "inner" structure that allows the adaptive device to distinguish between a<br />

phrase and a measure, between a paragraph and a sentence, highlighting particular annotations, then the level of<br />

accessibility (and therefore usability) of the whole document will be greatly enhanced, allowing the user to move through<br />

it in the same way as those without impairments do when looking at a printed document, and following the same integral<br />

logic. In an ideal world, all documents made available in electronic formats should contain this internal structure that<br />

benefits everyone. Highly-structured documents are becoming more and more popular due to reasons that very seldom<br />

pertain to making them accessible to people with disabilities.<br />

The move to XML related formats and associated standards for metadata have provided an impetus for far greater<br />

document structuring than before. Whatever the reasons behind those decisions are, the use of highly-structured<br />

information is of great benefit to anybody accessing them for any purpose. In re<strong>cen</strong>t years, the market for accessibility<br />

and assistive technologies has started to gain recognition. It is clear that the integration of accessibility notions into<br />

mainstream technologies would provide previously unavailable opportunities in the provision of accessible multimedia<br />

information systems. It would open up modern information services and provide them to all types and levels of users, in<br />

both the software and the hardware domain. Additionally, new consumption and production devices and environments<br />

can be ad- dressed from such platforms and this would provide very useful information provision opportunities indeed,<br />

such as information on mobile devices with additional speech assistance.<br />

Structuring content is essential for the subsequent processing and transformation of it into accessible content.<br />

Document style sheets and guidelines need to be followed though out the content creation process, so to<br />

enable successful conversions into accessible content and eliminate costly post-editing tasks. For that reason,<br />

careful consideration of styles and usage <strong>agreement</strong>s with stakeholders are of special importance. The next<br />

sections present the steps that need to be introduced in publishing workflows to enable accessible content<br />

production. These workflow steps listed below are also part of the s<strong>cen</strong>arios introducing accessibility within publishing<br />

workflows (section 0) and given here for allowing the reader to refer back when reading this section.<br />

4.1 Define and use document style guidelines<br />

This activity basically defines style guidelines that will be used by all actors in the accessible content processing value<br />

chain. Introducing style guidelines requires an <strong>agreement</strong> between all actors in the publishing chain. These style<br />

guidelines can be used to tag specific content and map it into a specific conversion. The consistent use of style<br />

guidelines through the whole publishing chain enables efficient processing and automatic structuring of content and thus<br />

enhances significantly the accessibility of it. Actually it is not necessary for all actors use the same style guidelines but<br />

only if the mappings between the used style guidelines exist (which is not always the case!) and are well known.<br />

There are several style guidelines available, very commonly used are those based on the Chicago Manual of Style 66<br />

4.2 Define and use structure guidelines<br />

In this specific activity guidelines that define structure are agreed between the relevant actors. Structural guidelines are<br />

more abstract than style guidelines. They actually define how structure is tagged in the content. In the case of MS Word,<br />

structural guidelines will need to define how different headings are tagged. By consistently following the structural<br />

guidelines, automatic conversion can be achieved. Structure also deals with tagging of sections that contain graphics,<br />

images, drawings, other rich media, math, music etc. Structural guidelines also enable navigation on the structural<br />

elements and by doing so ease information retrieval and content consumption. A good point to start and examples on<br />

how to handle images or drawings can also be found in the structure guidelines for DAISY at<br />

http://www.daisy.org/z3986/structure/<br />

66 http://www.chicagomanualofstyle.org/home.html<br />

27


CWA <strong>15778</strong>:2008 (E)<br />

4.3 Edit / add structure where needed<br />

This activity deals with editing or adding structural elements, where either those were lost in the conversion process, or<br />

have not been tagged in the beginning because of inconsistent use of style guidelines. If it is a post-processing operation<br />

then it requires a lot of manual effort. Therefore, those tasks should be kept to a minimum.<br />

In order to edit or add structure in this task, guidelines on how to add structure by hand need to be developed and used<br />

consistently as well. A list of relevant guidelines has been compiled by the EUAIN project and can be found in the<br />

literature 67<br />

4.4 Edit DRM settings<br />

Unless very carefully handled, DRM has the potential to disable adaptation of content to make it accessible in another<br />

context. For example, a format may include text that is locked into the format, cannot be copied and is not accessible<br />

with assistive technology such as a screen-reader. The text may be only accessible as a bitmapped image, meaning that<br />

it cannot be simplified or rendered in some other format. Whatever the format and tool the content is produced with, it is<br />

unlikely that the designer will have considered and provided for all the contexts in which it may be used. For best results<br />

the separate modalities within material need to be available for other software and hardware tools that are able to make<br />

the material accessible for unforeseen contexts to use. This usually means leaving the text unlocked 68 . This topic is for<br />

expansion in further work.<br />

4.5 Adaptation<br />

Adaptation for a context typically will involve matching the content to the context and making changes to meet that<br />

context. This is much easier if the content exists in a form that can be taken apart (disaggregated), supplemented with<br />

other content (such as subtitling a video) then put together again. The reason for this is that because adaptations are<br />

often needed for only parts of the material and not for all of it. It may be that a visually-impaired person cannot use the<br />

diagrams in the content, for example, so that it is necessary to include alternatives for the diagrams in the content.<br />

In all cases it will be necessary to examine both the context AND the material to see if they match. If, in the production<br />

chain, this process takes place close to the user it is more likely that the materials will match the user’s needs. In some<br />

cases, such as described in s<strong>cen</strong>ario 11, the user’s requirements will be directly available. In others it will be necessary<br />

to incorporate many alternatives within the content (so that the learner can select the modalities she can use) but still to<br />

allow for the content to be adapted as late as possible to meet unanticipated user needs.<br />

When the user’s requirements are not directly available, as in a pre-sale publication process, it is useful to test the<br />

material against such checkpoints as are provided in sets of guidelines elsewhere in this document. For web content the<br />

Web Accessibility Initiative Web Content Accessibility Guidelines can form a useful base to do this and a number of<br />

automated tools that can assist with this process are available. It is important to remember that doing this is using virtual<br />

or average requirements and will not meet all circumstances but does help.<br />

67 http://wiki.euain.org/doku.php?id=wiki:guidelines_for_accessible_information_processing<br />

68 This topic was studied in the European SEDODEL project<br />

(http://canada.esat.kuleuven.be/docarchwebsite/show.jsp?page=projects&id=SEDODEL) but due to rapid<br />

changes in DRM technology, the work should be continuously updated.<br />

28


CWA <strong>15778</strong>:2008 (E)<br />

5 Conversion processes<br />

This list of conversion processes is not exhaustive but it should cover the most important conversions in the accessible<br />

content processing workflows. The conversions listed below are also part of the s<strong>cen</strong>arios introducing accessibility within<br />

publishing workflows (section 0) and given here for allowing the reader to refer back when reading this section.<br />

5.1 Convert Multimedia Material to structured Multimedia Material<br />

By multimedia material we mean all content that consists of different media types. Multimedia content can be a Microsoft<br />

Word document that consists of text and images but it can also be a video sequence. In most of the following s<strong>cen</strong>arios<br />

the term Multimedia Material is used for PDF files.<br />

Accessible PDF can be seen in some cases as an output format which is passed to the reader. Sometimes it might be an<br />

interchange format from which further transformations into other formats are performed.<br />

If the accessible PDF is intended as output format, creating accessible tagged PDF files will produce PDF files<br />

accessible to standard screen readers which support tagged PDF (like JAWS and Window Eyes).<br />

This circumvents the need for end users to learn how to use Adobe's embedded speech synthesiser.<br />

However, it is not always easy to make PDF files directly accessible to screen readers. Documents with complex layouts<br />

can be extremely difficult, if not impossible, to convert into an accessible PDF file, due to the fact that the content does<br />

not linearise correctly. It can also be very challenging to make documents with extensive charts or with embedded videos<br />

accessible.<br />

Converting PDF into accessible PDF may include the following steps (a detailed description of these steps can be found<br />

on the EUAIN Training Resource Centre 69 ):<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

Apply OCR to Image only PDF<br />

Converting Existing PDF Files into Tagged PDF Files<br />

Change Tag Type<br />

Add Alternate Text to Images<br />

Create New Tag<br />

Delete Tag<br />

Reorder Tags<br />

Reordering Tags Using the Order Tab<br />

Artefacts<br />

Adding Tags to Untagged PDF Files<br />

Add Tags to Documents Feature<br />

Add All Tags Manually<br />

TouchUp Reading Order Feature<br />

Adding Tags Using TouchUp Reading Order<br />

69 http://wiki.euain.org/doku.php?id=wiki:processes:conversions:html_to_xml<br />

29


CWA <strong>15778</strong>:2008 (E)<br />

5.2 Convert structured Multimedia Material to XML<br />

By structured Multimedia Material we mean multimedia content that has information on its structure included. This can be<br />

a fully navigable audio document, a Microsoft Word document that makes correct usage of heading etc or a fully<br />

accessible video document with subtitling. In most of the s<strong>cen</strong>arios we mean accessible PDFs when speaking of<br />

structured Multimedia Material.<br />

When Multimedia Material is structured it makes it possible to transform the Material into various output formats. In case<br />

of PDFs it is, if possible generally more cost effective to start from the original word processing documents. Quite often<br />

the original file used to create the PDF is unavailable. In that case you can create a XML file using Acrobat, but the file<br />

will probably be more complex and will require more post processing and restructuring work to make it accessible.<br />

If you have images, only the alternate description will be saved, but not the image, and there are no tables in the HTML<br />

file, even if the table was an appropriately-tagged data table in the original PDF file. More information can be found on<br />

the EUAIN Training Resource Centre.<br />

5.3 Convert Multimedia Material to XML<br />

PDF is widely recognised as a de facto standard for electronic end user documents. Public services, banks, insurance<br />

companies and others that distribute electronic documents to end users utilise the advantages of a format that is easily<br />

printed by the user and has a fixed unchangeable layout. It is also widely recognised that PDF is unsuitable for anything<br />

else than visual presentation. Information in a PDF-document is deliberately made un-flexible and is not easily retrieved.<br />

In spite of these facts, one of the most common tasks for national providers of accessible information is the conversion of<br />

PDF to an accessible format. At the same time, this particular conversion is both costly and time-consuming.<br />

There are two different approaches to extract information from a PDF-document. Since PDF can be compared to a highquality<br />

TIFF picture – the same picture that is the result of a scanning process – it is possible to use the same OCR<br />

processes with PDF-documents as the processes used with scanned paper-documents – and the results are comparable.<br />

Most of the OCR-software providers on the market are recommending this method. The other approach is to try to extract<br />

the text-information that is present in a PDF-document. This is done by software that reads both the text content and the<br />

visual information in a PDF-document and translates this information to the common attributes of a text document –<br />

paragraphs, headings, etc.<br />

The result of both processes can then be exported into an XML file which is the basis for further accessible information<br />

processing. Both of these methods require extensive use of human resources and are not easily automated. Especially<br />

rendering of visual components such as columns, tables, page-headers, unusual fonts, etc. have proved very difficult to<br />

automate. Considering the large amount of human resources employed in this type of conversion, it could be concluded<br />

that research efforts used on improving these methods would be very well spent. It should be noted that conversions to<br />

accessible formats earlier in the production chain – when the conversion process is still easily automated – would have<br />

similar results.<br />

5.4 Convert traditional print to XML<br />

This activity refers to the transformation of printed-paper into an electronic interchange format. Usually this is completed<br />

through scanning of printed pages.<br />

The result is a set of image files (one file per page unless the multiple page TIFF format is used). These image files can<br />

then be saved or be processed through optical character recognition (OCR) software. This software recognises the text<br />

in the images and transforms the images into a text file. Either by specific software or by hand this text can be converted<br />

into a structured XML file.<br />

This output will refer to structured content which is either being fed into an XML reader which accesses the content and<br />

the structural data which surrounds that content or the XML is being fed into the next stage of an XML production stream<br />

for further conversion.<br />

This process describes the first stage of a digitisation project where an analog format is being taken in and converted into<br />

a format (XML), which allows further conversion, storage or processing.<br />

30


CWA <strong>15778</strong>:2008 (E)<br />

5.5 Convert DTP to XML<br />

This, primarily multimedia, input will refer to a complex package of media files, which are held together by one governing<br />

structure. Well known DTP packages currently are QuarkXpress and Adobe InDesign. These documents also have the<br />

most complex formats, which can be used in accessible information processing, and the ability to process these formats<br />

forms the key to integrating accessibility with mainstream workflows and processes.<br />

In the case of Accessible Output from a Multimedia package, guidelines are often needed for describing images and<br />

other multimedia formats which cannot be directly translated into a suitable format.<br />

XML output will refer to structured content which is either being fed into an XML reader which accesses the content and<br />

the metadata which surrounds that content or the XML is being fed into the next stage of an XML production stream for<br />

further conversion.<br />

This process is likely to be taking some (or all) of the content in a multimedia package and processing it into a more<br />

generic format (XML) for conversion into further formats. More information can be found on the EUAIN Training Resource<br />

Centre 70 .<br />

5.6 Convert XML to Print<br />

XML to printed paper conversion processes (specific to accessible information processing) would refer to any process,<br />

which uses XML data to prepare content for accessible use based on a paper format.<br />

Primarily XML input will refer to part of an XML production stream where XML is used as the core interchange format.<br />

This being the case, it is likely that there will be some pre-processing, as content is rarely created by hand straight into<br />

XML. From XML, content can be converted to almost any format so it is a very good starting point in accessible<br />

information processes.<br />

The transformation itself is usually done automatically by software. It uses specific transformation templates (XSLT) to<br />

convert the XML data into a readable format. The transformation templates contain information on font sizes, colours,<br />

and also layout information.<br />

Printed-paper output refers to any hard copy, which represents the information. It is probably coming out of a printer or a<br />

photocopier. In terms of accessible information processing printed-paper output can be a clear print output without<br />

sophisticated lay outing of the text. This makes the text easier to read.<br />

This conversion process is likely to be one output branch of an XML production process that uses XML as its <strong>cen</strong>tral<br />

archiving and interchange format. This would be the output node for one particular format.<br />

5.7 Convert XML to Braille<br />

A common conversion process in accessible information processing is that of a generic XML format to a specialist format<br />

with specific niche user requirements. As many specialist organisations move towards an XML based production process,<br />

this process will become commonplace.<br />

Printed Braille output refers to any hard copy, which represents Braille. Braille can be created through various means.<br />

These means will probably involve a Braille Embosser or any other Braille printer. One should be aware that Braille<br />

printer files are machine dependent.<br />

The transformation itself is usually done automatically by software. It uses a specific transformation style guideline to<br />

convert the XML data into the format, which is fed into the Braille printer. The style guideline must do the pre-formatting<br />

for Braille output. This includes layout information like adding line-breaks and other Braille print specific information. One<br />

of the trickier transformations is the generation of language dependent contractions.<br />

This conversion process is likely to be one output branch of an XML production process that uses XML as its <strong>cen</strong>tral<br />

archiving and interchange format. This would be the output node for one particular format.<br />

70 http://wiki.euain.org/doku.php?id=wiki:processes:conversions:multimedia_to_xml<br />

31


CWA <strong>15778</strong>:2008 (E)<br />

5.8 Convert XML to Large Print<br />

XML to printed paper conversion processes (specific to accessible information processing) would refer to any process<br />

that uses XML data to prepare content for accessible use based on a paper format. This probably refers to large print<br />

representations.<br />

Primarily XML input will refer to part of an XML production stream where XML is used as the core interchange format.<br />

This being the case, it is likely that there will be some pre-processing, as content is rarely created by hand straight into<br />

XML. From XML content can be converted to almost any format, so it is a very good starting point in accessible<br />

information processes.<br />

The transformation itself is usually done automatically by software. It uses a specific transformation style guideline to<br />

convert the XML data into a readable format. The style guideline contains information on font sizes, colours, and also<br />

layout information. High-precision XML to print transformation generally will be done through the use of XSLT or XSL:FO<br />

style guidelines (cf. also below: XML->PDF transformation).<br />

Printed-paper output refers to any hard copy, which represents the information. It is probably coming out of a printer or a<br />

photocopier. In terms of accessible information processing printed-paper output is likely to be a large print representation.<br />

This is likely to be one output branch of an XML production process that uses XML as its <strong>cen</strong>tral archiving and<br />

interchange format. This would be the output node for one particular format.<br />

5.9 Convert XML to HTML<br />

XML to HTML conversion processes (specific to accessible information processing) would refer to any process, which<br />

uses XML data to prepare content for the use within web environments be it a web site or just for offline reading within a<br />

web browser.<br />

Since both the input format and the output format can contain high amounts of structural data or structured information,<br />

we can assume that these processes are quite modern and that they represent good practice in accessible information<br />

processing.<br />

In essence, XML input will refer to part of an XML production stream where XML is used as the core interchange format.<br />

This being the case, it is likely that there will be some pre-processing, as content is rarely created by hand straight into<br />

XML. XML content can be converted to almost any format (using e.g. XSLT or other processing systems such as Stilo-<br />

Omnimark), so it is a very good starting point for accessible information processes.<br />

An output medium of HTML suggests that the content is being prepared for the usage within web environments. This<br />

means that processing means should be aware of the relevant standards for accessible web content (e.g. W3C, WCAG<br />

1.0, WCAG 2.0).<br />

The HTML will possibly be published on the web or possibly as part of a Content Management System (CMS).<br />

This conversion process is likely to be one output branch of an XML production process that uses XML as its <strong>cen</strong>tral<br />

archiving and interchange format. This would be the output node for one particular format.<br />

5.10 Convert XML to structured Multimedia Material<br />

XML to structured Multimedia Material conversion processes refer to a process where for example XML data is<br />

transformed into the PDF format. PDF is a very common output format, it has the advantages that the content cannot be<br />

changed by the user. It is also a format that is platform independent, which means that a PDF document will look the<br />

same on any computer and is not dependent on any specific reading software.<br />

Though PDF output refers to an electronic document format it can also be used within accessible information processing<br />

for preparing documents for Large Print output. These Large Print PDF can either be read on a computer screen or, if<br />

allowed, be also printed out for use as Printed Paper output.<br />

The transformation itself is usually done automatically by software. It uses a specific transformation style guideline to<br />

convert the XML data into a readable format. This implies that processing means should be aware of the relevant PDF<br />

32


CWA <strong>15778</strong>:2008 (E)<br />

transformation (XSL:FO).The conversion style guideline contains information on font sizes, colours, and also layout<br />

information. XSL:FO processing requires additional software called an FO-processor 71 .<br />

In essence, XML input will refer to part of an XML production stream where XML is used as the core interchange format.<br />

This being the case, it is likely that there will be some pre-processing, as content is rarely created by hand straight into<br />

XML. XML content can be converted to almost any format, so it is a very good starting point for accessible information<br />

processes.<br />

5.11 Convert DTP to Multimedia Material<br />

This primarily multimedia input will refer to a complex package of media files, which are held together by one governing<br />

structure. Well known DTP packages currently are QuarkXpress and Adobe InDesign. These documents also have the<br />

most complex formats, which can be used in accessible information processing, and the ability to process these formats<br />

forms the key to integrating accessibility with mainstream workflows and processes.<br />

In the case of Accessible Output from a Multimedia package, guidelines are often needed for describing images and<br />

other multimedia formats which cannot be directly translated into a suitable format.<br />

Usually the PDF output from DTP software is used for submission to the printing house for print output. New production<br />

techniques and also improvements and new features in the DTP software now make it possible to create a basic PDF<br />

output version that can be used as input to other conversion processes such as the conversion of Multimedia Material to<br />

structured Multimedia Material.<br />

PDF as an output format is a very common one and it has the advantages that the content cannot be changed by the<br />

user. It is also a format that is platform independent, which means that a PDF document will look the same on any<br />

computer and is not dependent on any specific reading software.<br />

5.12 Convert Audio to structured Audio<br />

Audio files have been an integral part of specialist formats for some time. Currently all audio streams are available or<br />

easily convertible to digital formats (WAV, MP3, other MPEG based formats etc.)<br />

The conversion process includes the modification of the audio content and adding structural data to the audio content. In<br />

general, this implies cutting large files into smaller ones or producing a list of timing markers so that audio rendering can<br />

start at any desired point in a larger audio file.<br />

Structured Audio as an output format refers to audio content that is fully navigable (headings, chapters, paragraphs). This<br />

can be reached by using relevant standards or frameworks for handling structured audio content such as ANSI/NISO<br />

Z39.86 (DAISY) and NIMAS/DAISY. These frameworks can provide a combination of different output formats. In the case<br />

of DAISY this can be a combination of textual representation of the content combined with audio. A special case is the<br />

addition of time markers on human-read audio files so that they can be rendered in small pieces of audio. This process is<br />

part of the production chain of hybrid (text + audio) Daisy books.<br />

5.13 Convert XML to XML<br />

Primarily XML input will refer to part of an XML production stream where XML is used as the core interchange format.<br />

This being the case, it is likely that there will be some pre-processing, as content is rarely created by hand straight into<br />

XML. From XML content can be converted to almost any format, so it is a very good starting point in accessible<br />

information processes.<br />

XML output will refer to structured content which is either being fed into an XML reader which accesses the content and<br />

the metadata which surrounds that content or the XML is being fed into the next stage of an XML production stream for<br />

further conversion.<br />

In general this process is the core of many compound production processes. XML is being converted in some way. This<br />

could be an adaptation from one XML format to another (e.g. TEI to DAISY) or a change in presentation of an XML<br />

format, possibly through use of XSLT.<br />

71 An example of this is the Altova Stylevision processor, http://www.altova.com/downloadtrialstylevision.html<br />

33


CWA <strong>15778</strong>:2008 (E)<br />

Due to the fact that XML can also be used as storage format it is likely that different publishers may use their own XML<br />

format. For accessible information processing it might be necessary to transform one XML format into another to be able<br />

to apply the follow up conversions into the final delivery format. This might include a conversion from proprietary XML to<br />

an open standard such as TEI or DAISY.<br />

The transformation itself is usually done automatically by software. It uses a specific transformation style guideline<br />

(XSLT) to convert the XML data into another XML format. To be able to preserve all the structural information during the<br />

transformation process it is necessary for the people involved to be aware of the involved XML grammaticism.<br />

Technically this means that both input and output structuring definitions (DTD or Schemas) must be known.<br />

34


CWA <strong>15778</strong>:2008 (E)<br />

6 S<strong>cen</strong>arios introducing accessibility within publishing workflows<br />

The term "fully accessible" or "full accessibility" is always controversial. Accessibility is a mixture of two main factors –<br />

first, the type and degree of the print disability of the user, and second, the combination of "readability" and "navigability"<br />

of the information offered to the user. Some formats are only readable, but hardly navigable and some are both. Some<br />

users may have problems with colours and others with the size of the fonts used, and some with both.<br />

As far as formats go, a pure ASCII file is perfectly readable with most (if not all) adaptive devices, but it is hard to<br />

navigate as there is usually no structure at all, neither visual nor spatial. However, an XML text (or HTML, XHTML, etc.)<br />

are both readable and navigable, as they allow for different degrees of internal structure. Some text formats (ASCII,<br />

marked-up languages and word processor documents) allow for customisation of the characters shown in the screen<br />

(both in size and appearance) and the background colours. Some others may be readable and navigable but not<br />

customisable (like, for example, a tagged PDF document read with Acrobat Reader – here the size of the whole page<br />

can be changed but it does not allow for changing only parts of the text to the size or appearance that may be needed).<br />

According to this pure images can be put on one side of the scale and a DAISY book on the other side – a pure<br />

untagged PDF document is completely inaccessible for most print disable people (as much as a printed book is), while a<br />

full-audio full-text DAISY book includes all possible accessibility features that might be thought of. In between these two<br />

extremes the possibilities are endless. There are proprietary file formats that can create good accessible documents if<br />

only used properly (PDF, Word, etc.).<br />

But making documents accessible for customers does not mean that there is an obligation to satisfy everyone’s needs.<br />

That is never possible, not even with printed books – some people prefer smaller books, other people would like a<br />

slightly bigger font, while yet others prefer glossy to matte paper. To try to address as many customers as possible<br />

(according to economical, social, cultural and personal differences) publishers put in the market different editions of the<br />

same book – hardback editions, paperback editions, pocket editions… and they make as many translations of the book<br />

as needed to sell as many copies of it in as many countries as possible. All these different editions are considered to be<br />

in the same format – printed paper. But, in fact, they are "customised" versions of the same format, and some people<br />

may even consider them to be "different formats". And they all originate in the same electronic file, slightly or largely<br />

modified to meet the special characteristics of a certain edition. So publishers are already publishing the same book in<br />

different "formats" when needed.<br />

Formats that are only "readable" are only advisable if there is no other possibility of producing an accessible version of a<br />

book. These formats are, for instance, pure ASCII files or a continuous wav or mp3 file for an audio book. The book thus<br />

produced can be read, but the reading experience is far from satisfactory. Sometimes, if the book is not the type of book<br />

that is usually read sequentially, plain readable formats are of very little or no use at all.<br />

On the other side of what an accessible book may be stand DAISY books. They can reproduce the experience of moving<br />

and browsing through a printed book for those who cannot read print. Navigation can be taken as far as the word level<br />

when needed and both text and audio (when they are both present) are perfectly synchronised. DAISY books have many<br />

layers of navigability that go from full-audio and full-text (on the top of the scale) to fully structured text-only or audio-only<br />

books. DAISY books can be produced directly from properly created XML files with the appropriate DTD.<br />

35


CWA <strong>15778</strong>:2008 (E)<br />

6.1 S<strong>cen</strong>ario 1 - Delivering XML files<br />

We are an SME operating largely within one large EU country. All of our document design and structuring is out-sourced<br />

to different design agencies, depending on the type of materials we are publishing. We have been asked to provide XML<br />

source files to our national organisation for the blind in order to create accessible versions of the materials. Our material<br />

is largely educational and includes multimedia materials. How can we comply with this request?<br />

In this s<strong>cen</strong>ario the libraries have to be specific in their wishes. E.g. do they want source documents or XML?<br />

Publisher's structures (if not, structures will be lost when the material is exported)<br />

Guidelines need to be produced instructing actors what to ask for in specific situations.<br />

Actors<br />

<br />

<br />

<br />

National Publisher, <strong>Education</strong>al Publishing, Multimedia Publishing, SME<br />

Library for the blind<br />

Service Provider<br />

Conversions<br />

Multimedia to structured Multimedia [5.1]<br />

Structured Multimedia to XML [5.2]<br />

Multimedia to XML [5.3]<br />

XML to XML [5.13]<br />

S<strong>cen</strong>ario description<br />

There are three basic ways to create accessible information in this s<strong>cen</strong>ario:<br />

Convert the Multimedia files (e.g. PDFs) to XML and add then the structural information to the XML<br />

file and deliver those to the Library for the Blind<br />

Edit the the structure within the Multimedia document and convert it afterwards to XML. Deliver the<br />

XML to the Library for the Blind<br />

Convert a multimedia document into a structured multimedia document and then into XML. This can<br />

then be delivered to the Library for the Blind<br />

36


CWA <strong>15778</strong>:2008 (E)<br />

Figure 2 - S<strong>cen</strong>ario 1 Delivering XML files<br />

37


CWA <strong>15778</strong>:2008 (E)<br />

6.2 S<strong>cen</strong>ario 2 - Accessibility enhancement in general<br />

We are a large publisher operating in several markets across Europe and the US. We already have an enterprise content<br />

management system. To what extent can this be made more accessibility compliant so that we can create accessible<br />

output formats for national accessibility organisations in different countries? In some countries we are required by<br />

legislation to provide these materials, in others it is on a voluntary basis, but in each case we need to know how to do<br />

this.<br />

In this s<strong>cen</strong>ario, the national accessibility organisations need to express their requirements of formats and files that are<br />

needed (e.g. sources files or XML). Guidelines need to be produced instructing actors what to ask for in specific<br />

situations.<br />

Actors<br />

<br />

<br />

International Publisher, General, Multimedia Publishing<br />

Large national specialist providers<br />

Conversions<br />

Multimedia to XML [5.3]<br />

DTP to XML [5.5]<br />

DTP to Multimedia [5.11]<br />

S<strong>cen</strong>ario description<br />

Three possible ways:<br />

The DTP files are converted straight into a multimedia format (e.g. PDF) and then into XML, where<br />

the structure and accessibility information are added<br />

The DTP is directly converted into XML and the structure and accessibility information are added<br />

The structure is adden within the DTP document (e.g. through XML tagging). Then these documents<br />

can be converted into XML again<br />

38


CWA <strong>15778</strong>:2008 (E)<br />

Figure 3 - S<strong>cen</strong>ario 2 Accessibility enhancement in general<br />

39


CWA <strong>15778</strong>:2008 (E)<br />

6.3 S<strong>cen</strong>ario 3 - Increasing web accessibility<br />

We are a major scientific and technical publisher. Most of our material is already available online but we do not really pay<br />

attention to accessibility issues other than on a general level. Our customers tell us this is not enough, and much of our<br />

material remains inaccessible. How can we better distribute our materials over the web to our print impaired users?<br />

Guidelines need to be produced instructing actors what to ask for in specific situations.<br />

Actors<br />

<br />

<br />

International Multimedia STM Publisher<br />

Print impaired end users<br />

Conversions<br />

XML to HTML [5.9]<br />

S<strong>cen</strong>ario description<br />

In this case, the Publisher in question has a large body of content which they would like to make accessible on the web.<br />

This is a case where standards are increasingly important.<br />

There are several relevant standards in this case:<br />

WCAG 1.0 (Web Content Accessibility Guidelines 1.0)<br />

WCAG 2.0 (Web Content Accessibility Guidelines 2.0)<br />

There are several relevant organisations in this case:<br />

CEN/ISSS Workshop on Document Processing for Accessibility (WS/DPA)<br />

W3C (World Wide Web Consortium)<br />

WAI (Web Accessibility Initiative)<br />

In order to make the incorporation of these standards as streamlined as possible, it is important to use a CMS system<br />

suitable for the needs of both the target end users and the producing organisation. For non-XML documents the user<br />

might need access to information or transformation platforms to convert those documents also into HTML.<br />

40


CWA <strong>15778</strong>:2008 (E)<br />

Figure 4 - S<strong>cen</strong>ario 3 Increasing web accessibility<br />

41


CWA <strong>15778</strong>:2008 (E)<br />

6.4 S<strong>cen</strong>ario 4 - Accessibility policy<br />

We are a traditional print publisher but we have absolutely no experience in accessibility. We want to train our staff in<br />

making our materials accessible, what do we do?<br />

Guidelines need to be produced instructing actors what to ask for in specific situations.<br />

Actors<br />

<br />

<br />

International Multimedia STM Publisher<br />

Print impaired end users<br />

Conversions<br />

Traditional print to XML [5.4]<br />

S<strong>cen</strong>ario description<br />

This requires a plan to be put in place concerning an accessibility policy within the organisation. It is important that such<br />

a policy is aware of relevant standards, policies and legislation. These training materials are a good start, but<br />

communication with specialist organisations for the blind and visually impaired could also be useful.<br />

From a technical perspective, the possibility of incorporating accessibility within existing XML processing streams should<br />

be investigated.<br />

And for all new materials, publisher should develop author guidelines for digital publishing workflows to incorporate<br />

accessibility needs within the process.<br />

42


CWA <strong>15778</strong>:2008 (E)<br />

Figure 5 - S<strong>cen</strong>ario 4 Accessibility policy<br />

6.5 S<strong>cen</strong>ario 5 - Spoken documents for everyone<br />

We are a major player in the EU Spoken Book market. We are interested in the accessibility area, as we believe this<br />

could strengthen our hold on the market. How do we produce books that are accessible for everyone, especially people<br />

with dyslexia?<br />

Actors<br />

<br />

<br />

EU audio book publisher<br />

National Specialist provider<br />

Conversions<br />

Audio to structured Audio [5.12]<br />

S<strong>cen</strong>ario description<br />

It is likely that the solution to this s<strong>cen</strong>ario is very simple. With the assistance of the national specialist provider in<br />

question, it should be possibly to convert an audio book (WAV or MP3 based) into an audio book which contains<br />

structure for navigation. This would require the use of a standard such as Daisy.<br />

43


CWA <strong>15778</strong>:2008 (E)<br />

Figure 6 - S<strong>cen</strong>ario 5 Spoken documents for everyone<br />

44


CWA <strong>15778</strong>:2008 (E)<br />

6.6 S<strong>cen</strong>ario 6 - Accessible and protected PDFs<br />

We are a major publishing conglomerate. Much of our content is made available in PDF format exclusively. How do we<br />

make PDFs accessible but still protected from unauthorised use? Is there software I can use?<br />

Actors<br />

<br />

<br />

Major Publishing Conglomerate<br />

National Specialist Provider<br />

Conversions<br />

Multimedia to structured Multimedia [5.1]<br />

S<strong>cen</strong>ario Description<br />

Accessible PDFs are a hot item at the moment and as a result there is a lot of activity on the internet:<br />

Adobe Accessibility Resource Centre 72<br />

Creating Accessible PDF Documents with Adobe Acrobat 73<br />

PDF Universal Access working group 74<br />

In this document, PDF is considered to be a multimedia format which is a packaged set of files of different formats<br />

structured into a document. In the conversion process 5.1 a PDF document is converted through tagging of the PDF into<br />

a structured multimedia document. The DRM within these documents can be considered as another element with specific<br />

and often conflicting requirements. The technical requirements for conversion are much the same as those for any<br />

multimedia package.<br />

72 http://www.adobe.com/accessibility/<br />

73 http://www.adobe.com/enterprise/accessibility/pdfs/acro7_pg_ue.pdf<br />

74 http://www.aiim.org/standards.asp?ID=27861<br />

45


CWA <strong>15778</strong>:2008 (E)<br />

Figure 7 - S<strong>cen</strong>ario 6 Accessible and protected PDFs<br />

46


CWA <strong>15778</strong>:2008 (E)<br />

6.7 S<strong>cen</strong>ario 7 - Working hand in hand<br />

We are an organisation providing accessible format materials. We often receive unstructured (or poorly structured)<br />

publisher source files and convert them to (DAISY) XML and Braille. What specifications can we give to publishers so<br />

that they can better structure their source files themselves, as this will make the process easier?<br />

Actors<br />

<br />

<br />

Specialist organisation<br />

General National Publisher<br />

Conversions<br />

Multimedia to XML [5.3]<br />

XML to Braille [5.7]<br />

XML to XML [5.13]<br />

S<strong>cen</strong>ario description<br />

The communication of requirements between publishers and specialist organisations is difficult. In both directions, there<br />

can be complications because of the language used, priorities and different end user requirements. In order to make the<br />

communication process as seamless as possible, it is essential to start this process as early as possible, and build up a<br />

relationship of trust and knowledge of each other’s requirements. International Standards are also essential to ensure<br />

that there is a starting point and other work can be re-used. In order to create a solution that is as wide as possible it is<br />

important to make use of a standardised XML dialect for support of automatic transformation.<br />

47


CWA <strong>15778</strong>:2008 (E)<br />

Figure 8 - S<strong>cen</strong>ario 7 Working hand in hand<br />

48


CWA <strong>15778</strong>:2008 (E)<br />

6.8 S<strong>cen</strong>ario 8 - Accessible design<br />

We are a design agency working for several private and public organisations. They have asked us to build in accessibility<br />

in our products but how do we do this? We understand the principles of "Design For All" and web accessibility but how<br />

does this apply to the documents we design?<br />

Actors<br />

<br />

<br />

Design Agency<br />

Public and private service provider<br />

Conversions<br />

Multimedia to XML [5.3]<br />

XML to structured Multimedia [5.10]<br />

S<strong>cen</strong>ario description<br />

Multimedia documents are converted into XML. The information can then be structured and afterwards converted again<br />

in multiple output formats.<br />

Assuming the service provider is using modern software and modern XML production streams, it should be a case of<br />

adding new outputs to these streams for the major accessibility formats which are described in these training materials.<br />

49


CWA <strong>15778</strong>:2008 (E)<br />

Figure 9 - S<strong>cen</strong>ario 8 Accessible design<br />

50


CWA <strong>15778</strong>:2008 (E)<br />

6.9 S<strong>cen</strong>ario 9 - Accessibility on a large scale<br />

We are a national library and are responsible for archiving vast amounts of digital material. How can we be sure this<br />

material will be accessible? Should we follow some guidelines?<br />

Actors<br />

<br />

<br />

Library acting as a service provider<br />

National Specialist Organisation for the Blind<br />

Conversions<br />

XML to structured multimedia (5.10)<br />

S<strong>cen</strong>ario description<br />

Since there is an archive of digitised information, it is likely that an XML production stream has been used. If this is the<br />

case, then specific formats for impaired users in question need to be added to the output formats. If no XML processes<br />

have been put in place, it is likely that some manual intervention may be required to add metadata to the information in<br />

order to make it suitable for conversion to accessible formats. As a national Library, it is also important that there is a<br />

degree of communication with the National Library for the Blind in your country.<br />

Figure 10 - S<strong>cen</strong>ario 9 Accessibility on a large scale<br />

51


CWA <strong>15778</strong>:2008 (E)<br />

6.10 S<strong>cen</strong>ario 10 - What authors can do<br />

I am an author who publishes my own works/publishes as part of a collective. How do I make sure that everyone can<br />

read my books without me incurring too many costs?<br />

Actors<br />

<br />

<br />

Author<br />

Accessibility specialist<br />

Conversions<br />

Structured Multimedia to XML [5.2]<br />

Traditional print to XML [5.4]<br />

S<strong>cen</strong>ario description<br />

The author defines together with accessibility specialist guidelines that support the author to create<br />

accessible documents.<br />

On the other hand existing print material is converted into XML which can then be exported into<br />

various output formats.<br />

52


CWA <strong>15778</strong>:2008 (E)<br />

Figure 11 - S<strong>cen</strong>ario 10 - What authors can do<br />

53


CWA <strong>15778</strong>:2008 (E)<br />

6.11 S<strong>cen</strong>ario 11 - Repair and adaptation<br />

We are a disability support unit in a major university. Our staff members are experts in specific disabilities and our role is<br />

to support specific learners at the university by ensuring the multimedia content supplied for their learning is accessible<br />

to each learner, providing or recommending appropriate formats to match access modes available to that learner. How<br />

can we adapt learning materials? What are the processes and what standards are available to support the processes.<br />

Actors<br />

<br />

Disability Support Staff<br />

Conversions<br />

Multimedia to structured Multimedia [5.1]<br />

S<strong>cen</strong>ario description<br />

A Disability Support Staff member will work with the learner and the material assessing the adaptations required to make<br />

the material accessible for the learner’s context. The learner’s requirements for the context might be expressed in a<br />

functional description (e.g. IMS Accessibility for LIP 75 or ISO/IEC JTC 1/SC 36 24751-2 Personal Needs and Preferences<br />

Statement 76 ). There may also be environmental context requirements, such as might be expressed in a device profile.<br />

Content produced by the Educator will be in the form of an aggregation, which might be HTML or SCORM or an IMS<br />

Content Package or other aggregation usable by a Learning Management System. A piece of content may itself contain<br />

multiple aggregations and formats such as PDF files, MPEG videos, text, HTML etc. Determining whether the content<br />

matches the functional requirements and what the needs for repair are might involve examination of Metadata associated<br />

with the content (such as IMS AccessForAll Metadata 77<br />

or ISO/IEC JTC 1/SC 36 24751-3 Digital Resource<br />

Descriptions 78 ) or using automated software tools to determine accessibility properties of content.<br />

Appropriate repair assistance tools might output machine and human-readable statements in the language EARL 79 or<br />

properties related to WCAG 2.0 or Section 508 or other such Accessibility standards and also other properties.<br />

Repair might involve disassembling an aggregation (such as IMS Content Packaging 1.2 80 ) into its constituent parts,<br />

providing alternate/supplemental resources and re-assembling. Repair might also involve provision of offline materials<br />

(such as the notes in large font at some specific time) or online services or offline services to accompany the use of the<br />

material.<br />

75 http://www.imsglobal.org/accessibility/#acclip<br />

76<br />

http://www.iso.org/iso/en/CatalogueDetailPage.CatalogueDetail?CSNUMBER=43603&scopelist=PROGRAMM<br />

E<br />

77 http://www.imsglobal.org/accessibility/accmdv1p0/imsaccmd_oviewv1p0.html<br />

78<br />

http://www.iso.org/iso/en/CatalogueDetailPage.CatalogueDetail?CSNUMBER=43604&scopelist=PROGRAMM<br />

E<br />

79 http://www.w3.org/TR/EARL10-Schema/<br />

80 http://www.imsglobal.org/content/packaging/<br />

54


CWA <strong>15778</strong>:2008 (E)<br />

Figure 12 - S<strong>cen</strong>ario 11 Repair and adaptation<br />

55


CWA <strong>15778</strong>:2008 (E)<br />

6.12 Common s<strong>cen</strong>ario requirements<br />

As is shown in the possible workflow for each s<strong>cen</strong>ario, there are common tasks that need to be covered when<br />

introducing accessibility into the publishing workflow. Those tasks include:<br />

<br />

<br />

<br />

<br />

<br />

Definition of document style guidelines that are agreed and disseminated to the actors.<br />

Reference conversion of document’s references into tagged structured formats.<br />

Manual correction of conversion input/output (in the case where content creators did not follow style<br />

guidelines requirements).<br />

Extension of descriptions regarding visual data (images, videos) and other multimedia material.<br />

Subsequent on demand conversion into accessible formats.<br />

6.13 Specific s<strong>cen</strong>ario requirements<br />

In order to make documents accessible for a specific group, specific format conversions need to take place. It is<br />

important in these format conversions that the structure, annotation and metadata and intellectual property rights of the<br />

parent documents will be inherited by the child documents and not be lost. Therefore, attention must be paid to using the<br />

right conversion order and toolkits. One strategy that seems to be the most viable is to use a media rich (including<br />

annotations, and metadata) structured format as reference and then create the desired output operating directly on that<br />

format. An additional advantage of this strategy is that this format can ideally also be used for backups and can be stored<br />

in the publishing archive or also in libraries for long-term archiving.<br />

The next sections will further elaborate the requirements found for the possible s<strong>cen</strong>arios presented above. An analysis<br />

will be provided of the options for document style guidelines, reference conversion into a structured format, how to<br />

minimise manual correction and enhancement of the conversion output, how to extend descriptions of visual data and<br />

other multimedia material and how to convert the reference format into other accessible formats.<br />

7 Application-oriented s<strong>cen</strong>ario implementation<br />

7.1 Harry Potter and the RNIB<br />

Actors involved: International Publisher, General, Printed paper books, Large, International Publisher, General,<br />

Audiobooks, Large, Public blind Service Provider<br />

Conversions: Audio wav to ASCII, ASCII to printed Braille, Multimedia to printed Braille, Multimedia to printed paper,<br />

Multimedia to Multimedia<br />

This case study concerns the simultaneous release of standard and accessible versions of a popular work of fiction in the<br />

United Kingdom. The latest Harry Potter book (Harry Potter and the Half Blood Prince) was brought out simultaneously in<br />

normal print, large print, Braille, audio and DTB. The RNIB produced the Braille version; the publishers produced the<br />

normal print, large print and audio. A third actor produced the DTB.<br />

7.1.1 Print version<br />

Bloomsbury published both the standard print and large print versions of the book simultaneously. The large print version<br />

was produced in 16 point and was of good quality. A number of individuals seemed to encounter significant difficulties in<br />

securing large print copies of the book; this was despite the large print version being mentioned in press releases and<br />

being included on the Nielsen database (the primary tool for booksellers in identifying availability). Potential customers<br />

for large print were encouraged to pursue the issue with mainstream booksellers. The National Blind Children's Society<br />

also provided customised large print versions of the book to children, in a range of point sizes. They waited until the print<br />

version of the book was published and then scanned it. The customised large print service is only available to children<br />

and the print price is heavily subsidised by the charity. As with other accessible versions of the book (see below), the<br />

stated aim of the charities concerned is to provide the ‘same book, at the same time, at the same price to the print<br />

impaired end user. (List price of Standard Print Book £16.99, List price of large print Book £30.00, Price of NBCS book is<br />

£16.99 to individual children)<br />

56


CWA <strong>15778</strong>:2008 (E)<br />

7.1.2 Braille<br />

RNIB were required to undertake and pay for a security audit before the publication date. Once cleared, a representative<br />

from Bloomsbury arrived with the word file of the book on a CD. The CD was loaded onto a computer that was not<br />

networked. The word file was translated into a Braille file and a stock of hard copy Braille was produced. The artwork for<br />

the cover had been secured earlier than the text. The Braille version of the book was available at exactly the same time<br />

as the print version of the book. This was a major achievement, the cost of production was still heavily subsidised by<br />

RNIB. (Price of Braille copy of book: £16.99 available from RNIB)<br />

7.1.3 Analog Audio<br />

The audio rights for the book were owned not by Bloomsbury but by an individual, Helen Nicholl. The actor Stephen Fry<br />

narrated the book (as he had done with previous Harry Potter titles). The book was recorded using industry standard,<br />

professional, recording software (Pro tools/ Sadie). The digital master was then used to produce both a CDDA (Compact<br />

Disk Digital Audio) copy of the book (17 CDs long) and an audio-cassette version of the book. These audio versions of<br />

the book were available 6 weeks after the publication of the print book. This delay was apparently due to the work<br />

commitments of Stephen Fry. According to UK law you can not use a copyright exception to produce an accessible<br />

version of a book if there is already an equivalently accessible version commercially available. For this reason the<br />

Stephen Fry cassette and CD version of the book is the only one available (albeit at an incredibly high price) (List Price of<br />

analog audio £55.00, List price of CDDA £65.00)<br />

7.1.4 Daisy Audio<br />

Daisy audio is deemed a significantly different format to both analog audio and CDDA so RNIB was within the law to<br />

produce the book in Daisy. There were several options available:<br />

Use the electronic file (used to produce the Braille) to produce a synthesised voice version of the<br />

book<br />

<br />

<br />

Use an RNIB narrator to produce a real voice recording of the book<br />

Use the Stephen Fry recording to produce a Daisy version of the book<br />

The advantages of the first two options were that the Daisy version would be available at the same time as the print<br />

version of the book. However as previous Daisy versions of the series had all used the Stephen Fry audio version it was<br />

decided to go for continuity over ‘same day publication’. The rights owner provided a copy of the CDDA version of the<br />

book. This was used to produce wave files of the recording which were then converted into the Daisy book. The Daisy<br />

book is available for loan via the RNIB talking book service. It is also available for sale. The rights owner set an arbitrary<br />

limit of 250 copies (for sale). This will soon be reached and will need to be re-negotiated. Once again the Daisy version<br />

of the book was given the same price as the standard print price copy of the book. It was available for loan and sale, 4<br />

months after the original publication of the print book. (Daisy copy of book from RNIB £16.99)<br />

7.2 Magazine and Newspaper distribution in the Netherlands<br />

Actors involved: Public blind, Public partially sighted, Public dyslexics, National Publisher, General, Multimedia<br />

Publishing, Large<br />

Conversions: ASCII to XML, XML to audio, XML to Braille, XML to printed paper, XML to XML, HTML to XML, Multimedia<br />

to XML<br />

The Dutch Library for the Blind Dedicon converts 37 newspapers automatically to accessible XML formats, making them<br />

available at the same time if not before the printed editions. Magazines still require some intervention by skilled staff to<br />

convert files to a common format, which sometimes causes short delays between print publication and the availability of<br />

accessible versions.<br />

DEDICON, the Dutch library for print impaired readers distributes 37 newspapers and 60 magazines in a special XMLformat.<br />

The website Anderslezen.nl is used as a digital distribution platform, which reduces delay in delivery to a<br />

minimum. Most of the newspapers are available at the same time or even earlier than the printed versions. This is<br />

possible because conversion and distribution of the content is fully automated. Magazines cannot be delivered so quickly<br />

because some of the work still has to be done by hand. Production of the special “Dedicon” XML format means that<br />

delivery times cannot be guaranteed. Nevertheless delays are reasonable.<br />

57


CWA <strong>15778</strong>:2008 (E)<br />

7.2.1 The production of newspapers<br />

Publishers deliver their content in different XML formats. For example NewsMl, NITF, SGML and special internally<br />

developed XML formats. The files are transferred by FTP to a Dedicon server, where they are converted automatically:<br />

The conversion editor/post processor (Dedicon software) or XSLT software converts the file into Dedicon-XML and sorts<br />

the content in a logical order.<br />

A DMD-file (document metadata, Dedicon software) is created. It contains metadata about the publisher, the newspaper,<br />

the date and the size of the file. It supports the transport and storage of metadata to Anderslezen (website and<br />

distribution platform).<br />

<br />

<br />

<br />

<br />

<br />

<br />

The Dedicon-XML is encrypted and placed, together with a CSS and a XDF-file (the CSS supports<br />

the customer's reader software) in an EXD-file (encrypted XML document).<br />

EXD- and DMD-files are placed on the website Anderslezen (distribution platform).<br />

The costumer gets an automatically generated email with an attached (EXD-file) or a link.<br />

The EXD-file is decrypted and the file is opened by the Dedicon-reader.<br />

The costumer can navigate through the document with his mouse or the buttons of his keyboard.<br />

The customer can choose between different output formats: synthetic speech, Braille and large print,<br />

all of them produced locally out of the XML files.<br />

7.2.2 The production of magazines<br />

The content is not only delivered by FTP but also by CD or email. Dedicon receives many different formats, such as: PDF,<br />

Word, QuarkXpress, Indesign and HTML. It is not possible to convert all of those formats automatically.<br />

QuarkXpress is converted by hand in an Apple environment. The employee places the different parts of the content in the<br />

right order by using Textarch software. Automatic export would lead to an illogical content order.<br />

Indesign is converted in a Windows environment. With the assistance of FIX software (developed by Dedicon) the<br />

employee places the different parts of the content in the right order.<br />

The conversion editor/post processor (Dedicon software) converts the content into Dedicon XML.<br />

The employee controls the quality of the Dedicon XML. The content has to be well formed and valid. In case of an error<br />

the employee restores the content by hand.<br />

Any information which is missing and relevant is added to the DMD file (document metadata, Dedicon software) by the<br />

employee. The rest of the process is similar to the production process of newspapers.<br />

7.3 Time Warner and Dolphin Audio Publishing<br />

Actors involved: International Publisher, General, Audiobooks, Large Service Provider<br />

AFB Talking Books re<strong>cen</strong>tly introduced a new technology developed in partnership with Time Warner AudioBooks and<br />

Dolphin Audio Publishing, for the best-selling author James Patterson's new novel, The Jester. The Jester appeared as<br />

an audio e-book in 2003 and included an unabridged CD audio version of The Jester. The audio e-book, basically a<br />

digital talking book (DTB), was made possible through standards developed by libraries for people who are blind from<br />

around the world through the international DAISY (Digital Accessible Information System) Consortium.<br />

DTB technology allows large amounts of textual information and formatting to be stored, transcribed into a variety of<br />

formats, and easily navigated. The EaseReader software developed by Dolphin Audio Publishing synchronises audio to<br />

the text and plays/displays The Jester on desktop and laptop PCs. Readers can display the text of the book on the<br />

screen, fully synchronised with the audio of a professional narrator. Switching back and forth, or “toggling,” between print<br />

and audio versions of the same work, is also possible. Additionally, users can search both the entire text and audio for<br />

keywords and phrases. These features have a particularly broad appeal for travellers and commuters who may wish to<br />

read the text and listen to the audio independently or simultaneously, depending on their environment.<br />

Electronic hardware manufacturers are already responding to the innovation. In the near future, audio e-book technology<br />

will be integrated into hand-held Personal Data Assistants. Additionally, the Consumer Electronics Association is planning<br />

58


CWA <strong>15778</strong>:2008 (E)<br />

to integrate the DTB file format into CD player technology, allowing any CD player to access the audio portion of the<br />

audio e-book. This in itself would mark a significant advance, since an entire book’s worth of text and audio can fit onto<br />

one CD with the DTB file format.<br />

7.4 <strong>Education</strong>al publishing in Austria<br />

Actors involved: International Publisher - General - Printed paper books - SME, Service Provider - Public blind<br />

Conversions: Multimedia to XML, XML to HTML, XML to Multimedia. This case study outlines the co-operation between<br />

Austrian schoolbook publishers and service providers for people with special needs, to make books available in<br />

electronic formats.<br />

7.4.1 Situation in Austria<br />

In Austria, the Federal Ministry for Social Affairs and Generations is providing educational materials like schoolbooks and<br />

other materials for primary and secondary education. Blind and visually handicapped students - and hopefully soon in the<br />

future other print disabled students - can order books in accessible formats.<br />

Publishers, till this project, did not agree on handing over and distributing digital copies of books. The development of<br />

alternative formats starts from printed books with scanning, OCR or, when lots of graphics and/or formal structures like<br />

math are used, with typing. In this process structure was added to the book, headings were defined and lists and other<br />

structural elements were assigned to the text. This was a very time consuming process.<br />

This situation motivated to start a project which addressed the following issues:<br />

a minimum set of structural elements which documents from publishers have to contain to make<br />

them usable for the production of books in alternative formats (e.g. Braille, large print, ebooks)<br />

know-how and handouts for publishers on how to implement structured design with these elements<br />

using standard desktop publishing (DTP) systems (InDesign, QarkExpress)<br />

examples new books and redesigning existing books to learn how to do accessible document design<br />

in practice<br />

training materials, <strong>workshop</strong>s and seminars to transfer the developed know-how to other publishers<br />

and design agencies<br />

a general <strong>agreement</strong> which gives the right of transferring books in electronic format to students with<br />

disabilities<br />

a Document Rights Management System including to prevent the data to be misused in practice<br />

a workflow for the co-operation between schools/teachers, service providers, publishers and the<br />

ministry.<br />

Publishers were interested to take part as a) the new anti discrimination legislation will ask for accessibility of school<br />

books and b) they experience general problems in the publishing process when they want to use sources for different<br />

publishing purposes (e.g. print, online, CD, audio/multimedia). This convergence of interests led to a strong partnership<br />

for the project named.<br />

7.4.2 "Multi Channel Publishing"<br />

Five publishers take part in the project. Each of them is responsible for the designing or redesigning one of their books<br />

based on a predefined set of structural elements. This basic structural design defined in the project guarantees that the<br />

electronic version of the book can be used for the production of alternative formats. An analysis of the publishing process<br />

at publisher’s sites showed that service providers can only start from the final print ready version as the content, which is<br />

approved by public authorities, changes till this point. This final version today is most of the time a PDF generated from a<br />

DTP Tool (e.g. Adobe InDesign or QuarkXPress). Due to this, if electronic sources should be usable for services<br />

providers, structured design has to be implemented into the DTP work.<br />

7.4.3 Definition of structural elements for electronic versions of books<br />

To be able to collect the data of the source document and convert it into a XML File, we used the element set of the TEI-<br />

Standard, in particular the TEI Lite DTD. The TEI’s Guidelines for Electronic Text Encoding and Interchange were first<br />

published in April 1994. This set of metadata is widely known by publisher and guarantees compatibility or convertibility<br />

to other definitions in use like Daisy [Daisy 06]. Using TEI keeps the process close to the upcoming XML database<br />

59


CWA <strong>15778</strong>:2008 (E)<br />

schemes which publishers might use in the future using database structures for processing their documents. The TEI Lite<br />

DTD still consists of over 120 Elements for the tagging of books, most of them important for librarians. To simplify the<br />

work for all participating parties, a subset of those elements was selected. This subset consists of structural elements<br />

which are of general importance for structured document design and automatic content processing. This subset does not<br />

ask for special knowledge of accessible versions but can be seen as the basis for structured document design in general.<br />

Using this subset guarantees that the sources (or PDFs) can be used as a starting point for the production of accessible<br />

versions. In general this sub-set of the TEI Lite DTD comprises structural metadata elements for:<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

Headings<br />

Divisions / Subdivisions<br />

Images<br />

Tables<br />

Notes<br />

Page breaks<br />

References<br />

It also comprises administrative metadata elements (e.g. Edition, Year of Publishing, Author(s), Publisher, ...). The<br />

experience in the project showed, that this D.T.D. Subset is sufficient to structure the content of the schoolbooks.<br />

Publishers after a short training were able to do the work by themselves. This subset also proved to be in accordance<br />

with new publishing systems based on X.M.L. databases.<br />

7.4.4 Authoring Tools<br />

After the definition of the XML DTD, knowledge was developed how the authoring tools could support the efficient<br />

marking-up ofdocuments in the right way during the layout process. Further on routines for exporting the defined<br />

structure and layout data into XML were developed. The two most widely used authoring tools were examined in detail:<br />

7.4.5 Example Books<br />

The post-processing tasks are necessary, because, as mentioned before, the exported files in some cases have no<br />

structure and there are also parts of some books that could not be exported (e.g. graphics, made in the authoring<br />

systems). The post-processing tasks were:<br />

<br />

<br />

<br />

Adding Structure to the XML<br />

Revise elements, that were not exported properly<br />

Describe Images<br />

The result after the completion of the work is a valid XML version of the book. The next step is to convert the XML via<br />

style sheets into the target format. The style sheets for the conversion are freely available on the internet. They allow to<br />

convert the XML file into a HTML file with one/multiple pages and also to convert the XML file into a PDF file.<br />

7.4.6 Training materials, seminars and <strong>workshop</strong>s<br />

Training materials have been developed which are now used in <strong>workshop</strong>s and seminars to transfer the knowledge to as<br />

many publishers as well as design agencies as possible.<br />

7.4.7 DRM-System<br />

To make sure that the books are not used outside the designated user group a DRM System was customised. The<br />

system consists of a secure-reader-software and a USB dongle, which acts as the key. Every student gets a key and the<br />

software. The key has a code, which allows the student to read the book if the key is plugged into the computer. This<br />

system has the advantage that the user is not bound to one specific computer or piece of hardware. He can read the<br />

book for example at school but also in a learning group or at home. How the students get their books and a detailed<br />

workflow between publishers and the service providers is described in the next paragraph.<br />

7.4.8 Workflow<br />

To start the process, a teacher of a student with special needs orders a book in an accessible format. If the schoolbook<br />

service provider does already have the book in stock, it will be provided directly to the student. Otherwise, the service<br />

provider asks the publisher for the electronic version of the book. The publisher sends his TEI-XML file to the service<br />

60


CWA <strong>15778</strong>:2008 (E)<br />

Provider. The service Provider produces the accessible version of the book. Printed (Braille/enlarged) copies are sent by<br />

standard mail. If an electronic document is ordered, the service provider encodes the files with the DRM system using<br />

the data from the student’s USB dongle. The book is placed on a server, where the student can download the book.<br />

When the student has the reader software installed and the dongle plugged in, he can open the book and read it.<br />

7.4.9 Agreement between Publisher and Service Provider<br />

To ensure that the process works efficiently, an <strong>agreement</strong> between publishers and service providers has been drafted.<br />

The core articles of the <strong>agreement</strong> are:<br />

<br />

<br />

<br />

<br />

The publishers provide their electronic source documents<br />

It must be ensured, that the books are only given to people with a special need<br />

A DRM system must be used therefore<br />

It must be a "closed" system with registered users<br />

The <strong>agreement</strong> will be signed by every publisher and service provider. If a service provider needs a book from a<br />

publisher he can ask for it under the condition of the framework <strong>agreement</strong>.<br />

7.4.10 Conclusion<br />

The most important result of the project is the fact that handing over digital copies of print published documents is<br />

guaranteed in the future.<br />

The project showed that it is technically feasible to create XML versions of books by using the print ready version of a<br />

document. The experience also showed that the quality of the XML after just using the functions provided by the<br />

authoring tools is not good enough. A lot of work has to be done afterwards by cleaning and revising the XML document.<br />

The people who are performing this work will have to have some basic XML skills. It will also be a challenge to convince<br />

the publishers to create documents that can be exported into XML without a lot of additional effort. In some areas at the<br />

moment there are only limited possibilities to sources from publishers, especially in areas, where books consist mainly of<br />

pictures, graphics and other visual content. Another challenge is the integration of non-text content like mathematical or<br />

chemical expressions.<br />

The project made obvious that all publishers pass their layout data to the print office by using PDF. An important task for<br />

the future will be the development of a program to allow authoring systems to create PDF files that are either accessible<br />

or allow a conversion back into a useful format.<br />

In any case these are only first, but important steps towards multi channel publishing. More work is needed for a more<br />

efficient production of different versions of one source document.<br />

7.5 Best practice for distributing accessible content<br />

The “ideal accessible information network” could be a structured and collaborative network of organisations producing<br />

accessible information for print disabled people. To improve its efficiency, it should be a technical network with<br />

normalised tools and practices, with technical experts to keep working with publishers for innovating solutions.<br />

Reliable technical solutions should be set up to distribute accessible electronic documents to print disabled people.<br />

These solutions must guarantee intellectual property rights without restricting access to information. This ideal accessible<br />

information network should be a trust network, where the actors are well identified and work responsibly and accountably.<br />

Exchanges with publishers should be enhanced and organised by trusted intermediaries to discuss intellectual property<br />

rights and structured files provision. These intermediaries should be legally acknowledged as public authorities.<br />

Publishers should be provided with clear specifications on the file formats they can provide to trusted intermediaries. If<br />

necessary, guidelines or tools can be supplied to help publishers in integrating accessibility in their production chains.<br />

Publishers should introduce accessibility in the contract they sign with their subcontractors and service providers. They<br />

should also guide their authors in creating structured information using the prescribed authoring tools.<br />

61


CWA <strong>15778</strong>:2008 (E)<br />

When projects on electronic products are launched, accessibility should be introduced in the functional specifications and<br />

considered in the project design and realisation. 81<br />

7.5.1 Current examples of good practice<br />

These examples of current good practice cover the following areas:<br />

<br />

<br />

<br />

Technical books<br />

School books<br />

Electronic books<br />

7.5.2 Technical books<br />

These are examples of best practice in distributing accessible content in the form of technical books.<br />

7.5.2.1 O'Reilly Media Inc.<br />

Technical books and articles have changed considerably with the advances of new information technologies. The<br />

example of O'Reilly Media Inc. illustrates how technical information can be disseminated in both paper books and<br />

electronic documents. O'Reilly media was originally a technical writing consulting company. Today it has become one of<br />

the most famous publishers of books for software developers with its iconic “animal books” and “In a Nutshell” references.<br />

O'Reilly publications activities are not reduced to paper books; it offers many on-line services as its “Safari Books Online”<br />

82<br />

service, a web-based subscription service that offers a searchable reference library of computer books from different<br />

publishers. This on-line library allows subscribers to search across more than 3,000 books, parts or entire books can be<br />

read on-line, the catalogue can be browsed by category. Chapters of books can be downloaded for viewing off-line.<br />

O'Reilly has published a number of Open Books – books with various forms of “open” copyright. 83 These books can be<br />

out of print or written by authors who thought that their books had to be widely distributed under a particular open<br />

copyright. Through its Open Library project, the Internet Archive 84 is scanning and hosting PDF versions of O'Reilly open<br />

books.<br />

A number of the open books are also available as HTML or PDF e-books on the O'Reilly web site. These documents are<br />

structured in chapters, sub chapters and contain table of contents to access information. 85<br />

To create such a variety of products and services around paper books, O'Reilly has set up a complete publication<br />

process starting from authors to the final products.<br />

Authors are provided with very strict guidelines for the final book submission. The approved formats are: - Microsoft Word<br />

for PC or Mac, tagged to O'Reilly's paragraph and character style template, - XML tagged according to the DocBook Lite<br />

DTD, - Adobe FrameMaker tagged according to the paragraph and character style tags in O'Reilly's templates. Once the<br />

final draft is submitted by the authors and properly reviewed by a technical committee, the book is prepared for print by<br />

O'Reilly staff. Illustrations are re-done by graphic artists, the cover is designed.<br />

The camera and press ready material is then produced by the production group late in the production process. Adobe<br />

FrameMaker is used to prepare the final document that will be sent to press. 86<br />

81 http://wiki.euain.org/doku.php?id=wiki:distribution:distributing_content:best_practice<br />

82 http://safari.oreilly.com/<br />

83 http://www.oreilly.com/openbook/<br />

84 http://www.archive.org<br />

85 An example of an HTML open book can be found at:<br />

http://www.oreilly.com/catalog/debian/chapter/book/index.html<br />

86 http://www.oreilly.com/oreilly/author/<br />

62


CWA <strong>15778</strong>:2008 (E)<br />

This particular example shows a publisher who has created a rationalised multichannel publication process. Both paper<br />

books and digital publications are created from the authors submissions. This process is well documented. It involves<br />

authors in the early stages guiding them through the edition. Even if it takes place in a favourable context (authors are<br />

computing scientists and O'Reilly is specialised in technical publication), this example can be considered as encouraging<br />

for structured content publication and leading to accessibility.<br />

7.5.3 School books<br />

These are examples of best practice in distributing accessible content in the form of school books.<br />

7.5.3.1 Bordas-Nathan electronic school books<br />

The French publishers Bordas 87 and Nathan 88 are major school books publishers in France. In addition to a very large<br />

catalogue of paper books for pupils, teachers and pedagogues, these publishers create multimedia products as web sites<br />

and CDROMs for teachers and pupils. Since 2000, Bordas and Nathan have been also actively involved in electronic<br />

schoolbags projects.<br />

BrailleNet 89<br />

has studied the accessibility of an electronic History and Geography book for 15 years old pupils published<br />

by Bordas. The electronic book content is based on the paper version with extra multimedia documents as Macromedia<br />

Flash animations or audio and video sequences. The aim of this project for the publisher was also to explore<br />

multichannel publishing to deliver information to different target devices as personal digital assistants (PDA) and third<br />

generation cell phones.<br />

The final version of the electronic book for computer is a thick client application based on a modified version of Mozilla: -<br />

the book's static content is encoded in XHTML, - the user interface is described with XUL, - and the major part of the<br />

application mechanisms are developed in JavaScript. The electronic book for computer offers the following functionality: -<br />

read textual content, view pictures, play video and audio documents, - search content in the whole book using an<br />

integrated search engine, - browse the table of content of the book and move directly to a given part, chapter, lesson<br />

thanks to encoded links, - annotate images using a minimal editing toolkit containing a brush and a colour selector, -<br />

create and edit XHTML content for pupil’s personal homework.<br />

BrailleNet carried out accessibility tests with common assistive technologies such as screen readers and magnifier<br />

software. The results were negative because of compatibility issues between Mozilla and the screen readers. Even if<br />

these technical problems could have been solved, the use of Macromedia Flash was also another important barrier to<br />

access.<br />

However the book's high level of structure and the large amount of semantic information added to it was a good<br />

opportunity to improve its accessibility. It was decided to go deeper in the study, the publisher let BrailleNet access the<br />

build chain of the application and develop solutions to create an accessible version of the book.<br />

Both XHTML content and XUL content are automatically built from XML data. The source of the whole application is<br />

contained in a unique XML document following a DTD developed by the publisher. This DTD is divided in two parts: - a<br />

first subset is common to every electronic book produced by the publisher, it is close to XHTML, and describes<br />

information as paragraphs, divisions, and images; - a second subset is specific to a given book. It defines the grammar<br />

and the vocabulary of the book: which containers are used, how they can be nested. This part was particularly interesting<br />

because the publisher chose to structure information in a very semantic way in order to output media-specific structures<br />

later in the production process.<br />

The publisher provided BrailleNet with this DTD and the XML source file of the application. It was interesting to note that<br />

the DTD already contained the necessary structures to add textual alternatives to images. However, all these alternatives<br />

were empty in the XML document. But all the documents (textual documents, illustrative pictures, video and audio) were<br />

provided with a textual legend introducing and describing the content.<br />

87 http://www.nathan.fr/Multimedia/cartable/default.asp<br />

88 http://www.editions-bordas.com/<br />

89 http://www.braillenet.org/<br />

63


CWA <strong>15778</strong>:2008 (E)<br />

With the publisher's DTD and XML source files, BrailleNet was able to develop a set of XSL style sheets to convert the<br />

XML source document into XML dtbook and then to an accessible XHTML book where a user can easily navigate with<br />

tables of contents, many internal links, textual alternatives to images etc.<br />

This example is encouraging for the integration of accessibility in the publishing industry. It shows how a publisher was<br />

led to add structure to its content to create new innovative products for its customers. It also shows how easy it was to<br />

create an accessible version of the electronic book from the XML source files of the publisher. This demonstrates the<br />

technical convergence of publishers and accessibility in electronic documents, especially in the case of multichannel<br />

publishing. This technical convergence is also illustrated by the example of a mathematics book from the same publisher:<br />

it was decided that mathematical content will be stored in MathML instead of images in order to be able to convert them<br />

later in the building chain if necessary.<br />

7.5.4 Electronic books<br />

These are examples of best practice in distributing accessible content in the form of school books.<br />

7.5.4.1 Numilog<br />

Numilog is an electronic bookseller based in France and selling books for both French-speaking and English-speaking<br />

markets. The Numilog website proposes a large catalogue of titles for its customers (more than 23,000 available titles).<br />

Customers can choose the format they prefer between: - Mobipocket format (PRC) readable on a computer with<br />

Microsoft operating system, Palm, Pocket PC and smartphone; - Microsoft Reader format (LIT) also readable on all the<br />

above platforms; - PDF for PC and Mac. The catalogue is composed of novels, books for children, documentaries, nonfiction<br />

books (computing, management, biology law, economics, ...) and dictionaries. 90<br />

7.5.4.2 Relations with publishers<br />

Numilog has business relations with publishers to negotiate the rights to distribute their books on the Internet, the selling<br />

price and eventually source file provision. English and American publishers set the price Numilog must pay to obtain the<br />

rights to sell electronic books from their catalogue. Then Numilog is free to decide the selling price and so its profit<br />

margin. The French publishers collaborating with Numilog have decided to apply the policy they use with conventional<br />

book distributors: book prices are set by the publishers themselves. This follows the French law about book prices, called<br />

“loi Lang”. This law was passed in 1981 and prescribes that anyone publishing or importing a book has to define its<br />

selling price. This price must be respected by all distributors. This law does not include electronic books but the<br />

publishers have decided to apply it to this particular case. This means that Numilog has to negotiate with publishers the<br />

selling price in order to keep a de<strong>cen</strong>t profit margin. This price of the electronic version is always lower than the price of<br />

printed books. Most of the time, Numilog has to pay to obtain the electronic files from publishers, and the price fluctuates.<br />

Publishers almost always provide Numilog with files of their books. 90% of these files are prepress files in PDF, Quark<br />

XPress or Adobe inDesign. PDF files are always optimised for press, with cutting lines and very large pictures. The other<br />

10% are author’s files, usually in Microsoft Word. Sometimes Numilog must digitise books because publishers cannot<br />

provide digital files.<br />

Numilog has chosen Adobe Content Server 91<br />

to secure eBook distribution. Content Server is a Web-based system for<br />

publishers, distributors, libraries and booksellers. It automates the supply chain for eBooks and other media by providing:<br />

- an interface for eBook publishing, distribution and procurement; - a way to manage and protect digital rights; - a secure<br />

repository with encryption of eBooks and authentication of transactions; - a business to business transactions model,<br />

including selling eBooks to clients and procurement from vendors; - a lending model for online libraries ... Numilog has<br />

become an Adobe partner for this particular product and can sell li<strong>cen</strong>ces and offer to host this service for a customer.<br />

7.3.3. eBooks preparation<br />

Numilog has set strict quality requirements for eBooks: - file size must be optimised (finding a good ratio between quality<br />

and file size); - all the eBooks must have a cover page; - file textual content must not be provided as images (textual<br />

electronic books only) so that content can be searched, magnified, selected (but not copied for security reasons); -<br />

technical books must be structured and offer a convenient way to access information quickly; - footnotes and endnotes<br />

must be hyper-textual links as often as possible; - external references are checked to avoid broken links. To prepare the<br />

eBooks, Numilog employs people to rework the files provided by publishers.<br />

90 http://www.numilog.fr/<br />

91 http://www.adobe.com/products/contentserver/index.html<br />

64


CWA <strong>15778</strong>:2008 (E)<br />

7.5.4.3 Conclusion<br />

This experience shows again a convergence of interests between the publishing world and the accessible information<br />

world because Numilog's eBook quality requirements are close to accessibility requirements. Numilog faces the same<br />

issues the accessible information world currently faces, publishers files are not structured for electronic publication, and<br />

the additional costs to make structured information from unstructured files is prohibitive.<br />

65


CWA <strong>15778</strong>:2008 (E)<br />

8 Identified gaps and areas for further research<br />

In Section 1 we asked two fundamental questions:<br />

<br />

<br />

How do we describe the questions and problems of end users and content providers?<br />

How do we marry both within manageable and consistent frameworks?<br />

This <strong>workshop</strong> has sought to provide some pointers towards answering these questions. As can be seen, there are many<br />

ongoing initiatives and projects which seek to incorporate accessibility within publishing processes. Largely through<br />

necessity, most of these initiatives have been undertaken by specialist organisations supporting print impaired people. In<br />

some instances, and sometimes in collaboration with publishers and other content providers, this has resulted in<br />

innovative practices and far greater access to information.<br />

However, progress has been fragmented and often very slow. The provision of alternative format materials varies greatly<br />

from country to country (and within national boundaries), according to local conditions and according to the economic<br />

vagaries of provision and depending on different types of impairment. These problems are well-described elsewhere.<br />

8.1 Descriptions & Requirements<br />

Considerable work has been done by specialist organisations in establishing end user requirements and a number of<br />

preferred output formats are well-established. There is less consistency, however, in relating these requirements to reusable<br />

models for use within content processing environments. Combined with a historical tendency to separate<br />

alternative format production from mainstream production and to focus on separate specialised formats, this has lead to<br />

a fragmented approach to implementing accessibility within both specialist and mainstream processes. Similarly, there is<br />

very little consistency in the work undertaken by content providers in this area and there is a corresponding<br />

fragmentation of effort. Such work that has been undertaken tends to focus on web authoring issues and this only<br />

addresses one part of the accessible content processing chain. Indeed, an important outcome from this <strong>workshop</strong> is the<br />

realisation that generic processing models are required for this work to proceed with any degree of coherence. 92<br />

8.1.1 Further Research on Descriptions & Requirements<br />

Systematic descriptions of end user requirements remain problematic. Such requirements are constantly changing and<br />

there has been little effort made to capture the dynamic nature of these requirements.<br />

Further research s required to establish requirements for different types of impairments and to compare and examine<br />

where points of intersection might lead to collaborative efforts.<br />

8.2 Process & Content Modelling<br />

This <strong>workshop</strong> has sought to provide information about different s<strong>cen</strong>arios and to point towards ‘real-life’ examples which<br />

have proven successful. The motivation for content providers to create accessible information will always vary but there<br />

is a clear need for generic processing frameworks which can make this as straightforward as possible.<br />

It appears unrealistic to expect that any one format be accepted by all stakeholders and universally applied. Multimedia<br />

content processing can involve many different types of software and many different processes, thereby making it very<br />

difficult indeed to introduce accessible content processing at the right stages. No single input, representation or output<br />

format can contain these complexities. Given the general move towards distributed media, it is perhaps better to focus on<br />

building frameworks which enable accessible content processing, according to the local preferences and requirements of<br />

all the stakeholders. In short, accessibility is not a format or a product: it is a process.<br />

Our modern use of multimedia information requires that information and services accommodate different presentations<br />

and interaction designs design at the user interface level, on the basis of requirements that include user needs,<br />

preferences, personalisation, customisation, adaptation and constraints; characteristics of the tasks to be performed (e.g.<br />

repetitive, knowledge-intensive, collaborative); capabilities of available access devices; and contextual information. In<br />

92 See for example, the ongoing work of the ProAccess project (http://proaccess.euain.org), co-ordinated by<br />

the Italian Publishers Association and supported by the Federation of European Publishers.<br />

66


CWA <strong>15778</strong>:2008 (E)<br />

providing this level of usability, a fundamental accessibility can be achieved which allows users to interact with content at<br />

a much deeper level.<br />

This work may be at a relatively early stage, but most importantly it is a mainstream endeavour: the modelling and<br />

aggregation of content is a <strong>cen</strong>tral concern for all those in the private and public sector.<br />

The ever-increasing complexity of dealing with information from structured and unstructured media (images, sound, text,<br />

recordings, etc.), in several working modalities and in multitasking modes, makes adapting to context and content a<br />

necessity. Additionally, what is becoming more and more important is a universal, scalable, adaptive and customisable<br />

multidimensional interface to media content where appropriate media viewpoints/perspectives can be presented to the<br />

users adapted to their preferences, workflow constraints, and interaction models.<br />

Existing approaches to user-content interaction are characterised by a lack of a holistic view to the complex problem of<br />

designing accessibility of interaction and content and they fail to look at the process of authoring, managing and<br />

delivering the content as being highly inter-woven. Also, the majority of these approaches either look at accessibility,<br />

personalisation or context of use problems; they do not deal with the more complex issue of user interaction and content<br />

presentation as a whole and the open-ended and frequently changing real world environments.<br />

The various MPEG family members operate at different abstraction levels with some communication between these<br />

abstraction levels. The process of contriving a procedure to interface the various processing levels should be based on<br />

use. The difficulty lies in achieving a level of description of the user requirements that allows re-description in<br />

technological terms. This re-description ideally leads to specifications and ultimately implementations. These<br />

implementations ‘prove’ the viability of the concept: it is the proof of the hypothesis. The process of standardisation that<br />

runs in parallel with this ensures extraction of higher level descriptions and these are aggregated down to the earlier<br />

family members. Using this built-in feature to provide ‘slots’ for common and specialised accessibility requirements would<br />

create what we refer to as accessibility from scratch (see above). If embedded in the family tradition of the MPEG<br />

initiative, accessibility might become a commonly available feature instead of a workaround necessity.<br />

The representation of the interplay between the various user groups should always remain accessible. If all relevant<br />

entities in a representation system remain accessible, creating meaningful mappings is a matter of connecting the<br />

appropriate entities. For this reason, accessibility from scratch is of fundamental importance.<br />

8.2.1 Further research on process and content modelling<br />

There is a need to develop open source frameworks to bridge the gap between original content design heuristics and<br />

intuitive multimodal interfaces required for content and communication systems.<br />

Such frameworks would build in profile-based access to information, content and services, which not only bring together<br />

and extend state-of-the-art technologies for information access, but also conform to standards and guidelines available<br />

for accessibility, usability, scalability and adaptability.<br />

There is a need to conduct basic research to establish the nature of the interaction between people with cognitive<br />

impairments and multimedia information. A critical and guiding factor is that the supply of information should be<br />

determined by the end user from a <strong>cen</strong>tral content reserve, thereby allowing the end- user the freedom to explore the<br />

information as they see fit and to make their own choices regarding how the information is to be displayed, rather than<br />

through the sometimes discriminatory filtering processes of information gatekeepers.<br />

8.3 Introducing and using metadata for accessibility purposes<br />

People compress information. People decompress information. The compression procedure involves filtering out<br />

redundant information based on the perspective of the user. How do we decide which redundant data entities are<br />

relevant for the user? What to use? On what requirements are these redundant data entities based? Whose<br />

requirements? How do we marry the existence of these accessibility metadata entities with the requirements as<br />

described in “common” metadata entities? More importantly, how do we ensure a synchronised and therefore valid<br />

coupling between any kind of content with these metadata entities? How do we ensure that any metatags themselves<br />

remain accessible? What is the context of any accessibility metatags that are to be conceived?<br />

How then can we make sure that the context remains consistent? If we describe the knowledge that is applied to enable<br />

processes to exist in a digital system that parallels analogue organisational systems, knowledge is transferred from the<br />

individual participants to a shared information framework. The use of knowledge can be separated into three parts: the<br />

body of information that is contained inside knowledge structures; static information about the knowledge processing,<br />

which is also known as meta-information or metatags; dynamic information that is used to describe the processes and<br />

67


CWA <strong>15778</strong>:2008 (E)<br />

procedures to retrieve, transform or use the content. By introducing metatags that aim to address the needs for<br />

accessible information processing, it is mandatory to describe the procedures that will meaningfully interpret these meta<br />

tags to communicate the content in a way that enables every person to appreciate the content. Creating meaningful<br />

mappings between the static redundant information -the meta tags- and the dynamic processes<br />

Many people believe structure to be static: from a meta modelling perspective this is not the case. It is well known that if<br />

the representation of the information at hand is perceived by the system and mapped onto a framework, the information<br />

is then usable in a multitude of ways: and for this reason non-programmers will often promote the use of XML.<br />

However, this mark up and the set of tools that surround it are simply a set of tools which exist to achieve this objective. If<br />

the architecture of the system does not answer the wider range of needs, requirements and questions, the mark up<br />

cannot paper over the cracks. In order to build extensibility into a system, the architecture should be such that every<br />

element used for processing the information is adaptable. This can be achieved by building a representation layer which<br />

builds an object oriented structure from the information and which is free to adapt the meta relationships and hierarchies<br />

intrinsic in that data genus. This is defined by identifying the parameters upon which the structure is built, and ensuring<br />

they are interconnected in such a way that promotes future adaptability without degrading the system: which is to say,<br />

using the right parameters for accessible information processing.<br />

As noted in 8.1.1 above, the goal should be to anticipate the changes in user requirements. These changes can occur in<br />

the very nature of the requirements, such as new functional groups or in the definition of the existing requirements, such<br />

as additional details. These aims should be pursued by adding redundant information in the form of meta tags, thus<br />

augmenting the quality of the content. The content itself and the existing meta tag structures, including their mapping to<br />

the meta modelling domain, is not allowed to change. From a meta modelling perspective, this allows us to meet<br />

changing requirements for the future, because if the requirements demand additional detail in the form of features or<br />

metadata, we can unveil the metadata that is available.<br />

8.3.1 Further research on introducing and using metadata for accessibility purposes<br />

Further research is required to identify and investigate the ways in which metadata can help achieve efficient and futureproof<br />

solutions to accessibility.<br />

In order to make this perceived information useful, it must be represented within an architecture which allows the<br />

accessibility requirements to be questioned in more than one way. Such architecture must enable both the core system<br />

to adapt to new and changing representation requirements, and to allow (theoretically) infinite user requirements.<br />

8.4 Standards and personalisation of content<br />

Personalisation of media makes several demands on standards. In particular, it requires:<br />

that the modalities in content are identified<br />

that the modalities or adaptations of them that a user requires in the context are identified<br />

that the glue standards that enable these things to work exist<br />

We perceive the world partly by using our senses. Modalities are the aspects or components of media or system<br />

interfaces that correspond with those senses and enable us to perceive them. For example a video usually has a visual<br />

aspect or component and this corresponds with the sense of sight. Without at least some match between the modalities<br />

available in the media and the senses a user has available at the time or access modes a user can use there can be no<br />

perception or use of the media. Therefore it is very useful if the modalities available in media or provided in an interface<br />

can be described. Doing so permits matching to the modalities a user has available or the authoring of adaptations to<br />

enable that matching.<br />

There is not a simple precise match between modalities and senses because some senses are used in complex<br />

interactive ways – for example the sense of sight and reading – but there is a general broad matching.<br />

Traditionally, ways to describe media on computers has developed in ways suited to the needs of computer design and<br />

not so well suited to use by people. An example would be the Multipurpose Internet Mail Extensions standard for<br />

describing media formats (MIME-types) with email but widely used for other purposes. This does not describe modalities.<br />

Only now are appropriate standards supporting modality description emerging.<br />

ISO/IEC JTC 1/SC 36 Committee for Information Technology for Learning <strong>Education</strong> and Training has<br />

produced a standard for description of access modalities and adaptations for those. This is the Individualized<br />

68


CWA <strong>15778</strong>:2008 (E)<br />

Adaptability and Accessibility in E-learning, <strong>Education</strong> and Training. 24751. This was initially developed in IMS<br />

and then internationalised in ISO. It is planned to be publicly available at the end 2007.<br />

The standard provides for description and matching of resources to personal contexts/requirements. It has<br />

three parts – a Framework that shows how to use the other parts, A standard for description of Digital<br />

Resources (Digital Resource Description DRD) and a standard for description of functional learner<br />

requirements Personal Needs and Preferences (PNP) that enables matching of resources and adaptations to<br />

user requirements and specific contexts. Currently the vocabulary for modalities within the standard can be<br />

used to describe the modalities of visual, textual, auditory, tactile and olfactory. Vocabularies for description of<br />

adaptations for these (for example audio description) are more extensive as these extend to representation<br />

forms used on computers and with assistive technology. Several new parts to the standard are under<br />

construction including parts for description of requirements and components for offline media and services<br />

and and for places and events.<br />

8.4.1 Further Research on personalisation of content<br />

In order usefully to use the modalities within media to match to or adapt to user requirements a few glue standards are<br />

needed. This is an area of active development in the standards.<br />

To make it all work there is needed the development of best practices with particular media. This CWA presents some of<br />

those best practices. Even more progress towards providing truly personalisable media could be obtained with the<br />

development of more formally-described practices. Ontologies describing the use of different media components across<br />

different media types would be very useful here. For example, most media types contain alternatives that could be<br />

described and matched to a context as described above, but the media types all do it different ways. What is needed is<br />

some way to operate across media types with the same principles.<br />

IEEE Learning Technology Standards Committee is developing a standard currently called Resource Aggregation Model<br />

for Learning, <strong>Education</strong> and Training. This work is constructing a standard ontology that tackles this cross-media issue,<br />

though the precise requirements for adaptation for accessibility have not yet been addressed in that work. The work can<br />

be found on http://www.ieeeltsc.org/working-groups/wg11CMI/ramlet/<br />

Alternatives within media types are further described elsewhere in this CWA.<br />

8.5 Li<strong>cen</strong>sing and technical protection measures<br />

It is evident that neither at international level nor at European level is there any requirement to provide exceptions to<br />

copyright protection facilitating access to protected materials by visually impaired people. Further, the measures on anticircumvention<br />

of technical protection measures introduced by the WIPO Copyright Treaty (WCT) and the WIPO<br />

Performances and Phonograms Treaty (WPPT) in 1996 are not matched by any provision to accommodate exceptions to<br />

copyright protection. The European Union Copyright Directive does address this issue in article 6.4.1 but gives no<br />

indication how the conflict is to be resolved nor does it require resolution of the conflict for content made available via<br />

interactive digital services.<br />

The legislative provisions found at national level are diverse. The reasons for this state of affairs include the following:<br />

The variety of digital file formats used by publishers<br />

The complexities of format and structure conversion and the provision of the corresponding resource<br />

requirement<br />

The concerns of publishers regarding the release to third parties of digital text files<br />

Concerns by publishers that they may be impeded from collaborating by not having the requisite<br />

rights to authorise conversion into particular formats e.g. audio books<br />

Security is a vital issue for publishers and technical protection measures are a complex issue for most content holders.<br />

Every publisher’s content, client base and requirements are different, which often results in a personalised set of<br />

requirements for each case. As a result, approaches to li<strong>cen</strong>sing and <strong>agreement</strong>s on accessible formats are often<br />

negotiated on a case-by-case basis. Naturally, publishers have to be confident that any digital format is being delivered<br />

through secure gateways to only the people who are intended to receive it. For these reasons, there is a perception that<br />

the provision of digital files in alternative formats may compromise technical protection measures. Combined with a<br />

widespread belief that the provision of accessible format materials is expensive and time-consuming, only limited<br />

progress has thus far been made.<br />

69


CWA <strong>15778</strong>:2008 (E)<br />

Much of the discussion around DRM and Accessibility has necessarily focused on the right of access versus the need to<br />

protect content. However, points of common interest exist and the development of trusted intermediary concepts can<br />

offer real-world solutions.<br />

From a technical perspective, earlier problems relating to the digitisation of materials have been largely overcome and<br />

re<strong>cen</strong>t formats (such as XML, RDF, METS, MARC21 etc) provide a realistic basis for implementing the different aspects<br />

of this work. It is now possible to address the key concerns of content creators and providers and coherently to address<br />

issues such as: automation of document structuring, adherence to emerging standards, workflow support, digital rights<br />

management and secure distribution platforms.<br />

As the lifetime of a book gets shorter and shorter, publishers frequently have to offer access to digital versions of that<br />

book and taking this into account when constructing the layout brings us much closer to real accessibility in the wider<br />

sense. Indeed, it has been the accessibility community that has in many ways pioneered new structures for digital<br />

content, as these developments are often borne of need.<br />

Trusted intermediaries establish a personalised relationship between content holders and specialist organisations<br />

whereby publishers and agencies serving blind and partially sighted people work together in a secure and trusting<br />

environment to increase the quantity and timeliness of titles available in an accessible format. Within trusted intermediary<br />

frameworks, DRM is an enabler of controlled access. A number of different security methods are being developed or are<br />

already in use for making content available in this way.<br />

As far as security is concerned, the higher the level the more likely publishers are to allow content to be made available<br />

in accessible digital formats. At present, the security systems used are simple, they use basic encryption technologies<br />

with key exchange mechanisms. The potential for the release of content is considerable – although there are few<br />

recorded instances of such occurring. Once decrypted, content is available to anyone, authorised or not. The ability to<br />

attach content to particular devices, or better to provide access only to authorised users, requires a level of DRM<br />

sophistication that is not yet generally in place in services catering to the needs of visually impaired people.<br />

8.5.1 Further research on li<strong>cen</strong>sing and technical protection measures<br />

There is a need to examine and describe existing practice in this area, with particular focus on the implementation of<br />

trusted intermediary environments.<br />

Further research is required to examine accessibility in the wider sense and to examine the requirements for modelling<br />

accessibility and DRM within emerging multimedia environments.<br />

9 Conclusion and future work<br />

Accessible content started as a niche market mainly targeting people with visual impairments and dyslexic users.<br />

Publishers have re<strong>cen</strong>tly realised the potential market and are offering alternative book versions (audio, large print, etc.)<br />

of newly published editions together with the printed ones. Accessible content can be used in different situations and<br />

cover part of the changing requirements of the user population in the framework of ambient intelligence for content<br />

anytime, anywhere, and with any service. Usage examples include, users on the move, multitasking environments that<br />

require hands-free, eyes-free access and browsing, learning and instructional s<strong>cen</strong>arios, and many others.<br />

Despite this, the full potential of accessible content has not been released yet, due to a mismatch between existing<br />

standards regarding accessibility and their implementation in publishing processes. This CWA aims to reduce that gap<br />

and realise the full market potential of accessible content. Therefore, several real-life publishing s<strong>cen</strong>arios have been<br />

analysed and related actors, formats, conversions and standards presented.<br />

In this elaboration the following conclusions can be drawn:<br />

(a) There are many common tasks that need to be covered when introducing accessibility into the publishing workflow.<br />

Those tasks deal with the definition of document style sheets that are agreed and disseminated to the actors, manual<br />

correction of conversion input/output (in the case where content creators did not follow style sheet requirements),<br />

extension of descriptions regarding visual data (images, videos) and other multimedia material and lead to common<br />

requirements.<br />

(b) Specific format conversions need to take place in order to accommodate user needs. In these format conversions, the<br />

original structure, annotation and metadata and intellectual property rights of the parent documents need to be inherited<br />

to the child documents and not be lost. One of the strategies that seem to be most viable is to use a media rich (including<br />

70


CWA <strong>15778</strong>:2008 (E)<br />

annotations, and metadata) structured format as reference and then create the desired output operating direct on that<br />

format. An additional advantage of this strategy is that this format can be ideally also used for backups and to be stored<br />

in the publishing archive.<br />

c) Existing research, standards and technologies can be used in order to transform current publishing workflows into<br />

accessible content processing workflows. In order to be successful in this transformation, the barriers publishers face for<br />

deploying accessibility into their workflow must fall and the in<strong>cen</strong>tives for producing accessible content increased.<br />

Barriers will fall when automated format conversions and clear stylesheets and guidelines are followed. Although there is<br />

still a lot of research to be done in order to robust and reliable produce automatically accessible documents from any<br />

source, there are nowadays means to economically implement accessible content processing workflows. In<strong>cen</strong>tives for<br />

producing accessible content may come from opening new markets for publishers e.g. personalised information delivery,<br />

electronic distribution, efficient storage, preservation, search and retrieval of publishing titles, and therefore a greater<br />

economic return is expected.<br />

The analysis of accessible information provision showed also that there are some specific steps that need to<br />

be done to complement and extent the accessible information publishing process. Those are:<br />

<br />

<br />

<br />

<br />

For software developers and ICT researchers, to develop frameworks and tools to support actors in<br />

content production to fulfil the accessibility requirements, including automated conversion of both<br />

single and multi-type composite document formats into accessible documents, personalised<br />

presentation of content and adaptive content interfaces, li<strong>cen</strong>sing a technical protection measures.<br />

The focus should be to solutions that address specific sector requirements (eGovernment, eLearning,<br />

Medical Documents, Scientific documents, etc.)<br />

For accessibility researchers to look more clearly and define requirements of users with special<br />

needs in the information provision, and based on that, propose and/or adapt standards for the<br />

publishing and content management industry.<br />

For the content management industry to bring accessible information provision on its agenda and<br />

start proposing standards in this area, supporting open framework and standards development.<br />

For publishers and publishing associations to work on elaborating tools used by authors in the<br />

content authoring process, and systems used in content production and delivery with the aim to<br />

provide detailed input for on-going accessible format and accessibility software and standards<br />

development and be part of future development on process and content modelling efforts for<br />

accessible information provision<br />

It is clear from the above elaboration that accessible information provision requires interdisciplinary efforts in<br />

order to be realised. We look forward to this realisation process and hope to attract the best solutions from<br />

each area and to see barriers to access information reducing.<br />

71


CWA <strong>15778</strong>:2008 (E)<br />

Appendix A – Relevant standards<br />

Nr. Title Subject (if available) URL<br />

BP – PRD<br />

BP – PRO<br />

MPEG 21<br />

Open eBook Publication Structure<br />

specification<br />

ST – PRD<br />

The International Standard Text Code<br />

(ISTC)<br />

This document, derived from the guidelines for TEI Lite, provides an introduction<br />

to the recommendations of the Text Encoding Initiative (TEI), by describing a<br />

subset to, and extension of, the full TEI encoding scheme developed for marking<br />

up OUCS web pages and course documentation.<br />

I have prepared a set of XSLT specifications to transform TEI XML documents to<br />

HTML, and to XSL Formatting Objects. I have con<strong>cen</strong>trated on TEI Lite, but<br />

adding support for other modules is fairly easy, and I am working my way<br />

through the TEI as applications come along. In the main, the setup has been<br />

used on `new' documents, i.e. reports and web pages that I have authored from<br />

scratch, rather than traditional TEI-encoded existing material.<br />

MPEG-21 is developed within the International Standard Organisation (ISO) and<br />

aims at defining a normative open framework for multimedia delivery and<br />

consumption for use by all the players in the delivery and consumption chain<br />

The Text Encoding Initiative (TEI) Guidelines are an international and<br />

interdisciplinary standard that facilitates libraries, museums, publishers, and<br />

individual scholars represent a variety of literary and linguistic texts for online<br />

research, teaching, and preservation.<br />

The International Standard Text Code (ISTC) is developed by ISO Project 21047<br />

and aims at a unique, international identification of individual textual works.<br />

http://www.chiariglione.org/mpeg/standards/<br />

mpeg-21/mpeg-21.htm<br />

http://www.idpf.org/specs.htm<br />

http://www.tei-c.org/<br />

http://www.collectionscanada.ca/iso/tc46sc9<br />

/wg3.htm<br />

AEN/CTN 139<br />

AEN/CTN 153<br />

Authoring Tools<br />

Working Group<br />

(AUWG)<br />

Computer applications for people with<br />

disabilities. Computer accessibility<br />

requirements. Software<br />

Audio description for visually impaired<br />

people. Guidelines for audio description<br />

procedures and for the preparation of<br />

audio guides<br />

Authoring Tool Accessibility Guidelines<br />

1.0 (ATAG 1.0)<br />

http://www.aenor.es/desarrollo/inicio/home/<br />

home.asp<br />

http://www.aenor.es/desarrollo/inicio/home/<br />

home.asp<br />

http://www.oucs.ox.ac.uk/oucsweb/teioucs.xml<br />

http://xml.web.cern.ch/XML/www.teic.org/Stylesheets/teixsl.html<br />

http://www.w3.org/TR/2000/REC-ATAG10-<br />

20000203/<br />

72


CWA <strong>15778</strong>:2008 (E)<br />

Nr. Title Subject (if available) URL<br />

Authoring Tools<br />

Working Group<br />

(AUWG)<br />

BS 7000-6:2005<br />

Authoring Tool Accessibility Guidelines<br />

2.0 (ATAG 2.0)<br />

Design management systems. Managing<br />

inclusive design. Guide<br />

http://www.w3.org/TR/ATAG20/<br />

http://www.bsiglobal.com/Quality_management/Design/bs<br />

7000-6.xalter<br />

CAN/CSA-B659-01 Design for Aging http://www.csaintl.org/onlinestore/GetCatalogItemDetails.a<br />

sp?mat=000000000002012683<br />

Draft ISO/IEC 24751<br />

Individualised Adaptability and<br />

Accessibility in E-learning, <strong>Education</strong><br />

and Training<br />

The scope of this multi- part standard is to provide a common framework to<br />

facilitate matching of learner accessibility needs and preferences with<br />

appropriate learning resources and user interfaces.<br />

Part 1: Framework<br />

Part 2: Access For All Personal Needs and Preferences Statement<br />

Part 3: Access For All Digital Resource Description<br />

Part 1: http://jtc1sc36.org/doc/36N1139.pdf<br />

(temporary URL)<br />

Part 2: http://jtc1sc36.org/doc/36N1140.pdf<br />

(temporary URL)<br />

Part 3: http://jtc1sc36.org/doc/36N1141.pdf<br />

(temporary URL)<br />

Draft ISO/IEC 26513 Software and systems engineering –<br />

User documentation requirements for<br />

documentation evaluators and testers.<br />

Draft ISO/IEC 26514 Software and systems engineering –<br />

User documentation requirements for<br />

documentation designers and<br />

developers.<br />

http://www.jtc1-sc7.org/<br />

To be developed by ISO/IEC JTC 1/SC 7/WG 2 – Software and systems<br />

documentation.<br />

Should contain both requirements and recommendations on all aspects of<br />

documentation evaluation and testing.<br />

Being developed by ISO/IEC JTC 1/SC 7/WG 2 – Software and systems<br />

documentation. Contains both requirements and recommendations on all<br />

aspects of documentation including planning, design, production and<br />

maintenance. Several clauses provide guidance on accessible documentation,<br />

notably 12.5<br />

Estimated ISO publication in 2010.<br />

Estimated ISO publication in 2008.<br />

EN 1332-4<br />

Identification Card Systems - Manmachine<br />

interface - Part 4: Coding of<br />

user requirements for people with special<br />

needs<br />

Machine readable cards facilitate the provision of a growing variety of services<br />

across Europe. The purpose of EN 1332 is to increase the accessibility of these<br />

services for the benefit of consumers. This will be achieved by facilitating the<br />

inter-sector and cross border interpretability of machine-readable cards and to<br />

do so with the maximum possible degree of user-friendliness. EN 1332<br />

addresses the needs of all users, including people with special needs, for<br />

example the aged, minors, the disabled, the visually impaired, those with<br />

learning difficulties, first time users, those not conversant with the local<br />

language.<br />

http://www.bsi-<br />

global.com/en/Shop/Publication-<br />

Detail/?pid=000000000030009505<br />

73


CWA <strong>15778</strong>:2008 (E)<br />

Nr. Title Subject (if available) URL<br />

ETSI EG 202 416:<br />

User interfaces: Set up procedures for<br />

mobile terminals and services<br />

Being developed by ETSI HF STF 285 – Guidelines for set up procedures for<br />

mobile terminals and services.<br />

The ETSI Guidelines (EG) will include documentation and include<br />

recommendations for disabled and elderly users.<br />

Available from ETSI web site free-ofcharge.<br />

http://www.etsi.org/services_products/freest<br />

andard/home.htm<br />

ETSI EG 202 417<br />

User education guidelines for mobile<br />

terminals and e-services<br />

Available from ETSI web site free-ofcharge.<br />

http://www.etsi.org/services_products/freest<br />

andard/home.htm<br />

IEEE standard<br />

RAMlet – Reference model for resource<br />

aggregation.<br />

http://ieeeltsc.org/wg11CMI/ramlet/Pub/<br />

also:<br />

http://www.ieeeltsc.org/workinggroups/wg11CMI/ramlet/Pub/RAMLET_proj<br />

ect_description.pdf/view<br />

IMS standard<br />

IMS AccessForAll Meta-data<br />

Specification 1.0<br />

http://www.imsglobal.org/accessibility<br />

IMS standard<br />

IMS Learner Information Package<br />

Accessibility for LIP 1.0<br />

http://www.imsglobal.org/accessibility<br />

ISO 10075:1991<br />

Ergonomic principles related to mental<br />

work-load -- General terms and<br />

definitions<br />

http://www.iso.org/iso/en/CatalogueDetailPa<br />

ge.CatalogueDetail?CSNUMBER=18045&I<br />

CS1=13&ICS2=180&ICS3=<br />

ISO 14915-1:2002<br />

Software ergonomics for multimedia user<br />

interfaces -- Part 1: Design principles<br />

and framework<br />

http://www.iso.org/iso/en/CatalogueDetailPa<br />

ge.CatalogueDetail?CSNUMBER=25578&s<br />

copelist=ALL<br />

ISO 14915-2:2003<br />

Software ergonomics for multimedia user<br />

interfaces -- Part 2: Multimedia<br />

navigation and control<br />

http://www.iso.org/iso/en/CatalogueDetailPa<br />

ge.CatalogueDetail?CSNUMBER=28583&s<br />

copelist=ALL<br />

ISO 20282-1:2006 Ease of operation of everyday products -<br />

- Part 1: Context of use and user<br />

characteristics<br />

ISO 20282-1:2006 provides requirements and recommendations for the design<br />

of easy-to-operate everyday products, where ease of operation addresses a<br />

subset of the concept of usability concerned with the user interface by taking<br />

account of the relevant user characteristics and the context of use.<br />

http://www.iso.org/iso/en/CatalogueDetailPa<br />

ge.CatalogueDetail?CSNUMBER=34122&s<br />

copelist=PROGRAMME<br />

74


CWA <strong>15778</strong>:2008 (E)<br />

Nr. Title Subject (if available) URL<br />

ISO 2108:2005 Information and documentation --<br />

International standard book number<br />

(ISBN)<br />

ISO TS 20282-2:2006 Ease of operation of everyday products -<br />

- Part 2: Test method<br />

The purpose of ISO 2108:2005 is to establish the specifications for the<br />

International Standard Book Number (ISBN) as a unique international<br />

identification system for each product form or edition of a monographic<br />

publication published or produced by a specific publisher. It specifies the<br />

construction of an ISBN, the rules for its assignment and use, the metadata to be<br />

associated with the ISBN allocation, and the administration of the ISBN system.<br />

ISO 20282-2:2006 specifies a test method for measuring the ease of operation<br />

of "walk-up-and-use" products. The purpose of the test is to provide a basis for<br />

predicting the ease of operation of a walk-up-and-use product, including<br />

measures of its effectiveness and efficiency of operation, and the satisfaction of<br />

the intended user population in its expected context of use.<br />

http://www.isbninternational.org/en/index.html<br />

http://www.iso.org/iso/en/CatalogueDetailPa<br />

ge.CatalogueDetail?CSNUMBER=36452&s<br />

copelist=PROGRAMME<br />

ISO/AWI TR 22411<br />

ISO/CD 9241-20<br />

ISO/DIS 9241-151<br />

ISO/DIS 9241-171<br />

ISO/DIS 9241-300<br />

ISO/IEC 11581<br />

Ergonomic data and guidelines for the<br />

application of ISO/IEC Guide 71 in<br />

standards related to products and<br />

services to address the needs of older<br />

persons and persons with disabilities<br />

Ergonomics of human system interaction<br />

-- Accessibility guideline for information<br />

communication equipment and services -<br />

- Part 20: General guidelines<br />

Ergonomics of human-system interaction<br />

-- Part 151: Guidance on World Wide<br />

Web user interfaces<br />

Ergonomics of human-system interaction<br />

-- Part 171: Guidance on software<br />

accessibility<br />

Ergonomics of human-system interaction<br />

-- Part 300: Introduction to requirements<br />

and measurement techniques for<br />

electronic visual displays<br />

Information technology -- User system<br />

interfaces and symbols-part 1-6<br />

http://www.iso.org/iso/en/CatalogueDetailPa<br />

ge.CatalogueDetail?CSNUMBER=40933&s<br />

copelist=PROGRAMME<br />

http://www.iso.org/iso/en/CatalogueDetailPa<br />

ge.CatalogueDetail?CSNUMBER=40727&s<br />

copelist=PROGRAMME<br />

http://www.iso.org/iso/en/CatalogueDetailPa<br />

ge.CatalogueDetail?CSNUMBER=37031&s<br />

copelist=ALL<br />

http://www.iso.org/iso/en/CatalogueDetailPa<br />

ge.CatalogueDetail?CSNUMBER=39080&s<br />

copelist=ALL<br />

http://www.iso.org/iso/en/CatalogueDetailPa<br />

ge.CatalogueDetail?CSNUMBER=40096&s<br />

copelist=PROGRAMME<br />

http://www.iso.org/iso/en/CatalogueListPag<br />

e.CatalogueList?COMMID=4768&scopelist<br />

=ALL<br />

75


CWA <strong>15778</strong>:2008 (E)<br />

Nr. Title Subject (if available) URL<br />

ISO/IEC 18019:2004<br />

Guidelines for the design and<br />

preparation of user documentation for<br />

application software<br />

Provides guidelines for the design and preparation of user documentation for<br />

application software. It describes how to establish what information users need,<br />

how to determine the way in which that information should be presented to the<br />

users, and how then to prepare the information and make it available. Contains<br />

recommendations on implementing accessibility for documentation (clause<br />

4.2.6).<br />

ISO/IEC 18019:2004 Software and system engineering --<br />

Guidelines for the design and<br />

preparation of user documentation for<br />

application software<br />

http://www.iso.org/iso/en/CatalogueDetailPa<br />

ge.CatalogueDetail?CSNUMBER=30804&I<br />

CS1=35&ICS2=80&ICS3=<br />

ISO/IEC 26300:2006<br />

OASIS standard<br />

Information technology -- Open<br />

Document Format for Office Applications<br />

(OpenDocument) v1.0<br />

The OpenDocument specification defines an XML schema for office applications<br />

and its semantics. The schema is suitable for office documents, including text<br />

documents, spreadsheets, charts and graphical documents like drawings or<br />

presentations, but is not restricted to these kinds of documents.<br />

The schema provides for high-level information suitable for editing documents. It<br />

defines suitable XML structures for office documents and is friendly to<br />

transformations using XSLT or similar XML-based tools<br />

http://www.oasisopen.org/committees/tc_home.php?wg_abb<br />

rev=office,<br />

http://www.iso.org/iso/en/CatalogueDetailPa<br />

ge.CatalogueDetail?CSNUMBER=43485&s<br />

copelist=ALL<br />

ISO/IEC CD 24756<br />

Information technology -- Algorithmic<br />

framework for determining accessibility<br />

for individual users of interactive systems<br />

http://www.iso.org/iso/en/CatalogueListPag<br />

e.CatalogueList?COMMID=4768&scopelist<br />

=ALL<br />

ISO/IEC CD TR<br />

19766<br />

Information Technology -- Guidelines for<br />

the design of icons and symbols<br />

accessible to all users, including the<br />

elderly and persons with disabilities<br />

http://www.iso.org/iso/en/CatalogueListPag<br />

e.CatalogueList?COMMID=4768&scopelist<br />

=ALL<br />

ISO/IEC DTR 19765<br />

Information technology -- Survey of<br />

existing icons and symbols for elderly<br />

and disabled persons<br />

http://www.iso.org/iso/en/CatalogueListPag<br />

e.CatalogueList?COMMID=4768&scopelist<br />

=ALL<br />

ISO/IEC NP 24786-1<br />

Information Technology - User Interfaces<br />

- Accessible User Interface for<br />

Accessibility Setting on Information<br />

Devices -- Part 1: General and methods<br />

to start<br />

http://www.iso.org/iso/en/CatalogueDetailPa<br />

ge.CatalogueDetail?CSNUMBER=41556&s<br />

copelist=ALL<br />

ISO/IEC TR 19764<br />

Information technology -- Guidelines,<br />

methodology and reference criteria for<br />

cultural and linguistic adaptability in<br />

information technology products<br />

http://www.iso.org/iso/en/CatalogueListPag<br />

e.CatalogueList?COMMID=4768&scopelist<br />

=ALL<br />

76


CWA <strong>15778</strong>:2008 (E)<br />

Nr. Title Subject (if available) URL<br />

ISO/TR 16982:2002<br />

Ergonomics of human-system interaction<br />

-- Usability methods supporting human<strong>cen</strong>tred<br />

design<br />

http://www.iso.ch/iso/en/CatalogueDetailPa<br />

ge.CatalogueDetail?CSNUMBER=31176&s<br />

copelist=<br />

ISO/TS 16071:2003<br />

Ergonomics of human-system interaction<br />

-- Guidance on accessibility for humancomputer<br />

interfaces<br />

http://www.iso.org/iso/en/CatalogueDetailPa<br />

ge.CatalogueDetail?CSNUMBER=30858&s<br />

copelist=ALL<br />

Italian Government<br />

Law n. 4, January 9, 2004 - Provisions to<br />

support the access to information<br />

technologies for the disabled (also<br />

known as "The Stanca Act").<br />

http://www.pubbliaccesso.gov.it/english/ind<br />

ex.htm<br />

JIS X 8341-1<br />

Japanese Industrial Standards<br />

Committee.<br />

Guidelines for older persons and persons with disabilities -- Information and<br />

communications equipment, software and services -- Part 1: Common<br />

Guidelines<br />

http://www.webstore.jsa.or.jp/webstore/Com<br />

/FlowControl.jsp?lang=en&bunsyoId=JIS+X<br />

+8341-<br />

1%3A2004&dantaiCd=JIS&status=1&page<br />

No=0<br />

JISS0032<br />

Japanese Industrial Standards<br />

Committee<br />

Guidelines for the elderly and people with disabilities - Visual signs and displays<br />

- Estimation of minimum legible size for a Japanese single character<br />

http://www.webstore.jsa.or.jp/webstore/Com<br />

/FlowControl.jsp?lang=en&bunsyoId=JIS+S<br />

+0032%3A2003&dantaiCd=JIS&status=1&<br />

pageNo=0<br />

JISS0033<br />

Japanese Industrial Standards<br />

Committee<br />

Guidelines for the elderly and people with dasabilities - Visual signs and displays<br />

- A method for color combination based on categories of fundamental colors as a<br />

functionof age<br />

http://www.webstore.jsa.or.jp/webstore/Com<br />

/FlowControl.jsp?lang=en&bunsyoId=JIS+S<br />

+0033%3A2006&dantaiCd=JIS&status=1&<br />

pageNo=0<br />

JISZ8071<br />

Japanese Industrial Standards<br />

Committee<br />

Guilines for standards developers to address the needs of older persons and<br />

persons with disabilities<br />

http://www.webstore.jsa.or.jp/webstore/Com<br />

/FlowControl.jsp?lang=en&bunsyoId=JIS+Z<br />

+8071%3A2003&dantaiCd=JIS&status=1&<br />

pageNo=0<br />

Nordic Cooperation on<br />

Disability<br />

Nordic Guidelines for Computer<br />

Accessibility<br />

http://trace.wisc.edu/docs/nordic_guidelines<br />

/nordic_guidelines.htm<br />

OASIS standard OASIS DITA Language Specification The Darwin Information Typing Architecture (DITA) specification defines both a)<br />

a set of document types for authoring and organising topic-oriented information;<br />

and b) a set of mechanisms for combining and extending document types using<br />

a process called specialisation.<br />

http://www.oasisopen.org/committees/download.php/12091/<br />

cd2.zip<br />

77


CWA <strong>15778</strong>:2008 (E)<br />

Nr. Title Subject (if available) URL<br />

Standard from:<br />

EDItEUR jointly with<br />

Association of<br />

American Publishers,<br />

Book Industry<br />

Communication and<br />

the Book Industry<br />

Study Group.<br />

ONIX International<br />

ONIX International is the international standard for representing and<br />

communicating book industry product information in electronic form,<br />

incorporating the core content which has been specified in national initiatives<br />

such as BIC Basic and AAP’s ONIX Version 1'<br />

http://www.editeur.org/onix.html<br />

Swedish Government,<br />

Office of the Disability<br />

Ombudsman<br />

Guidelines for an accessible public<br />

administration<br />

http://www.tillganglig.se/start.asp?lang=en&<br />

sida=1450<br />

US section 508 US Section 508 , on the requirements for accessibility for public procurement.<br />

This act requires all federal agencies' electronic and information technology is<br />

accessible to people with disabilities.<br />

http://www.section508.gov/<br />

User Agent<br />

Accessibility<br />

Guidelines Working<br />

Group (UAWG)<br />

User Agent Accessibility Guidelines<br />

(UAAG) 1.0<br />

http://www.w3.org/TR/2002/REC-UAAG10-<br />

20021217/<br />

W3C<br />

Recommendation<br />

EARL (W3) , the Evaluation and Report<br />

Language.<br />

The Evaluation and Report Language is a standardized vocabulary to express<br />

test results.<br />

http://www.w3.org/TR/EARL10-Schema/<br />

W3C<br />

Recommendation<br />

Web Content Accessibility Guidelines 1.0<br />

WCAG 1.0 has 14 guidelines that are general principles of accessible design.<br />

Each guideline has one or more checkpoints that explain how the guideline<br />

applies in a specific area.<br />

http://www.w3.org/WAI/intro/wcag.php<br />

W3C<br />

Recommendation<br />

Web Content Accessibility Guidelines 2.0<br />

Following WCAG makes Web content more accessible to the vast majority of<br />

users, including people with disabilities and older users, using many different<br />

devices including a wide variety of assistive technology.<br />

http://www.w3.org/TR/2005/WD-WCAG20-<br />

20051123/<br />

W3C<br />

Recommendation<br />

W3C<br />

Recommendation<br />

XHTML (1.0) W3C (X)HTML Working group http://www.w3.org/TR/xhtml1/,<br />

Working group homepage:<br />

http://www.w3.org/MarkUp/<br />

XML standard W3C XML standards http://www.w3.org/TR/2006/REC-xml-<br />

20060816/<br />

Working group homepage:<br />

http://www.w3.org/XML/<br />

78


CWA <strong>15778</strong>:2008 (E)<br />

Appendix B – Relevant European organisations<br />

Title Subject URL<br />

AAP Open eBook Standards Project and AAP/Andersen<br />

Consulting eBook Study<br />

AXMEDIS (EC Project)<br />

BISG (Book Industry Study Group)<br />

The goal of the AAP Open ebook Standards Project was to recommend standards<br />

and requirements in the areas of Digital Rights Management [ref], Metadata and<br />

Numbering that will enable an open, competitive marketplace for ebook commerce<br />

on a large scale. The intention is to consider all aspects of the burgeoning ebook<br />

marketplace in developing standards and requirements to promote its growth.<br />

AXMEDIS is developing technologies to reduce the costs of digital content<br />

production, distribution and protections. It is an environment where digital content<br />

producers, aggregators and distributors can gain access to a wide range of digital<br />

contents.<br />

A membership-supported, not-for-profit research organisation comprised of<br />

organisations from every sector of the publishing community. Its goal is to provide<br />

accurate and current research information about the industry for its members and<br />

others. BISG produced jointly with the Association of American Publishers and<br />

Book Industry Communication the ONIX International standard.<br />

http://www.publishers.org/digital/index.cfm<br />

http://www.axmedis.org/<br />

http://www.bisg.org/<br />

BrailleNet, France: Digital Document Delivery for the Blind in France http://www.braillenet.org/<br />

DAISY Consortium<br />

DBK, De Braillekrant, Belgium<br />

The DAISY Consortium's mission is to develop the International Standard and<br />

implementation strategies for the production, exchange, and use of Digital Talking<br />

Books in both developed and developing countries, with special attention to<br />

integration with mainstream technology, to ensure access to information for people<br />

with print disabilities.<br />

Private foundation supported by the Katholieke Universiteit Leuven and Sensotec<br />

company, producing daily newspapers in Braille since 1993, in Daisy format since<br />

2003 and in audio format starting April 2007.<br />

http://www.daisy.org/<br />

http://www.braillekrant.be/<br />

DEDICON Netherlands: National Federation of Libraries for the Blind http://www.dedicon.nl/<br />

Digital Media Project<br />

On the policy and legal side, new policies should be determined and legacy policies<br />

should be revised<br />

On the technical side, a DRM platform should be designed offering the following<br />

main features<br />

http://www.chiariglione.org/project/<br />

Dolphin Audio Publishing, United Kingdom: Multimedia solutions for the adaptive technology industries http://www.yourdolphin.com/<br />

79


CWA <strong>15778</strong>:2008 (E)<br />

ENABLES Enhanced Network Accessibility for the Blind<br />

and Visually Impaired<br />

ETSI Task Force STF 286<br />

The specific objectives in this area include:<br />

Developing techniques that will convert existing non-accessible Web contents into<br />

accessible forms according to the user's need.<br />

Investigate multimodal representation for different contents, different applications<br />

and different user disabilities.<br />

Producing guidelines for creating multimodal representation and toolkits for<br />

developers to create accessible Web contents in general and images in particular.<br />

ETSI Human Factors designing access symbols to indicate special services for<br />

disabled users of ICT equipment<br />

http://www.enabledweb.org/AWC.htm<br />

http://www.etsi.org/pressroom/Previous/200<br />

5/2005_05_stf286.htm<br />

EUAIN (EC Project) The EUAIN project (European Accessible Information Network) aims to promote e-<br />

Inclusion as a core horizontal building block in the establishment of the Information<br />

Society by creating a European Accessible Information Network to bring together<br />

the different actors in the content creation and publishing industries around a<br />

common set of objectives relating to the provision of accessible information.<br />

http://www.euain.org/modules/wfsection/<br />

FEP, Belgium: Federation of European Publishers, Brussels. http://www.fep-fee.be/<br />

FORCE, Netherlands:<br />

Independent foundation for education and support for the print impaired in<br />

developing countries<br />

http://www.force-foundation.org.uk/<br />

Institute Integriert Studieren, Austria Austria-wide Institute for Information systems Supporting Print Disabled Students http://www.integriert-studieren.jku.at/<br />

International Digital Publishing Forum (IDPF),<br />

ISO/IEC JTC 1 Special Working Group on Accessibility<br />

standards<br />

National Council For the Blind of Ireland Media<br />

Conversion Service (NCB IMCS)<br />

The International Digital Publishing Forum (IDPF), formerly the Open eBook Forum<br />

(OeBF), is the trade and standards association for the digital publishing industry.<br />

Established by JTC 1 (10/2004) to:<br />

- establish user requirements for accessibility standards<br />

- prepare an inventory of existing accessibility standards & legislation<br />

- Identify gaps (and overlaps) in accessibility standardisation<br />

- Work with standards bodies (ISO, IEC, ITU, CEN, ETSI, etc) to prepare new<br />

standards<br />

The MCS converts information and documents into formats accessible to<br />

people with vision impairments for a range of public and private<br />

clients. The MCS provides 3 specific services:<br />

1. Conversion into Braille and audio formats<br />

2. Audio-description<br />

3. MCS Consultancy and QA Services<br />

4. Braille Training for the Pharmaceutical Sector<br />

http://www.idpf.org/<br />

http://www.jtc1access.org/<br />

www.ncbi.ie<br />

80


CWA <strong>15778</strong>:2008 (E)<br />

OASIS DITA Technical Committee<br />

The purpose of the OASIS DITA Technical Committee (TC) is to define and<br />

maintain the Darwin Information Typing Architecture (DITA) and to promote the use<br />

of the architecture for creating standard information types and domain-specific<br />

markup vocabularies.<br />

http://www.oasisopen.org/committees/tc_home.php?wg_abb<br />

rev=dita<br />

OASIS ODF Technical Committee http://www.oasisopen.org/committees/workgroup.php?wg_a<br />

bbrev=odf-adoption<br />

ONCE, Spain: Spanish National Organisation for the Blind http://www.once.es/<br />

Royal National Institute of the Blind (UK)<br />

New RNIB web site addressing software accessibility<br />

Royal National Institute of the Blind<br />

RNIB<br />

Society for Technical Communication (US)<br />

UK’s leading charity offering information, support and advice to over two million<br />

people with sight problems. The Web Access Centre provides web designers and<br />

managers with the tools and resources needed to plan, build and test accessible<br />

websites. RNIB also offers paid for web accessibility consultancy services including<br />

website audits, advice, presentations and seminars. The directory of accessible<br />

websites lists the sites that have passed the RNIB audit within the past year.<br />

The STC is a US-based international organisation and has a Special Interest<br />

Groups addressing the accessibility of documentation (A-SIG).<br />

http://www.rnib.org.uk/xpedio/groups/public/<br />

documents/publicwebsite/public_sachome.h<br />

csp<br />

www.rnib.org.uk<br />

http://www.stcsig.org/sn/index.shtml<br />

SUTII, Poland: University Technical Research Department<br />

TechDis (a JISC Advisory Service)<br />

Technologie-Zentrum Informatik, University Bremen,<br />

Germany:<br />

Web Accessibility Business Case Documents<br />

Web Standards Project<br />

The mission of the JISC TechDis Service is to support the education sector in<br />

achieving greater accessibility and inclusion by stimulating innovation and providing<br />

expert advice and guidance on disability and technology. The TechDis website<br />

features many aids for accessible document processing, including the Accessibility<br />

Essentials guides to inclusive use of Microsoft (R) Word and Adobe (R) PDFs.<br />

Institute at the University Bremen with the main aim to develop cutting edge<br />

technologies in computer science and engineering and transfer those into practice.<br />

Special focus on multimedia content accessibility and presentation<br />

The Web Accessibility Initiative (WAI) <strong>Education</strong> and Outreach Working Group<br />

(EOWG). "Developing a Web Accessibility Business Case for Your Organisation."<br />

describes social, technical, financial, legal and policy aspects of Web accessibility.<br />

It is designed to help organisations develop their own customised business case for<br />

Web accessibility. It provides text that can be used as is, as well as guidance on<br />

identifying the most relevant factors for a specific organisation.<br />

The Web Standards Project is a grassroots coalition fighting for standards that<br />

ensure simple, affordable access to web technologies for all.<br />

http://www.techdis.ac.uk<br />

http://www.tzi.de/<br />

http://www.w3.org/WAI/bcase/<br />

http://www.w3.org/WAI/<br />

http://www.webstandards.org/<br />

81


CWA <strong>15778</strong>:2008 (E)<br />

Appendix C – Sustainability: network of interested parties for ongoing support and further development<br />

Name Enterprise Country Function Email address<br />

P. Abrahams Bloor Research United Kingdom Practice leader Accessibility<br />

&Usability<br />

peter.abrahams@bloor-research.com<br />

Mr. A. Arch Online Accessibility Consulting Vision Australia Manager andrew.arch@nils.org.au<br />

Mr. H. Aspelund<br />

Directorate for Health and Social Affairs, The<br />

Delta Centre<br />

Norway Adviser ICT Haakon.Aspelund@shdir.no<br />

haa@shdir.no<br />

Mr. S. Ball JISC TechDis Service United Kingdom Senior Advisor simon@techdis.ac.uk<br />

Mrs. A. Bergman-Tahon Federation of European Publishers Belgium Director abergman@fep-fee.be<br />

Mr. H. Bjarnø Visual Impairment Knowledge Centre Denmark hbj@visinfo.dk<br />

Mrs. T. Bogner Institut Integriert Studieren Austria tanja.bogner@jku.at<br />

L. Bowick Ministere de l’Agriculture et de la Peche France Reseaux et telecommunications lisa.bowick@educagri.fr<br />

Mr. M. Brauer Sun Microsystems Germany Technical Architect Software<br />

Engineering<br />

Michael.brauer@sun.com<br />

Mrs. Lino Brundu Alitha of Milan Italy Chairman Lino.Brundu@libero.it<br />

Mr. Bruno bruno@vonniman.com<br />

Mr. D. Burger Association BrailleNet France Dominique.Burger@snv.jussieu.fr<br />

82


CWA <strong>15778</strong>:2008 (E)<br />

Name Enterprise Country Function Email address<br />

Mrs. J. Clark Jenny.Clark@rnib.org.uk<br />

Mr. M. Cooper Web Accessibility Specialist Web accessibility specialist cooper@w3.org<br />

Mr. D. Crombie DEDICON Netherlands Head International Projects dcrombie@dedicon.nl<br />

Mrs. J. Darzentas University of the Aegean Greece jennyd@aegean.gr<br />

Mr. D. Day OASIS DITA Technical Committee IBM Lead DITA Architect dond@us.ibm.com<br />

Mr. A. Egger University of Applied Sciences Austria alexander.egger@campus02.at<br />

Mr. J. Engelen Katholieke Universiteit Leuven Belgium jan.engelen@esat.kuleuven.be<br />

Mrs. B. Fanning AIIM USA Director BFANNING@AIIM.ORG<br />

Mr. M. Ford Martin Ford Consultancy Italy Consultant martin.ford@libero.it<br />

Mr. T. Fraser Monotype Imaging Ltd United Kingdom Finance Director timothy.fraser@monotypeimaging.co.uk<br />

Mr. dr. J. Friedrich IBM Germany Germany Program Manager ICT<br />

Standardization<br />

jochen@de.ibm.com<br />

Mr. J. Garner INCITS/Information Technology Industry Council USA jgarner@itic.org<br />

Cerys Giddings IBM UK Ltd UK cgiddin@uk.ibm.com<br />

Mrs. K. Grant Kate@ninetiles.com<br />

Mr. C. Gravenhorst claus.gravenhorst@ccs-gmbh.de<br />

83


CWA <strong>15778</strong>:2008 (E)<br />

Name Enterprise Country Function Email address<br />

Mr. A. Haffner Technische Universität Dresden Germany Professur Mensch-Maschine-<br />

Kommunikation<br />

alexander.haffner@inf.tu-dresden.de<br />

Mrs. Mayumi Handa RiverDocs Ireland Test Manager mayumi.handa@riverdocs.com<br />

A.K. Heath Axelrod Access For All United Kingdom andyheath@axelrod.plus.com<br />

Mr. S. Herramhof Sandor.Herramhof@jku.at<br />

Mr. R. Hodgkinson Institute of Scientific and Technical<br />

Communicators (UK)<br />

United Kingdom Consultant Richard_Hodgkinson@btinternet.com<br />

Mr. M. Horstmann TZI, University Bremen Germany mir@tzi.de<br />

Mr. A. Houser arh@groupwellesley.com<br />

Mr. Dr. G. Ioannidis IN2 and TZI - University. Bremen Germany Director gi@in-two.com and<br />

george.ioannidis@tzi.de<br />

Mr. H. Janczikowski hartmut.janczikowski@ccs-gmbh.de<br />

Mr. G. Kerscher kerscher@montana.com<br />

Mr. S. Klironomos FORTH/ ICS Greece Secretariat manager arh@groupwellesley.com<br />

Mr. M. Koettstorfer marco.koettstorfer@jku.at<br />

Ms. MIRA KOIVUSILTA STAKES, R&D Centre for Health and Welfare Finland Mira.Koivusilta@stakes.fi<br />

Mr. P. Korn Sun Microsystems United States Accessibility Architect peter.korn@sun.com<br />

84


CWA <strong>15778</strong>:2008 (E)<br />

Name Enterprise Country Function Email address<br />

Mr. N. Kovacs DIN Committee Information Technology Germany NIKOLAUS.KOVACS@DIN.DE<br />

Mr. F.J. Martinez-Calvo ONCE Spain fmc@once.es<br />

Dr. Thomas Kahlisch Deutsche Zentralbücherei für Blinde zu Leipzig Germany Director thomas.kahlisch@dzb.de<br />

K. Lindelien Standards Norway Norway KLi@standard.no<br />

Mrs. L McNamee Texthelp Systems Northern Ireland Marketing manager louise@texthelp.com<br />

Mr. S. McGrenery CTO Ireland seamus.mcgrenery@bigriverdocs.com<br />

Mr. N. McKenzie DEDICON Netherlands nmackenzie@dedicon.nl<br />

Mr. D. Mann RNIB david.mann@rnib.org.uk<br />

Mrs. M. McRae OASIS-OPEN mary.mcrae@oasis-open.org<br />

Mr. C. Menezes UNESCO France Senior programme specialist c.menezes@unesco.org<br />

Mr. F. Middelkoop DEDICON fmiddelkoop@fnb.nl<br />

Mr. Dr. K. Miesenberger University of Linz Austria klaus.miesenberger@jku.at<br />

Mr. J. O Connor NCBI (National Council For The Blind Of Ireland) Ireland Web Accessibility Consultant joshue.oconnor@ncbi.ie<br />

Mr. H. O’Neill Central Remedial Clinic Ireland Project Coordinator honeill@crc.ie<br />

Mr. R. Orme RNIB Richard.Orme@rnib.org.uk<br />

P. Permezel Cosmosbay-Vectis France Chef de project ppermezel@cosmosbay-vectis.com<br />

85


CWA <strong>15778</strong>:2008 (E)<br />

Name Enterprise Country Function Email address<br />

Ms. C. Pollitt National Library for the Blind United Kingdom carol.pollitt@nlbuk.org<br />

Mrs. F. Preteux francoise.preteux@int-evry.fr<br />

Mr. K. Richter klaus.richter@bam.de<br />

Mr. Roel van Gils roel@anysurfer.be<br />

Mr. Dr. R. Romero Fundacion SIDAR Spain rafael@rafaelromero.com<br />

Mr. R. Ruemer University of Linz Austria reinhard.ruemer@jku.at<br />

Mrs. A. Salaun European Commission Belgium anne.salaun@cec.eu.int<br />

S. Schotel DEDICON Netherlands sschotel@dedicon.nl<br />

Mr. Shadi Abou-Zahra W3C Web Accessibility Initiative France Web Accessibility Specialist for<br />

Europe<br />

shadi@w3.org<br />

Ms. S. Sollat C-LHISTOIRE Digital-Prod France Executive Producer dmz@c-lhistoire.com<br />

Mr. F. van Stek DEDICON fvstek@fnb.nl<br />

Mr. C. Stephan cstephan@wxs.nl<br />

Mr. M. Straat Adobe mstraat@adobe.com<br />

Mr. C. Strobbe K.U.Leuven - Departement of Electrical<br />

Engineering - Research Group on Document<br />

Architectures<br />

Belgium Christophe.Strobbe@esat.kuleuven.be<br />

86


CWA <strong>15778</strong>:2008 (E)<br />

Name Enterprise Country Function Email address<br />

Paivi Tahkokallio STAKES, R&D Centre for Health and Welfare Finland paivi.tahkokallio@stakes.fi,<br />

Mr. D. Taylor Lightning Source UK Ltd United Kingdom Managing Director David.Taylor@lightningsource.com<br />

Mr. Malte Timmermann Sun Microsystems GmbH Germany Technical Architect Software<br />

Engineering<br />

Malte.Timmermann@sun.com<br />

Mr. T. Tontchev toni.tontchev@dzb.de<br />

Mr. D. Tucker FORCE Foundation Netherlands dtucker@f-force.nl<br />

Mr. S. Tyler RNIB steve.tyler@rnib.org.uk<br />

Mr. L. Van den Berghe<br />

CEN/ISSS - Information Society Standardization<br />

System<br />

Belgium Workshop manager luc.vandenberghe@<strong>cen</strong>orm.be<br />

Mr. C. Walinn Danish National Library for the Blind Denmark <strong>cwa</strong>@dbb.dk<br />

Mrs. M. White mandy.white@rnib.org.uk<br />

Mr. R. Winiarsczyk Silesian University of Technology Poland head of research group Ryszard.Winiarczyk@polsl.pl<br />

Mr. T. Worthington tom.worthington@tomw.net<br />

Mr. J. Worsfold Dolphin Computer Access United Kingdom john.worsfold@dolphinuk.co.uk<br />

Mr. W. Wünschmann ww2@inf.tu-dresden.de<br />

Mr. Dr. J. Rietveld Netherlands Standardization institute Netherlands Standardization Consultant jan.rietveld@NEN.nl<br />

87


CWA <strong>15778</strong>:2008 (E)<br />

Appendix D – Abbreviations List<br />

Abbreviation<br />

Text<br />

AAATE<br />

Association for the Advancement of Assistive Technology in Europe<br />

AIIM<br />

The Enterprise Content Management Association<br />

AIP<br />

Accessible Information Processing<br />

ANSI<br />

American National Standards Institute<br />

ASCII<br />

American Standard Code for Information Interchange<br />

CDDA<br />

Compact Disk Digital Audio<br />

CD-ROM<br />

Compact Disc read-only memory<br />

CEN<br />

COMITÉ EUROPÉEN DE NORMALISATION<br />

CMS<br />

Content Management System<br />

CSS<br />

Cascading Style Sheets<br />

CWA<br />

CEN Workshop Agreement<br />

DAISY<br />

Digital Accessible Information System<br />

DATSCG<br />

Design for All and Assistive Technologies Standardization Co-ordination Group<br />

DEXA<br />

Database and Expert Systems Applications<br />

DITA<br />

Darwin Information Typing Architecture<br />

DPA<br />

Document Processing for Accessibility<br />

DRM<br />

Digital Rights Management<br />

DTB<br />

Digital Talking Book<br />

DTD<br />

Document Type Definitions<br />

DTP<br />

Desktop publishing<br />

EARL<br />

Evaluation and Report Language<br />

EC<br />

European Commission<br />

ECM<br />

Enterprise Content Management<br />

EDeAN<br />

European Design for All e-Accessibility Network<br />

EIDD<br />

European Institute for Design and Disability<br />

88


CWA <strong>15778</strong>:2008 (E)<br />

ETSI<br />

European Telecommunications standards Institute<br />

EU<br />

European Union<br />

EUAIN<br />

European accessible information network<br />

EXD<br />

Encrypted XML document<br />

GNOME<br />

GNU Object Model Environment<br />

GNU<br />

a recursive acronym for GNU's Not Unix<br />

HTML<br />

Hypertext Markup Language<br />

ICCHP<br />

International Conference on Computers Helping People with Special Needs<br />

ICS<br />

International Classification for Standards<br />

ICTSB<br />

Information Communication Technologies Standards Board<br />

IEC<br />

International Electrotechnical Commission<br />

IEEE<br />

Institute of Electrical and Electronics Engineers<br />

IMS<br />

IMS Global Learning Consortium<br />

ISO<br />

International Organization for Standardization<br />

ISSS<br />

Information Society Standardization System<br />

JAWS<br />

Job Access With Speech<br />

JTC 1 Joint Technical Committee 1<br />

KDE<br />

K Desktop Environment<br />

LaTeX<br />

LaTeX is a document preparation system for high-quality typesetting.<br />

LCNS<br />

Lecture Notes in Computer Science<br />

LIP<br />

Learner Information Package<br />

MARC21<br />

Machien-Readable Cataloging<br />

MathML<br />

Mathemtatical Markup Language<br />

METS<br />

Metadata Encoding and Transmission Standard<br />

MIME<br />

Multipurpose Internet Mail Extensions<br />

MP3 MPEG-1 Audio Layer 3<br />

MPEG<br />

Moving Picture Experts Group<br />

NIMAS<br />

National Instructional Materials Accessibility Standard<br />

89


CWA <strong>15778</strong>:2008 (E)<br />

NISO<br />

Navigation Control Centre<br />

OASIS<br />

Organization for the Advancement of Structured Information Standards<br />

OCR<br />

Optical character recognition<br />

ODF<br />

OpenDocument Format<br />

OECD<br />

Organisation for Ecomic Co-operation and Development<br />

PC<br />

Personal Computer<br />

PDF<br />

Portable Document Format<br />

RDF<br />

Resource Description Framework<br />

RNIB<br />

Royal National Institute of the Blind<br />

RTD<br />

Research and Technology Development<br />

RTF<br />

Rich Text Format<br />

SC<br />

Subcommittee<br />

SCORM<br />

Sharable Content Object Reference Model<br />

SGML<br />

Standard Generalised Markup Language<br />

SME<br />

Small and medium enterprises<br />

SMIL<br />

Synchronized Multimedia Integration Language<br />

STM<br />

Scientific Technical Medical<br />

TC<br />

Technical Committee<br />

TC/HF<br />

Technical committee Human Factors<br />

TEI<br />

Text Encoding Initiative<br />

TeX<br />

TeX is a typesetting language.<br />

US<br />

United States<br />

USB<br />

Umoversa; Serial Bus<br />

VHS<br />

Video Home System<br />

VPAT<br />

Voluntary Product Accessibility Template<br />

W3C<br />

World Wide Web Consortium<br />

WAI<br />

Web Accessibility Initiatieve<br />

WAV<br />

short for Waveform<br />

90


CWA <strong>15778</strong>:2008 (E)<br />

WCAG<br />

Web Content Accessibility Guidelines<br />

WCT<br />

WIPO Copyright Treaty<br />

WG<br />

Working Group<br />

WIPO<br />

World Intellectual Property Organization<br />

WPPT<br />

WIPO Performances and Phonograms Treaty<br />

XHTML<br />

Extensible Hypertext Markup Language<br />

XML<br />

Extensible Markup Language<br />

XSLT<br />

XSL Transformations<br />

91

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!