cen workshop agreement cwa 15778 - Higher Education Academy
cen workshop agreement cwa 15778 - Higher Education Academy
cen workshop agreement cwa 15778 - Higher Education Academy
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
CEN<br />
WORKSHOP<br />
CWA <strong>15778</strong><br />
February 2008<br />
AGREEMENT<br />
ICS 35.240.30<br />
English version<br />
Document Processing for Accessibility<br />
This CEN Workshop Agreement has been drafted and approved by a Workshop of representatives of interested parties, the constitution of<br />
which is indicated in the foreword of this Workshop Agreement.<br />
The formal process followed by the Workshop in the development of this Workshop Agreement has been endorsed by the National<br />
Members of CEN but neither the National Members of CEN nor the CEN Management Centre can be held accountable for the technical<br />
content of this CEN Workshop Agreement or possible conflicts with standards or legislation.<br />
This CEN Workshop Agreement can in no way be held as being an official standard developed by CEN and its Members.<br />
This CEN Workshop Agreement is publicly available as a reference document from the CEN Members National Standard Bodies.<br />
CEN members are the national standards bodies of Austria, Belgium, Bulgaria, Cyprus, Czech Republic, Denmark, Estonia, Finland,<br />
France, Germany, Greece, Hungary, Iceland, Ireland, Italy, Latvia, Lithuania, Luxembourg, Malta, Netherlands, Norway, Poland, Portugal,<br />
Romania, Slovakia, Slovenia, Spain, Sweden, Switzerland and United Kingdom.<br />
EUROPEAN COMMITTEE FOR STANDARDIZATION<br />
COMITÉ EUROPÉEN DE NORMALISATION<br />
EUROPÄISCHES KOMITEE FÜR NORMUNG<br />
Management Centre: rue de Stassart, 36<br />
B-1050 Brussels<br />
© 2008 CEN All rights of exploitation in any form and by any means reserved worldwide for CEN national Members.<br />
Ref. No.:CWA <strong>15778</strong>:2008 D/E/F
CWA <strong>15778</strong>:2008 (E)<br />
Contents<br />
Page<br />
Foreword .............................................................................................................................................................4<br />
1 Introduction............................................................................................................................................5<br />
1.1 Scope ......................................................................................................................................................5<br />
1.2 Purpose ..................................................................................................................................................5<br />
1.3 Formal liaisons ......................................................................................................................................6<br />
1.4 Target audience .....................................................................................................................................6<br />
1.5 Opening new markets ...........................................................................................................................7<br />
2 Standards for document processing for accessibility ......................................................................8<br />
2.1 Problem Statement................................................................................................................................8<br />
2.2 Standards and structures .....................................................................................................................9<br />
2.3 Current situation..................................................................................................................................10<br />
2.4 Current state of the art........................................................................................................................11<br />
2.5 Current problems.................................................................................................................................14<br />
2.6 Influencing factors in Document Processing for Accessibility ......................................................15<br />
3 Formats for document processing ....................................................................................................18<br />
3.1 Printed paper........................................................................................................................................18<br />
3.2 Printed Braille ......................................................................................................................................19<br />
3.3 Audio.....................................................................................................................................................20<br />
3.4 ASCII Text.............................................................................................................................................21<br />
3.5 HTML documents.................................................................................................................................21<br />
3.6 XML .......................................................................................................................................................22<br />
3.7 Multi-type composite formats ............................................................................................................24<br />
4 Considerations for structuring documents ......................................................................................27<br />
4.1 Define and use document style guidelines.......................................................................................27<br />
4.2 Define and use structure guidelines..................................................................................................27<br />
4.3 Edit / add structure where needed.....................................................................................................28<br />
4.4 Edit DRM settings................................................................................................................................28<br />
4.5 Adaptation ............................................................................................................................................28<br />
5 Conversion processes ........................................................................................................................29<br />
5.1 Convert Multimedia Material to structured Multimedia Material.....................................................29<br />
5.2 Convert structured Multimedia Material to XML...............................................................................30<br />
5.3 Convert Multimedia Material to XML..................................................................................................30<br />
5.4 Convert traditional print to XML.........................................................................................................30<br />
5.5 Convert DTP to XML............................................................................................................................31<br />
5.6 Convert XML to Print...........................................................................................................................31<br />
5.7 Convert XML to Braille ........................................................................................................................31<br />
5.8 Convert XML to Large Print ................................................................................................................32<br />
5.9 Convert XML to HTML .........................................................................................................................32<br />
5.10 Convert XML to structured Multimedia Material...............................................................................32<br />
5.11 Convert DTP to Multimedia Material..................................................................................................33<br />
5.12 Convert Audio to structured Audio ...................................................................................................33<br />
5.13 Convert XML to XML ...........................................................................................................................33<br />
6 S<strong>cen</strong>arios introducing accessibility within publishing workflows.................................................35<br />
6.1 S<strong>cen</strong>ario 1 - Delivering XML files.......................................................................................................36<br />
6.2 S<strong>cen</strong>ario 2 - Accessibility enhancement in general.........................................................................38<br />
6.3 S<strong>cen</strong>ario 3 - Increasing web accessibility.........................................................................................40<br />
6.4 S<strong>cen</strong>ario 4 - Accessibility policy........................................................................................................42<br />
6.5 S<strong>cen</strong>ario 5 - Spoken documents for everyone .................................................................................43<br />
6.6 S<strong>cen</strong>ario 6 - Accessible and protected PDFs ...................................................................................45<br />
2
CWA <strong>15778</strong>:2008 (E)<br />
6.7 S<strong>cen</strong>ario 7 - Working hand in hand ...................................................................................................47<br />
6.8 S<strong>cen</strong>ario 8 - Accessible design..........................................................................................................49<br />
6.9 S<strong>cen</strong>ario 9 - Accessibility on a large scale .......................................................................................51<br />
6.10 S<strong>cen</strong>ario 10 - What authors can do....................................................................................................52<br />
6.11 S<strong>cen</strong>ario 11 - Repair and adaptation .................................................................................................54<br />
6.12 Common s<strong>cen</strong>ario requirements .......................................................................................................56<br />
6.13 Specific s<strong>cen</strong>ario requirements .........................................................................................................56<br />
7 Application-oriented s<strong>cen</strong>ario implementation ................................................................................56<br />
7.1 Harry Potter and the RNIB ..................................................................................................................56<br />
7.2 Magazine and Newspaper distribution in the Netherlands .............................................................57<br />
7.3 Time Warner and Dolphin Audio Publishing ....................................................................................58<br />
7.4 <strong>Education</strong>al publishing in Austria .....................................................................................................59<br />
7.5 Best practice for distributing accessible content............................................................................61<br />
8 Identified gaps and areas for further research.................................................................................66<br />
8.1 Descriptions & Requirements ............................................................................................................66<br />
8.2 Process & Content Modelling.............................................................................................................66<br />
8.3 Introducing and using metadata for accessibility purposes ..........................................................67<br />
8.4 Standards and personalisation of content .......................................................................................68<br />
8.5 Li<strong>cen</strong>sing and technical protection measures .................................................................................69<br />
9 Conclusion and future work ...............................................................................................................70<br />
Appendix A – Relevant standards ..................................................................................................................72<br />
Appendix B – Relevant European organisations ..........................................................................................79<br />
Appendix C – Sustainability: network of interested parties for ongoing support and further<br />
development.........................................................................................................................................82<br />
Appendix D – Abbreviations List ....................................................................................................................88<br />
3
CWA <strong>15778</strong>:2008 (E)<br />
Foreword<br />
The decision for this CEN Workshop Agreement (CWA) was taken by the DPA Workshop at its kick-off meeting on 13<br />
May 2005.<br />
This CWA provides a first elaboration on how the accessibility of publishing content can be enhanced by altering existing<br />
publishing workflows and introducing accessibility considerations where appropriate. For reaching this goal, in each step<br />
where accessibility is introduced, relevant formats and conversions are detailed out, as well new workflow items<br />
described.<br />
The document has been developed through the collaboration of a number of contributing partners such as publishers,<br />
blind organizations and universities. The names of the individuals and their affiliations that have expressed support for<br />
this CWA are available from the CEN/ISSS Secretariat.<br />
The final draft of this CWA was put on CEN’s web site for a 60 days period of external comments between 06 July and 10<br />
September 2007.<br />
The formal process followed by the Workshop in the development of the CEN Workshop Agreement has been endorsed<br />
by the National Members of CEN but neither the National Members of CEN nor the CEN Management Centre can be<br />
held accountable for the technical content of the CEN Workshop Agreement or possible conflict with standards or<br />
legislation. This CEN Workshop Agreement can in no way be held as being an official standard developed by CEN and<br />
its members.<br />
The final review/endorsement round for this CWA was successfully closed on 2007-09-18.The final text of this CWA was<br />
submitted to CEN for publication on 2007-11-30.<br />
This CEN Workshop Agreement is publicly available as a reference document from the National Members of CEN:<br />
AENOR, AFNOR, ASRO, BDS, BSI, CSNI, CYS, DIN, DS, ELOT, EVS, IBN, IPQ, IST, LVS, LST, MSA, MSZT, NEN,<br />
NSAI, ON, PKN, SEE, SIS, SIST, SFS, SN, SNV, SUTN and UNI.<br />
Comments or suggestions from the users of the CEN Workshop Agreement are welcome and should be addressed to<br />
the CEN Management Centre.<br />
4
CWA <strong>15778</strong>:2008 (E)<br />
1 Introduction<br />
1.1 Scope<br />
Given the widespread adoption of ICT within the publishing industries, there is a general interest in the creation and<br />
provision of well-formatted digital documents. For those people who are dependent on accessible information, this<br />
interest is of <strong>cen</strong>tral importance, and it is this convergence of interests that stimulated the creation of this Workshop. The<br />
WS/DPA has examined some of the ways in which this convergence is helping to build consensus and create new<br />
strategies and technologies for the provision of information in formats that are more accessible for everyone.<br />
In the real world, publishers rely on accessibility experts and consider accessible information only at the end of the<br />
content production chain. This requires considerable amount of efforts to make information accessible for everyone and it<br />
is a very hard problem to tackle. This <strong>workshop</strong> introduces accessibility as a design element in the content production<br />
and provides guidelines and best practices how more accessible documents can be produced. Another important issue is<br />
that the user requirements for accessible information are not well defined. In this work, we therefore base the elaboration<br />
on publishing use cases and s<strong>cen</strong>arios that have been derived together with publishers in order to analyse at least partly<br />
the user requirements. Additionally those s<strong>cen</strong>arios provide specific examples of accessible information provision as an<br />
entry point to publishing stakeholders.<br />
Sustaining the provision of useful services and meaningful accessible content can be considered vital to the growth of<br />
the Information Society as a whole. When designing, specifying and building applications and infrastructures to store<br />
accessible content, several apparently unrelated issues arise. How do we describe the knowledge and capabilities we<br />
possess and capture the repository of resources we can use to implement these capabilities? How do we describe the<br />
questions and problems of end users and content providers? How do we marry both within manageable and consistent<br />
frameworks? How do we re-apply this knowledge and combine these resources with new insights to solve new<br />
problems? How can we accelerate the processes described above and provide solutions to enable accessible<br />
information processing?<br />
The DPA Workshop (CEN Workshop on Document Processing for Accessibility) brought together some of the key players<br />
working in the fields of publishing and accessibility. The topics addressed ranged from generic document and knowledge<br />
structures, through all aspects of accessible document processing, to Digital Rights Management and copyright issues.<br />
Perhaps the most striking aspect was the level of convergence between the needs of accessibility communities and<br />
those of content creators and providers. Indeed, with the introduction of accessibility from scratch, the information needs<br />
of all consumers are better served, particularly as content providers seek innovative solutions for re-aggregating their<br />
content for new marketplaces.<br />
1.2 Purpose<br />
The DPA Workshop as detailed in its business plan has the following objectives:<br />
<br />
<br />
<br />
To bring together all the players in the information provision and e-publishing chain in order to<br />
achieve the critical mass significantly to enhance the provision of accessible information at a<br />
European level.<br />
To provide guidelines needed on integrating accessibility approaches and workflows within the<br />
document management and publishing process rather than as just a specialised additional service.<br />
To raise awareness and stimulate the adoption at local, regional, national and European levels of the<br />
emerging formats and standards for the provision of accessible information and to find ways of<br />
ensuring that technological protection measures do not inadvertently impede legitimate access to<br />
information by people with print impairments<br />
Based on those objectives this document:<br />
<br />
<br />
<br />
describes the outcomes from the DPA Workshop activities<br />
provides an elaboration of relevant standards and their possible use in the publishing sector<br />
examines the different formats required for accessible information provision<br />
5
CWA <strong>15778</strong>:2008 (E)<br />
<br />
<br />
<br />
<br />
provides a systematic overview of relevant conversion processes and related structured information<br />
activities<br />
examines possible s<strong>cen</strong>arios of use within the publishing sector<br />
provides real-life case studies and instances of best practice<br />
identifies areas for further research and systematisation<br />
1.3 Formal liaisons<br />
The Workshop has been initiated and supported by the EUAIN Project. The EUAIN network is a co-ordination action 1 cofunded<br />
by the INFSO DG of the European Commission within the RTD activities of the Thematic Priority Information<br />
Society Technologies of the 6th Framework Programme. The EUAIN project, co-ordinated by DEDICON Amsterdam,<br />
aims to promote e-Inclusion as a core horizontal building block in the establishment of the Information Society by<br />
creating a network to bring together the different actors in the content creation and publishing industries around a<br />
common set of objectives relating to the provision of accessible information. Accessibility for print impaired people can be<br />
an increasingly integrated component of the document management and publishing process and should not be a<br />
specialised or additional service.<br />
The DPA Workshop has established the following liaisons with relevant activities to its work programme. Liaisons<br />
established:<br />
AIIM PDF Access Working Group<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
CEN/ISSS Data Protection & Privacy Workshop<br />
CEN/ISSS Learning Technologies Workshop<br />
CEN/ISSS Dublin Core Metadata Workshop<br />
COST 219 TER<br />
EDeAN European Design for All e-Accessibility Network<br />
ETSI TC/HF ETSI Technical committee Human Factors<br />
ICTSB/DATSCG Design for All and Assistive Technologies Standardisation Co-ordination Group<br />
OASIS TCs concerning ODF and DITA<br />
W3C Web Accessibility Initiative (WAI)<br />
DAISY (NISO z39.86)<br />
ISO/IEC JTC 1 Special Working Group on Accessibility<br />
ISO/IEC JTC 1/SC 29 Coding of audio, picture, multimedia and hypermedia information<br />
ISO/IEC JTC 1/SC 7/WG 2 Software and Systems Documentation<br />
1.4 Target audience<br />
This CWA addresses accessibility in the publishing value chain and examines ways to introduce and enhance<br />
accessibility of publishing content inside publishing workflows. The intended audience includes actors and stakeholders<br />
within the publishing value chain (publishers, authors, content producers and distributors, publishing system developers<br />
and vendors) and the content accessibility area (specialised libraries, accessibility consultants, and accessible system<br />
developers and vendors)<br />
The CWA aims to provide a first elaboration on how the accessibility of publishing content can be enhanced by altering<br />
existing publishing workflows and introducing accessibility considerations where appropriate. For reaching this goal, in<br />
1 Contract number 511497, DG INFSO, EC, FP6. See http://www.euain.org<br />
6
CWA <strong>15778</strong>:2008 (E)<br />
each step where accessibility is introduced, relevant formats and conversions are detailed out, as well new workflow<br />
items described.<br />
The primary target group of this CWA are actors in the publishing value chain. The document is structured in a form that<br />
enables publishers to get quick insights on what they need to do in order to produce accessible content. For that reason<br />
several specific s<strong>cen</strong>arios are provided that, although not exhaustive, can serve as an entry point for publishers in their<br />
accessible content implementations.<br />
1.5 Opening new markets<br />
Accessible information is not a special type of information aimed at a specific group of a certain population. Accessible<br />
information is information that can be accessed by anyone, with or without a disability, aimed at a general market where<br />
anyone interested is a possible customer. Structured information is the first step in the accessible information process. A<br />
document whose internal structure can be defined and its elements isolated and classified, without losing sight of the<br />
overall structure of the information, is a document that can be navigated.<br />
Most adaptive technology allows the user to access a document, and to read it following the "outer" structure of the<br />
original. But if the same information has also an "inner" structure that allows the adaptive device to distinguish between a<br />
phrase and a measure, between a paragraph and a sentence, highlighting particular annotations, then the level of<br />
accessibility (and therefore usability) of the whole document will be greatly enhanced, allowing the user to move through<br />
it in the same way as those without impairments do when looking at a printed document, and following the same integral<br />
logic.<br />
In an ideal world, all documents made available in electronic formats should contain that internal structure that benefits<br />
everyone. Highly-structured documents are becoming more and more popular due to reasons that very seldom pertain to<br />
making it accessible to persons with disabilities. The move to XML related formats and associated standards for<br />
metadata have provided an impetus for far greater document structuring than before. Whatever the reasons behind those<br />
decisions are, the use of highly-structured information is of great benefit to anybody accessing them for any purpose.<br />
In re<strong>cen</strong>t years, the market for accessibility and assistive technologies has started to gain recognition. It is clear that the<br />
integration of accessibility notions into mainstream technologies would provide previously unavailable opportunities in the<br />
provision of accessible multimedia information systems. It would open up modern information services and provide them<br />
to all types and levels of users, in both the software and the hardware domain. Additionally, new consumption and<br />
production devices and environments can be addressed from such platforms and this would provide very useful<br />
information provision opportunities indeed, such as information on mobile devices with additional speech assistance.<br />
It is equally clear that we remain at the very beginning of the move to incorporate accessibility within mainstream content<br />
processing environments.<br />
7
CWA <strong>15778</strong>:2008 (E)<br />
2 Standards for document processing for accessibility<br />
2.1 Problem Statement<br />
Accessible solutions are required for anyone who requires assistance in using the mainstream solution. This<br />
could be because a user is blind, visually impaired, or impaired in some other way, and the term printimpaired<br />
is often used in this context. Accessible solutions range from small assistive applications, (such as<br />
screen magnifiers) to full scale operating systems and screen reading environments. The traditional problem<br />
with accessible solutions is that they are normally implemented as an afterthought or a piggy-back solution.<br />
This results in solutions that are not fully integrated (or not well integrated) with the mainstream solutions.<br />
These independent applications are then at a disadvantage whenever software versions or operating systems<br />
are updated. In order to make this integration process easier, and provide more intuitive designs for the future,<br />
it is essential that “design for all” and accessible design methodologies are widespread. Standard, policy and<br />
legislation also helps ensure that accessible designers have a solid standard to meet to ensure futureproofing<br />
2 .<br />
Notions of “accessibility” are normally equated with the adaptation and conversion of digital content, where this content<br />
can be made available. On a European level, and indeed often on a national level, much of the existing expertise on<br />
creating accessible adaptations of digital content is of a highly distributed nature. Within specialist organisations<br />
supporting print impaired people; or within university research laboratories; or indeed within publishing houses, many<br />
automated tools have been designed and implemented at least partially to execute the necessary adaptation procedures.<br />
However, each automated tool has its own, highly specific, field of application. Furthermore, the knowledge required to<br />
build these very specific tools is equally distributed, so that there is currently very little re-use of either tools or knowledge.<br />
The content provider’s perspective on digitisation is further complicated by security issues. In the modern environment<br />
driven by the internet for content dissemination, security is a vital issue for publishers. DRM is a complex problem for all<br />
content holders. Every publisher’s content, client base and requirements are different, which often results in a<br />
personalised set of requirements for each case. As a result, <strong>agreement</strong>s on accessibility are often negotiated on a caseby-case<br />
basis. Naturally, publishers have to be confident that any digital format is being delivered through secure<br />
gateways to only the people who are intended to receive it.<br />
Accessibility can also be viewed from a wider angle. Being able to see content in whatever modality; perceive its context;<br />
and attach a useful meaning to it requires that the user be able to access this content, its context and relevant software<br />
application in a way that meets that particular user's consumption preferences. These preferences may become<br />
requirements over time - we all get older. Being able to attach useful meanings to content is what lies at the very basis of<br />
the preservation and education of thought. Attaching useful meanings to content underpins the basis of culture,<br />
commerce and civilisation. Being able to access software and the content and the potential for understanding it<br />
unleashes, requires us to be able to gain access to software and not be hindered by huge costs, complexity, lack of<br />
support and additional barriers.<br />
Given the differences between the traditional approach to accessibility and the wider view outlined above, we are in<br />
something of a transitional phase at this time. From the software producer, business community and the Open Source<br />
System community we see a move towards the inclusion of accessibility features into systems, tools and the<br />
programming languages themselves as system wide core functionalities (examples being KDE 3 , GNOME 4 , and Java<br />
Accessibility 5 . From the accessibility community we see a move towards more advanced and abstract descriptions of the<br />
procedures involved in moving from 'common' content towards content that is processed to be granted accessible<br />
certification. A good example of such a move is the Web Content Accessibility Guidelines 1.0 6 and 2.0 7 , which provide<br />
2 See, for example, the work of EIDD-Design For All Europe, http://www.design-for-all.org<br />
3 KDE Accessibility Project, http://accessibility.kde.org/<br />
4 GNOME Accessibility Project, http://developer.gnome.org/projects/gap/GNOME-Accessibility.html<br />
5 JAVA Accessibility, http://java.suncom/j2se/1.4.2/docs/guide/access/<br />
6 http://www.w3.org/TR/WCAG10/<br />
8
CWA <strong>15778</strong>:2008 (E)<br />
detailed guidelines on how to (re)structure and enhance websites and their content to ensure a sufficient level of<br />
accessibility.<br />
The transitional stage described above involves relatively slow change when compared with general exhilarating<br />
technological developments. However, this relatively slow pace also creates an opportunity to take a step back and<br />
observe all the individual processes that touch upon the notion of accessibility. This allows us to explicate similarities and<br />
possible complementarities, a process of convergent gradualism if you like. The opportunity then arises to synchronise<br />
various efforts in the accessibility arena and offer them to end users and business as a ‘package’. Such a package<br />
contains scientific knowledge about accessibility, as well as technological knowledge about how to implement such<br />
notions. This package also contains detailed descriptions of the requirements of the end users, producers and<br />
distributors of content, as well as tools aiming towards market segments that rely on these requirements. Such an<br />
approach that aims to unify 'common' content, system, service and tool provision and the more 'specialised' content,<br />
system, service and tool provision, can be called Accessible Information Processing (AIP).<br />
2.2 Standards and structures<br />
Structured information is the first big step towards high-quality accessible information. A document whose internal<br />
structure can be defined and its elements isolated and classified, without losing sight of the overall structure of the<br />
document, is a document that can be navigated.<br />
As noted above, most adaptive technology allows the user to access a document, and to read it following the "outer"<br />
structure of the original. If that structure is left to a range of visual cues, like bold capital letters for the title of a chapter<br />
and bold italics for the heading of a subchapter, the adaptive device will surely flatten that visual structure, leaving a<br />
document with no structure at all. But if the same document has also an "inner" structure that makes possible for the<br />
adaptive device to distinguish between a paragraph and a footnote, between a chapter and a subchapter, then the level<br />
of accessibility of the whole document will be greatly enhanced, allowing the user to move through it in the same way<br />
those without disabilities do when looking at the printed document, following the same "logic".<br />
In an ideal world, any document made available in electronic format should contain that inside structure that benefits to<br />
everyone. Highly-structured documents are becoming more and more popular due to reasons that very seldom have to<br />
do with making it accessible to persons with disabilities.<br />
Some of the largest publishers are converting their old electronic texts into full XML documents so that it will allow them<br />
to look for certain bits of text that they can re-use in further editions, as well as to help them to avoid double-production of<br />
the same text. Whatever the reasons behind those decisions are, the use of highly-structured information is of great<br />
benefit to anybody accessing them for whatever purpose. And equally important is to structure the following information<br />
standards as this allows for:<br />
Consistency in the description of structural elements<br />
Understanding and predictability of structures<br />
Interaction with other standards<br />
Technical compliance with different devices<br />
Interchangeability of materials<br />
Flexibility and evolution.<br />
Standards are needed for many reasons, but probably the most relevant one is that they help manufacturers to make<br />
their products accessible in a detailed, coherent way. The existence of standards though does not imply that accessibility<br />
will be implemented in the same way or with the same results in all products. The existence of a number of standards for<br />
producing the same product (a document) may occasionally lead to two different levels of accessibility for the same<br />
"accessible" final product. Even within the same standard it sometimes happen that some features are considered<br />
essential while others may be considered expendable – as a result of this, the application of the same standard with<br />
different views about what is needed and what is not to make a document accessible may provide a wide range of<br />
accessibility levels for the same product, making it fully accessible for some users and just slightly accessible for others.<br />
7 http://www.w3.org/WAI/intro/wcag20<br />
9
CWA <strong>15778</strong>:2008 (E)<br />
It is therefore important to decide beforehand the level of accessibility to apply to a certain document according to<br />
different variables, such as:<br />
<br />
<br />
<br />
The depth of the structure that the document allows/needs to make it sufficiently navigable<br />
The level of navigability actually needed by the potential users<br />
The resources available to make a document accessible<br />
It may also happen that different standards are developed for the same purpose and though they deliver the same level<br />
of accessibility they are not compatible. This usually leads to confusion for manufacturers and service providers while it<br />
also "divides" users between the different existing standards (we all remember the Beta vs. VHS example). We can<br />
distinguish between formal (or de jure) standards and de facto standards. The former are those who have been<br />
"formalised" by standards organisations, while the latter are technical solutions that have been adopted informally by<br />
users due to its usefulness and/or reliability. Among these de facto standards, we also have two categories – proprietary<br />
standards (those developed by a commercial company) and open standards (outside vendor’s control, freely developed<br />
and updated by independent programmers).<br />
A list of relevant standards for accessible information processing has been collated and is included in Appendix A. It<br />
quickly becomes clear that no single standard or set of standards can help fully to implement accessible information<br />
processing within mainstream workflows. We rely on the existence and the promotion of accessibility standards to prove<br />
that accessibility can be built from the first stage of production: that Design For All can be applied to emerging standards<br />
so that all the features needed to grant accessibility to the final product are built into the system right from the beginning,<br />
instead of the traditional approach of adding those features later. There are many different standardisation agencies<br />
around the world, but what is considered to be fully accessible in the United Kingdom may not be seen as accessible in<br />
the same degree in Australia.<br />
2.3 Current situation<br />
A publisher can create a product that can be marketed for everyone, that includes accessibility features right from the<br />
first stages of the creative process and that looks just like its non-accessible version. This does not apply to all types of<br />
information (printed information, for instance), but it certainly applies to electronic publishing, audio publishing, the movie<br />
industry, etc. Printed information relies too heavily on its visual appearance and cannot be distributed "as is" to people<br />
with a print disability, but printed information is usually the last presentational output stage. Many other formats<br />
containing the same information are likely to have been created. Those files are sometimes documented, structured,<br />
catalogued and stored for future use, and those files can be created in a way in which the text and images they contain,<br />
the structure they rely on, may be of use for people with disabilities.<br />
One clear example of how accessible information need not necessarily look different or be marketed differently is the<br />
PDF format as demonstrated by Adobe PDF version 8 onwards. 8 A PDF document can be created and distributed either<br />
tagged or untagged. The former will allow print impaired people to read the document; the latter will be of use only to<br />
those who can see it. They both look the same, and they are equally expensive or inexpensive to create. Likewise, web<br />
pages can be designed in a way that can be accessed by assistive technology or not. They have the same look and, for<br />
a programmer who knows which elements to use, making an accessible webpage is not a much bigger task than making<br />
an inaccessible one.<br />
Therefore, what would make information providers decide to create their documents in a way that can be accessible to<br />
everyone? It could be to open up their market to anybody interested in the product they sell; or to comply with the<br />
legislation 9 that requires that particular information is made public in a format that is accessible to all possible; or simply<br />
to reach a much wider audience.<br />
The publishing value chain may change significantly if publishers begin to ascribe greater value the electronic files they<br />
use for printing. Tagged and structured electronic files not only have an intrinsic value (in the e-publishing sector) but<br />
may also be used to create other formats that people with or without disabilities may use or prefer to paper. For example,<br />
8 Adobe Reader from http://www.adobe.com and associated authoring/processing tools from<br />
http://www.adobe.com/products/acrobat/<br />
9 Sullivan, J., (2007) Study on Copyright Limitations and Exceptions for the Visually Impaired, SCCR/15/7,<br />
WIPO, Geneva.<br />
10
CWA <strong>15778</strong>:2008 (E)<br />
the cost of producing a structured talking book using these files together with good quality synthetic speech is minimal<br />
and would increase their potential market.<br />
The "accessibility features" needed to make an electronic document usable for people with print disabilities are not<br />
"exclusive features". This expression has a two-fold meaning – a fully structured XML document will not exclude people<br />
without disabilities from using it just because it is structured and in a readable format. Besides, we do not need to create<br />
an exclusive document to be used only with special equipment (except when copyright exceptions may apply).<br />
What is sometimes considered "accessible" is simply "usable" – structures, navigation, are features that make a<br />
document simply easier to use and more friendly to ANY user. When we add to a printed book a CD-ROM with the text or<br />
the audio (or both) we are not making it available only to people with disabilities - we are also helping students to carry<br />
the full text of the book anywhere in any portable device, and we are also giving them the opportunity to cite parts of the<br />
book in their papers without typing them. We are allowing buyers of the book to listen to it while driving home after work,<br />
as well as giving a blind person the possibility of hearing its contents.<br />
In some cases, electronic documents offer much more possibilities and a better reading experience than the printed<br />
edition. Books full of references to other books and authors, with words that can be found in a glossary or notes that are<br />
explained at the end of the book, are better used and enjoyed when they are published in an electronic format. When<br />
properly edited, entries can be created to any chapter, subchapter or any other significant part of the structure from the<br />
table of contents; an entry to the words in the glossary every time those words are used in the book; or links to the<br />
bibliography every time another author is mentioned. All this can be accomplished with a simple mouse click or by<br />
pressing the space bar. This enhances the value of certain books, and it is a format that has already been used by some<br />
publishers of reference material. These are not books for people with print disabilities, but they are (in most cases)<br />
perfectly accessible.<br />
2.4 Current state of the art<br />
Document processing and accessibility both have a very wide focus, it has therefore been decided to structure the<br />
elaboration of standards in this <strong>workshop</strong> according to the value chain in publishing, namely content creation, content<br />
production and content distribution. Following this workflow the elaboration will be structured in a comprehensive way as<br />
each step in the publishing process will be examined. A general publishing chain can be defined in simple terms as<br />
“economic activities that facilitate the creation, production, circulation, delivery and consumption of information-based<br />
products” 10<br />
In Figure 1 the above elements are visualised to exhibit the intention of how the elaboration is going to be demonstrated.<br />
The content processing chain requires on one side actors that create content, a framework and infrastructure for content<br />
production, storage and management, and at the end user side, consumption and delivery interfaces. In each of these<br />
building blocks of the content processing chain, several actors interact in order to create, produce and consume content.<br />
Those interactions form processing workflows along publishing channels and products, leading to single or multiple<br />
channel publishing. Dealing with each content processing building block requires modelling content, user and their<br />
interactions in a way that implicit steps and knowledge become explicit. Then, in each of these steps, accessibility<br />
requirements and enhancements need to be analysed in order to introduce accessibility right in the beginning rather than<br />
at the end of the content processing chain. This is exactly the methodology followed in the current CWA: in each of those<br />
workflow steps identified, actors and processes are analysed and accessibility requirements elaborated, links to useful<br />
and related standards made explicit, and possible content accessibility enhancement steps described in more detail. For<br />
feasibility reasons we limit the potential workflows in order to present at least in those cases a thorough analysis.<br />
10 South Africa Department of Arts, Culture, Science and Technology (1998), Final Report: The cultural<br />
industries growth strategy (CIGS): The South African publishing industry report<br />
11
CWA <strong>15778</strong>:2008 (E)<br />
Figure 1 - Generic Publishing Process<br />
2.4.1 Current practices in accessible publishing<br />
2.4.1.1 Document Conversion<br />
Much of the current practice in document conversion <strong>cen</strong>tres on the use of Word Processor files and conversion to<br />
formats such as Adobe PDF. There are various methods for achieving this by use of plug-in type software 11 for a host<br />
application or larger scale document conversion services 12 . There are methods which can improve the accessibility and<br />
meta enhanced aspects of these formats, but in order to be successful there is a responsibility on the author to oversee<br />
and ensure the quality of this process.<br />
Other methods are using desktop publishing applications such as Adobe InDesign 13 or QuarkXpress 14 but the latter<br />
cannot generate accessible PDF documents.<br />
There is a lot of industry wide interest in using XML for publishing. There are many practical advantages to using XML<br />
based languages, these include the dynamic reusability and repurposing of content and the development of XML-<strong>cen</strong>tric<br />
workflows by publishers which promises the potential to create structured content that can be outputted in accessible<br />
formats required by people with disabilities 15 .<br />
In certain countries this is due, in part, to changes in legislation where accessible content is a requirement 16 but also to a<br />
growing recognition on the part of the publishing industry of the changes in the ways end users will access content due to<br />
technological advances 17 and the need for users to be able to access published content in the medium of their choice.<br />
11 http://www.adobe.com/support/downloads/detail.jsp?ftpID=1161<br />
12 http://createpdf.adobe.com/<br />
13 http://www.adobe.com/de/products/indesign/<br />
14 http://www.quark.com/<br />
15 http://www.webaim.org/intro/<br />
16 http://www.ncd.gov/newsroom/publications/2004/inclusion_whitepaper.htm<br />
17 http://www.audible.co.uk/<br />
12
CWA <strong>15778</strong>:2008 (E)<br />
While the referenced examples relate primarily to Web accessibility, the issues encountered by users and subsequent<br />
methodologies used to overcome barriers to access, are relevant to the publishing industry 18.<br />
2.4.1.2 Trusted Intermediaries<br />
Trusted intermediaries establish a personalised relationship between content holders and specialist organisations<br />
whereby publishers and agencies serving blind and partially sighted people work together in a secure and trusting<br />
environment to increase the quantity and timeliness of titles available in an accessible format. Within trusted intermediary<br />
frameworks, DRM is an enabler of controlled access. A number of different security methods are being developed or are<br />
already in use for making content available in this way.<br />
As far as security is concerned, the higher the level the more likely publishers are to allow content to be made available<br />
in accessible digital formats. At present, the security systems used are simple, they use basic encryption technologies<br />
with key exchange mechanisms. The potential for the release of content is considerable – although there are few<br />
recorded instances of such occurring. Once decrypted, content is available to anyone, authorised or not. The ability to<br />
attach content to particular devices, or better to provide access only to authorised users, requires a level of DRM<br />
sophistication that is not yet generally in place in services catering to the needs of visually impaired people 19 . Current<br />
examples of such practices include countries like France 20 and Austria 21.<br />
2.4.1.3 Enterprise content management (ECM)<br />
ECM plays an important role because this will likely be the basis for future workflows in publishing environments as the<br />
tools and processes converge. In mid 2002, 20% of EU media and publishing enterprises with a website had content<br />
management systems in place, though only 3% had supply chain CMS 22 . As illustrated in Figure 1 the content production<br />
process consists of many different steps and there are also different actors involved in those processes. ECM plays an<br />
important role in being the <strong>cen</strong>tral element of these processes since all the data can be captured, managed, stored,<br />
preserved, and delivered within the organisation and thus support the publishing process. It is obvious that integrating<br />
the accessibility requirements is an important issue related to ECM and publishing environments. Accessibility<br />
considerations should be taken into account during the whole publishing process rather than at the end as an add-on<br />
feature and so correspond with the design for all principles 23 .<br />
2.4.1.4 Electronic Publishing<br />
The landscape of electronic publishing is constantly evolving but as technology changes ever faster there is an ever<br />
increasing need for publishers to be able to archive their content, easily retrieve it and export it in a wide variety of<br />
necessary output formats. This is a complex and multi layered process and as with any workflow there are certain points<br />
in the digital publishing chain that require specific knowledge and processing methodologies in order to manage them<br />
18 http://www.afb.org/Section.asp?SectionID=4&TopicID=222&DocumentID=1224<br />
19 http://www.indicare.org/tiki-read_article.php?articleId=169<br />
20 du Bourguet, Guillaume; Guillon, Benoit; Burger, Dominique (2003): Helene: a collaborative server to<br />
create and securely deliver documents for the blind. Proceedings AAATE 2003<br />
21 Miesenberger, K.; Ruemer, R.: Schulbuch Barrierefrei (Accessible School Books) - Co-operation Between<br />
Publishers and Service Providers in Austria. In: Proceedings of the 10th International Conference on<br />
Computers Helping People with Special Needs(ICCHP '06 ), Linz, Springer, pp32-39<br />
22 OECD report on scientific publishing, DSTI/ICCP/IE(2004)11/FINAL<br />
23 Darzentas, J., Miesenberger, K.: Design for All in Information Technology: a Universal Concern (Keynote),<br />
in: Andersen, K., V., Debenham, J., Wagner, R. (eds.): Database and Expert Systems Applications, 16th<br />
Internaitonal Conference, DEXA 2005, Copenhagen, Denmark, August 2005, Proceedings, Springer LCNS<br />
3588, Berlin/Heidelberg 2005, pp. 406 – 420<br />
13
CWA <strong>15778</strong>:2008 (E)<br />
successfully. These include archiving data and legacy document usage 24 , data exchange, intellectual property rights and<br />
Digital Rights Management 25 , and finding the best practice models that work for the widest range of publishers 26 .<br />
Though each publisher may have specific needs that differ in the broadest sense, there are common issues for all. These<br />
include, but are not limited to, the need for logical document structure that not only makes sense to readers but also<br />
makes it possible to retrieve, index and update a document by using metadata to identify the various parts of an<br />
electronic document. Processes such as these are essential in order to modify the document format or extract relevant<br />
parts of a document for insertion in another. Much technical work has therefore to be done to create methodologies and<br />
mechanisms to realise and develop these processes.<br />
2.4.1.5 XML-Publishing<br />
There is a transition underway from the limited tag set and functionality of SGML to XML based tools and procedures and<br />
there are various XML based standards, specifications and initiatives for the printing and publishing industry 27 , one of the<br />
most powerful being DITA 28 .<br />
DITA is an XML-based specification for modular and extensible topic-based information. DITA provides a model for<br />
defining and processing new information types as specialisations of existing types. DITA populates the model with an<br />
extensible hierarchy of standard types. DITA encourages re-use by reference either of topics or of fragments of topics.<br />
DITA topics:<br />
<br />
<br />
<br />
can be assembled in different combinations for many deliverables or output formats;<br />
are optimised for navigation and search;<br />
are well suited for concurrent authoring and content management.<br />
DITA is customisable, which allows for the introduction of specific semantics for specific purposes without increasing the<br />
size of other DTDs, and which allows the inheritance of shared design and behaviour and interchangeability with<br />
unspecialised content 29 .<br />
2.5 Current problems<br />
Although initiatives and projects for incorporating accessibility in publishing are ongoing 30 , the current situation relating to<br />
the publishing process is the following:<br />
<br />
<br />
There is a lack of knowledge of accessibility related standards and formats within the publishing<br />
industry. Experiences in the work with publishers also show, that there are different understandings<br />
of “what is structure” between publishers and the accessible content producing communities.<br />
There is also a lack of knowledge of publishing workflows and standards within specialist accessible<br />
content producing communities. This lack of understanding makes it hard for consulters and experts<br />
to give comprehensive suggestions to publishers.<br />
24 http://www.dlib.org/dlib/january00/01hodge.html<br />
25 http://www.publishers.org/press/pdf/DRMWhitePaper.pdf<br />
26 http://www.tbs-sct.gc.ca/im-gi/references/pub/epub-topic_e.asp<br />
27 http://publishing.xml.org/standards/index.shtml#ice<br />
28 http://dita.xml.org/<br />
29 http://xml.coverpages.org/DITA-OASIS-CFP.html<br />
30 BMSG-533048/0001-V/10/2004: Project funded by the Austrian government, for making educational<br />
material accessible using TEI-XML (creation), produce those in formats like large print Braille, PDF and HTML<br />
(production) and distribute them electronically in a variety of formats and taking into account DRM (Digital<br />
Rights Management)<br />
14
CWA <strong>15778</strong>:2008 (E)<br />
<br />
<br />
<br />
<br />
There is a perception that the provision of digital files in alternative formats will compromise technical<br />
protection measures. In the digital world, where copyright infringements cost companies millions of<br />
Euro each year 31 , publishers are understandably afraid, to lose the control over the usage and<br />
distribution of their content.<br />
There is a perception that the provision of accessible format materials is expensive and timeconsuming.<br />
This perception might also be a result of the fact that there is less knowledge on<br />
Accessible Information Processing outside of the accessible content producing communities.<br />
There is a lack of knowledge of the scale of the accessible format market. Its is hard for publishers to<br />
estimate the monetary cost-benefit ratio, since there are less comprehensive statistics and market<br />
estimations for accessible content products. One should also take into account, that accessible<br />
content is also usable for mainstream users and this means also an increase of the target market.<br />
There is no <strong>agreement</strong> on a single unified accessible format which meets the stakeholders<br />
requirements. The file formats used by publishing companies is very specific to the requirements in<br />
the publishing environment. But when it comes to produce an accessible document of the same<br />
content , the requirements of the formats are entirely different to those in the publishing field. The<br />
problem is to unify all the requirements under one common file format.<br />
2.6 Influencing factors in Document Processing for Accessibility<br />
This section describes the different factors that affect on one hand the publishing process and on the other hand the<br />
results of the process – the Documents and the way they can be accessed.<br />
2.6.1 Structure of digital source file<br />
By speaking of structure we have to distinguish between the implicit structure of a document (structure that is only<br />
visually recognisable) and the explicit structure of a document (semantic structure or structure that is describing the<br />
document on a META-Level). Examples for implicit structures are the visual formatting of a text document through line<br />
breaks etc. Examples for explicit structures are the usage of structure elements to describe the content e.g. in HTML this<br />
would mean to use for a Heading or in a Microsoft Word document this would mean to use the “Heading 1”-Format<br />
option to mark a text as heading. If a document has such an explicit structure this makes it possible to transform the<br />
information without loosing the structure of the document. Structure is also the basic requirement and an fundamental<br />
prerequisite for accessible content.<br />
But since publishing is a very diverse process there are many different formats involved. For example, one author might<br />
hand his script to the publisher in Microsoft Word format while another author might do the same but uses a TeX-based<br />
format. Within one file format, for example Microsoft Word, there is also wide range of possibilities for structuring the<br />
document 32 (e.g. by using Headings, Lists, etc.). In Desktop Publishing Tools (DTP), the structure is usually only made<br />
visible through visual formatting of the content. Other file formats do not or only minimally support content structuring (e.g.<br />
plain text format). It can therefore be said that there is not one typical method of making content accessible but that each<br />
case must be looked at individually.<br />
2.6.2 Different accessibility requirements of target user groups<br />
The different target groups (blind 33 , visually impaired 34 , 35<br />
or cognitive 36 ) have different needs in terms of how they<br />
access the information. These requirements differ even within the groups. For example, one visually impaired person<br />
31<br />
http://www.ipi.org/ipi%5CIPIPublications.nsf/PublicationLookupFullText/E274F77ADF58BD08862571F8001BA<br />
6BF<br />
32 http://www.webaim.org/techniques/word/<br />
33 http://www.webaim.org/articles/visual/blind.php<br />
34 http://www.webaim.org/articles/visual/lowvision.php<br />
35 http://www.webaim.org/articles/visual/colorblind.php<br />
15
CWA <strong>15778</strong>:2008 (E)<br />
needs the material in a large print form, while another visually impaired person with a form of colour blindness might<br />
need the information with a different foreground-background colour setting. Besides that, the requirements for a deaf<br />
person are again totally different from those of the groups mentioned before. This variety of needs for different types of<br />
output creates a challenge for the producer of accessible information.<br />
So it is important, particularly for the content authoring process, to consider the diverse range of user’s needs to ensure<br />
that they are met in the content authoring process. In order to better understand how the publishing industry can meet<br />
the accessibility needs of their users, a consultation took place between the members and the stakeholders of the<br />
<strong>workshop</strong>. Based on the results of this consultation, the next chapter elaborates on possible s<strong>cen</strong>arios for publishers<br />
introducing accessibility in their workflow.<br />
2.6.3 Different types of content<br />
Different modalities (text, image, video and audio) have different characteristics, purposes and uses. Every type of<br />
content raises also different issues in terms of accessibility. Images need a descriptive text to be accessible 37 and video<br />
or audio sequences should have a textual transcript 38 .<br />
When modalities are wrapped in particular aggregations or bundles, as they must be in publishing workflows (such as<br />
PDF containing text and images) we hit a huge increase in complexity and confusion because there are so many<br />
different aggregations, no universal <strong>agreement</strong> about terms used and what they mean in each aggregation. This CWA<br />
begins the process of simplifying this specific source of confusion by establishing a number of common s<strong>cen</strong>arios usable<br />
to support accessibility of different types of modality and aggregation.<br />
2.6.4 Different publishing domains<br />
Publishing processes in each domain (general, educational, scientific, etc.) have their own requirements. Some<br />
segments may be consumer-driven and some more producer-driven. For scientific content, stability and content review<br />
are important (as in scientific journals) and particular domains can present strong notational requirements that need tricky<br />
notational representation issues such as representing mathematics in device-independent formats.<br />
Many general domains may require only content distribution with little consumer feedback – this is useful in that it<br />
provides easy ways for publishers to control the processes and apply resources but it is difficult in that any feedback from<br />
customers about what is needed is less direct.<br />
The educational domain requires more flexibility so as to be able to meet the requirements of specific contexts and in<br />
many institutions adaptation for particular learners needs to be carried out. In many parts education also is rapidly<br />
shifting from a <strong>cen</strong>tral distribution (i.e. lecture or book) mode to one where learner participation and learner authoring<br />
needs to be provided for. This can create a serious challenge for publishing.<br />
Also the content itself varies among the different domains. A novel usually has a very flat structure while a biology book<br />
for 10 th graders has a lot of figures, tables and other elements.<br />
2.6.5 Target audience of content<br />
Cultural and religious issues notwithstanding, it is probably desirable that public content (governmental content, news,<br />
etc.) is accessible to the broadest range of people and audiences possible. Commercial content on the other hand may<br />
target particular niches markets (e.g. Product brochure of a machine manufacturer). Still, there are commercial<br />
advantages in expanding the range of a product to the widest market possible. The hidden demand for a product or<br />
feature or some particular content may be wider than the producer would anticipate (e.g. visually impaired, blind and/or<br />
dyslexic persons can benefit also from the production of structured audio books). A distinct advantage of creating<br />
accessible content is the enhanced usability for many other users who may not have a disability. In general we can say<br />
that accessibility interest and usability in general are two converging issues 39 .<br />
36 http://www.webaim.org/articles/cognitive/<br />
37 http://www.w3.org/TR/WAI-WEBCONTENT/#gl-provide-equivalents<br />
38 http://www.washington.edu/accessit/articles?79<br />
39 http://joshuakaufman.org/articles/pdf/Accessible_and_Usable_Web_Design-Kaufman.pdf<br />
16
CWA <strong>15778</strong>:2008 (E)<br />
2.6.6 Digital Rights Management (DRM), usage restrictions and li<strong>cen</strong>ce <strong>agreement</strong>s<br />
Blind, partially sighted and other print disabled people read electronic material by modifying the way in which it is<br />
presented, without modifying the content. They may do this through magnification, transformation into synthetic audio, or<br />
the use of a temporary, or "refreshable" Braille display. In some instances the software with which to make these<br />
changes is incorporated in mainstream packages, but the most flexible and adaptable solutions are achieved via<br />
dedicated "screen reader" software. The term "assistive technology" is used in this document to refer to this form of<br />
access.<br />
Digital rights management schemes, or the technological protection measures within them, can react to assistive<br />
technology as if it was an illicit operation. Thus, the DRM systems applied to e-Books and e-documents can prevent<br />
access by people who use assistive technology to read the screen or to control the computer 40 .<br />
We see that usage restrictions and li<strong>cen</strong>ce <strong>agreement</strong>s can present a particular challenge for accessibility because<br />
making content accessible for some particular context or user may require adaptation of the content which in turn may<br />
require access to parts of the content or even modification of the content using tools other than those it is provided with.<br />
These <strong>agreement</strong>s may be good for producers and intermediate suppliers (such as educational establishments) but<br />
unless crafted very carefully may not serve the needs of actual consumers well.<br />
40 http://www.indicare.org/tiki-read_article.php?articleId=170<br />
17
CWA <strong>15778</strong>:2008 (E)<br />
3 Formats for document processing<br />
In order to systematically describe the processes needed for accessible information processing a three layer generic<br />
system architecture is used to categorise the formats. The formats used can be divided into three categories: input<br />
formats, interchange formats and output formats. Some of the formats belong to more than one category.<br />
Input formats refer to any format that is used to add information to the start of a process. In some cases this involves<br />
getting analog formats such as paper formats digitised in order to convert those to more accessible formats for computer<br />
use. In other cases, the input layer will be the output from another process and may be a purely machine readable format.<br />
For accessible information processing the following formats are often used for input to processes:<br />
Printed paper<br />
<br />
<br />
<br />
<br />
<br />
Printed Braille<br />
ASCII Text<br />
HTML<br />
XML<br />
Multi-type composite formats<br />
Interchange and storage formats (sometimes referred to as representation formats) are extremely important. They are<br />
the formats in a system upon which the main logic or intelligence acts. For accessible information processing it is<br />
essential that this stage of the system is well designed. As systems and technologies become more advanced, these<br />
formats will likely reduce to purely XML formats of well specified international standardized multi-type composite formats.<br />
For the purpose of this document, the following formats have been identified as interchange formats:<br />
ASCII Text<br />
<br />
<br />
<br />
HTML<br />
XML<br />
Multi-type composite formats<br />
Output formats describe the result returned from a completed process. These can be any of the available formats as<br />
they can deliver a format to an end user (be it an actor or another process).<br />
Printed paper<br />
<br />
<br />
<br />
<br />
<br />
<br />
Printed Braille<br />
Audio<br />
ASCII Text<br />
HTML<br />
XML<br />
Multi-type composite formats<br />
3.1 Printed paper<br />
Printed paper is an essential part to document production. It is the traditional end product of production processes, and<br />
can often be the input stage of a Digitisation process. Printed Paper is still the main format in our society for authors,<br />
magazine publishers etc. to make their content available to the public.<br />
Printed paper is an essential part of document production. It is the traditional end product of the production processes,<br />
and can often be the input stage of a digitisation process. How useful it is in terms of generating alternative versions from<br />
a printed paper material depends largely on the quality of the material.<br />
18
CWA <strong>15778</strong>:2008 (E)<br />
3.1.1 Clear print<br />
Many publishers are recognising the market need for (slightly larger format) books using 12 or 14 point text in medium or<br />
bold.<br />
The resulting text is easier to read for the growing number of people who have difficulty reading smaller sized print. This<br />
is increasingly becoming a viable alternative.<br />
3.1.2 Large Print (16 point or over)<br />
Large Print is a format based on Printed Paper. A Large Print document can be either a scaled copy of the original or, if<br />
the electronic source document exists and is available in a word processor format (Microsoft Word, OpenOffice,<br />
StarOffice,...), it is possible to scale the font size and, if necessary, change the colour of the font and background colour<br />
as well as the font itself to satisfy the reader’s needs and print the document in this special setup.<br />
Many readers need larger print than conventional publications have. The font used should be clear and simple, medium<br />
or bold and with good line spacing. For large print documents, the process of conversion depends on the complexity of<br />
the original document and also on the quality requirements for the output document. For simple text documents such as<br />
novels designed to be read in a linear form, production of large print versions can be quite easy: once the original<br />
document’s content is available in a word processing software, characters are increased to a predetermined font size.<br />
The software usually ensures that the text flows on to new pages depending on the page format chosen. The document's<br />
page numbers may be modified (adapted book's page numbers do not match the original page numbers any more). The<br />
page numbering may not be important for a novel but is likely to be relevant in a non-fiction or reference work. In this<br />
case the original page numbering is often marked in the text. Complex documents require more care in their conversion<br />
to large print. Documents containing cross references need to be thoroughly checked because of the changes made to<br />
page numbers 41 . Table of contents and indexes must be regenerated. Production of large print books may involve some<br />
of the techniques of prepress production normal to publishing.<br />
3.2 Printed Braille<br />
Braille has been in use by blind and visually impaired people since it was invented by Louis Braille in 1821. It exists as a<br />
code based on a series of 6 raised dots (scientific Braille uses 8 dots).<br />
Printed Braille is produced by physically pressing dots into paper so that the points can be read by the fingers of the blind<br />
reader. This is a mechanical process converting the digital file into the analog version. Documents can be produced on a<br />
machine like a typewriter which produces single pages or on a printing machine driven by a computer feeding it a Braille<br />
file converted from a text file. Braille is usually printed on larger pages than conventional books in order to allow enough<br />
characters on a line to make reading efficient. The paper is thicker (200 gram/m2 or more) to retain the dots through<br />
many readings. The result is that documents in Braille are much bigger than the conventional print version. With<br />
precision equipment Braille can be printed at both sides of a paper (Interpoint technique) which halves more or less the<br />
amount of paper needed (but not its volume!). For example, a Braille version of the Bible takes up the same shelf space<br />
as 24 box files. There are methods of reproducing printed Braille using the pages as moulds for thin sheets of special<br />
plastic (thermoplast technique), but the current practice is to keep the digital file and print off copies when they are<br />
required.<br />
It is possible to convert existing Braille back to a digital Braille file by scanning, but this requires expensive machinery<br />
and there are only a couple of institutions regularly involved in this practice. It is being used to recover and archive rare<br />
documents such as music Braille where there is no easy route to a printed original which can be scanned. It is a<br />
therefore a finite process and will be less relevant in the future.<br />
Printed Braille output refers to any hard copy which represents Braille. It is the only and universal medium for blind<br />
people to read books without the use of modern information technologies 42 . Despite the new alternatives which came<br />
with Information and Communication technologies it is still very relevant to Accessible Information Processing. This is<br />
especially so when it comes to complex types of content e.g. math, music. This will be described in the following<br />
paragraphs.<br />
41 Unless they are softcoded.<br />
42 One should, however, been aware that only 10 % of the blind population can read Braille. This is due to the<br />
fact that elderly persons (who form the majority of blind people) cannot distinguish the dots anymore.<br />
19
CWA <strong>15778</strong>:2008 (E)<br />
3.2.1 Braille Music<br />
Braille music is a system of converting music notation to be represented in Braille code so that music can be read by<br />
visually impaired musicians. Almost all standard print music notations can also be written in Braille music notation. Braille<br />
music notation is an independent and well-developed notation system with its own conventions and syntax. This is<br />
constantly being expanded by a small group of transcribers around the world who communicate with each other to<br />
secure <strong>agreement</strong> as new instrumentation and musical figures are transcribed for the first time. It is therefore a genuinely<br />
universal notation standard.<br />
Visually impaired musicians' gain the same benefits by becoming musically literate through learning to read Braille music<br />
as do sighted musicians who learn to read printed music. It is therefore an important format in the AIP.<br />
3.2.2 Mathematics in Braille<br />
The need to convey mathematics linearly - without the use of special typesetting and, often, with a limited character set<br />
as well - is a common one. Linearising mathematical formulas is necessary for Braille. Contrary to music, there is no<br />
universal system in place for the linearization or for the codes used. Furthermore, some systems rely on six-dot Braille,<br />
others on 8 dots.<br />
Sometimes the use of spoken mathematics is promoted. Here again there are no universal rules on how to pronounce<br />
(complex) formulas.<br />
Mathematical Braille is largely country or region dependent. Therefore producers should bear in mind the location of the<br />
users they are producing for. Some of the better known codes are:<br />
The US Nemeth code 43 , 44<br />
The German Marburg code 45<br />
The UK Braille Mathematics Notation 46<br />
W3C's MathML 47<br />
In many circumstances the use of Latex for mathematics has been promoted. Latex is a typesetting system based on a<br />
combination of text and codes (somewhat similar to XML, but exclusively layout based). Latex has its own system for<br />
linearising mathematics which can be very useful for higher education purposes and screen reading but not for printing<br />
on paper as it is extremely verbose.<br />
A complete overview of mathematics handling and conversion between "standards" has been made in the European<br />
Lambda project 48 .<br />
3.3 Audio<br />
Audio files have been an integral part of specialist formats for some time. Originally metallic tape and vinyl were used to<br />
distribute spoken materials to print impaired users. These analogue formats have mostly been replaced by digital formats<br />
such as WAV, MP3 and other MPEG formats.<br />
43 More information on http://www.brailleauthority.org/Math-science.htm<br />
44 A. Nemeth. The Nemeth Braille Code for Mathematics and Science Notation 1972 Revision. American<br />
Printing House for the Blind, 1972.<br />
45 H. Epheser, D. Pograniczna, and K. Britz, Internationale Mathematikschrift für Blinde, Deutsche<br />
Blindenstudienanstalt, Marburg (Lahn) 1992.<br />
46 http://www.bauk.org.uk/docs/bmn.pdf<br />
47 http://www.w3.org/TR/MathML2/<br />
48 http://www.lambdaproject.org<br />
20
CWA <strong>15778</strong>:2008 (E)<br />
In modern workflows and supply chains, these are mostly packaged with some kind of multimedia framework such as<br />
SMIL 49 or MPEG 50 but audio can still be used in some very specific processes.<br />
Audio books are by far the most popular medium for the distribution of alternative format publication. Not only are the<br />
specialist libraries around the world producing far more audio than all the other formats put together but commercial<br />
audio is becoming an increasingly important part of the output of many major publishing houses.<br />
Most of the currently commercially available audio books are read by a professional narrator. Often these are actors but<br />
some authors read their own books which might make them more commercially attractive.<br />
Most of the currently commercially available audio books are distributed in uncompressed audio format (PCM).<br />
Commercial production using expensive narrators and professional studios meant that adding even a rudimentary<br />
structure was only a marginal cost. Most productions have at least chapter identifiers and many have chosen to add<br />
structure to a finer level, but always using the basic "one level tracks structure" of audio CDs.<br />
Existing audio CDs can hold a maximum of 80 minutes per disc 51 which results either in a large number of discs for most<br />
of the large books, or in abridged versions of those same works, which is an unacceptable solution for print disabled<br />
users wanting to access the full content of a certain book.<br />
Mp3 versions of books can be produced from a human narration or automatically through computer generated synthetic<br />
speech. The range of existing synthetic speech engines is very wide. Many languages are available though the most<br />
common languages tend to have the best speech quality because of the level of investment in development which<br />
reflects the potential market. A further advantage is that these files are considerably smaller than uncompressed audio,<br />
which allows producers to fit up to 10-15 times the standard duration of uncompressed audio CDs on one data CD<br />
(depending on the desired audio quality).<br />
3.4 ASCII Text<br />
ASCII refers to an internationally recognised text standard. In terms of Accessible Information Processing it is used to<br />
represent either a text file or codes which represent information in Braille. It is becoming increasingly obsolete as supply<br />
chains move towards XML formats, but ASCII remains the base element of much of the information archived by national<br />
specialist providers.<br />
3.5 HTML documents<br />
HTML is a very common output format. As a mark-up language, structure is given a high importance, so when correctly<br />
formatted, an HTML document can be searched and browsed in a very logical manner, and headings, paragraphs, links,<br />
lists, and so on can be used for full easy navigation. One of the advantages of HTML is that the files can be read on any<br />
web browser in any platform. A key disadvantage of HTML is that it is not possible to edit the content while reading it in a<br />
browser, as we can do when reading documents in text-processors. This may sometimes be necessary and can be<br />
compared to adding notes in a traditional book. 52<br />
It does allow for the user, however, to open the source file of the<br />
document and add information if they know how to do so.<br />
A bigger disadvantage is that the layout of the information is mixed in the same HTML document with the information<br />
itself and its structure, which may be a handicap when accessing the content.<br />
49 SMIL: Synchronized Multimedia Integration Language, http://www.w3.org/AudioVideo/<br />
50 MPEG Moving Picture Experts Group, http://www.chiariglione.org/mpeg/<br />
51 This is due to the initial purpose of audio CDs’: high quality recording of music. On audio CDs no trade off<br />
between quality and length of recording can be made.<br />
52 Editing of HTML based texts on the web can be done through WIKI technology, a system for collaborative<br />
working on one single document, mostly known from the wikipedia encyclopedia pages but applicable to other<br />
documents too (cf. EUAIN wiki below).<br />
21
CWA <strong>15778</strong>:2008 (E)<br />
The Internet is becoming for many the dominant information location and retrieval source. Accessibility of websites and<br />
search engines (based mostly on HTML) is therefore essential to the needs of visually impaired people. Different<br />
guidelines exist on how to structure correctly HTML files, as well as guidelines on how to make them fully accessible. 53<br />
3.6 XML<br />
The Extensible Markup Language (XML) is a W3C-recommended general-purpose markup language that supports a<br />
wide variety of applications. XML is a simplified subset of Standard Generalised Markup Language (SGML). Its primary<br />
purpose is to facilitate the sharing or the re-use of data across different areas of application. Formally defined languages<br />
based on XML allow diverse software to reliably understand information formatted and passed in these languages.<br />
Based on these standardised languages it is also possible to write converters to transform the information into other<br />
formats.<br />
The format is important for accessible information processing as it provides interchange formats for converting<br />
information to accessible information. XML is the name of the general family of several describing languages. Two<br />
implementations of XML (technically called DTDs or Schemas) are the TEI standard and the already mentioned DAISY<br />
standard.<br />
The Extensible Markup Language (XML) is a mark-up language for describing data regardless of its external appearance.<br />
XML is also focussing mainly on the structure of the content. In a different process numerous different transformation<br />
style guidelines can be applied to one XML document to present the information in the most suitable way. In the context<br />
of accessible information, XML has two main advantages over other formats – the first is that the elements of books like<br />
chapters, paragraphs, headings, can be perfectly represented by XML's structural elements; the second advantage is<br />
that, being a layout-agnostic language, different transformation style guidelines may be applied to the same content to fit<br />
different user’s needs and preferences.<br />
This format is therefore of great importance for accessible information processing as it provides interchange formats for<br />
converting information to accessible outputs. 54<br />
3.6.1 XHTML<br />
XHTML is in principle a reformulation of the older HTML 4 specification. The formal specification of XHTML can be read<br />
here http://www.w3.org/TR/xhtml1/, but in the context of accessible materials a number of points could be made to help<br />
understanding the concept. XHTML is an application of XML. This means that even though XHTML has the semantics of<br />
HTML, it has the syntax and the very strict rules of XML. There are both advantages and disadvantages compared to<br />
HTML:<br />
Among the advantages are that there are a number of tools that are specifically made for working with XML, called XMLparsers.<br />
These are built into software that use or read XHTML documents, and they make the processing of the<br />
document both easier and more accurate. Ironically, this creates a disadvantage for the user. Since XHTML documents<br />
must be valid (e.g. conform to the very strict rules of XML) they are difficult to create and edit by hand and as a<br />
consequence most XHTML documents are machine-generated.<br />
Software that requires valid XHTML documents include Digital Talking Book players and some e-text readers. It must be<br />
noted however, that most Internet browsers – such as Internet Explorer 6 and 7 - are very tolerant and can render<br />
XHTML documents even if they are not valid.<br />
3.6.2 TEI XML<br />
TEI is a specific implementation of the XML standard.<br />
53 HTML standard. ISO/IEC 15445:2002 Information technology - Document description and processing<br />
languages - HyperText Markup Language (HTML). HTML 4.01 Specification (HyperText Markup Language),<br />
http://www.w3.org/TR/html401/<br />
54 XML standard Extensible Markup Language (XML) 1.0 (Fourth Edition) 29 September 2006.<br />
http://www.w3.org/TR/REC-xml/<br />
22
CWA <strong>15778</strong>:2008 (E)<br />
The Text Encoding Initiative (TEI) Guidelines are an international and interdisciplinary standard that enables libraries,<br />
museums, publishers, and individual scholars to represent a variety of literary and linguistic texts for online research,<br />
teaching, and preservation. 55<br />
3.6.3 Daisy XML<br />
The term “DAISY” covers a variety of standards and specifications that are maintained and supported by the DAISY<br />
Consortium (www.daisy.org). DAISY is an acronym that means “Digital Accessible Information System”. The term<br />
covers at least 3 different standards and specifications:<br />
DAISY 2.02: A specification that defines the DAISY Digital Talking Book (DTB) format. DAISY 2.02 is by far the most<br />
widespread and popular of the DAISY formats and has gained support from both producers of digital talking books as<br />
well as producers of players and production software. When people talk about “a DAISY book” they are probably<br />
referring to DAISY 2.02.<br />
DAISY 2.02 is based on XHTML and SMIL (Synchronised Multimedia Integration Language – also an application of XML).<br />
A number of different audio formats can be used in a DAISY 2.02 book – the most common one used being MP3. A<br />
DAISY 2.02 book can contain the full text of a printed work as well as pictures and other multimedia content, or it can be<br />
an audio only book.<br />
The heart of a DAISY 2.02 book is the NCC file (Navigation Control Centre) that both presents itself to the user as the<br />
Table of Content and works as the reading software’s main system file. The NCC is always present in a DAISY 2.02 book<br />
even if it is without text.<br />
DAISY 2.02 books can be read using either a hardware player or a computer with software for this purpose. DAISY 2.02<br />
is platform independent. A number of commercially available production tools for producing DAISY 2.02 books are<br />
presented on the DAISY Consortium website. 56<br />
Daisy/NISO Standard 2005 (aka DAISY 3/ Z39.86): This standard covers a complex of sub-specifications and Document<br />
Type Definitions (DTDs) that together constitute a Digital Talking Book. Compared to DAISY 2.02 this standard is very<br />
advanced and can be used to represent any book in text and audio format.<br />
The sub-systems in a DAISY 3 book include:<br />
The manifest, that contains a complete list of all the files that make up the DTB<br />
The Spine, that defines the reading order of the DTB<br />
Tours, that defines alternative reading orders of the DTB<br />
The heart of the DAISY 3 book is the NCX file that functions in much the same manner as the NCC file does in a DAISY<br />
2.02 book. Unlike the NCC, the NCX represents the structure of the book in a true hierarchy.<br />
Many of the features of DAISY 3 are drawn from Open eBook Publication Structure Version 1.2. DAISY 3 is a<br />
fairly new standard and books are not yet widely distributed by DTB producers. Some producers use DAISY 3<br />
as a production and interchange format where the great richness in detail makes it ideal for XML/XSLT<br />
transformations to other formats, mainly DAISY 2.02 for distribution. 57<br />
DTBook: The text component of a Z39.86 book consists of one ore more DTBook files. The DTBook<br />
specification has, however, found widespread use outside the framework of the Digital Talking Book. DTBook<br />
is an element set that represents European/American book tradition in an XML context. The element set<br />
consists of 79 elements, but can be extended with e.g. math, poetry etc. DTBook is specially suited as an<br />
55 Guidelines for Electronic Text Encoding and Interchange (2002). http://www.tei-c.org/P4X/<br />
56The full specification can be seen here: http://www.daisy.org/z3986/specifications/daisy_202.html<br />
57 The standard can be seen here: http://www.niso.org/standards/resources/Z39-86-2005.html<br />
23
CWA <strong>15778</strong>:2008 (E)<br />
interchange format and is widely used as such. Using XML tools like XSLT, a DTBook file can be transformed<br />
into PDF, RTF (Word), XHTML, HTML and virtually any other text format. 58<br />
NIMAS (National Instructional Materials Accessibility Standard): A standard that was developed 2002-2004<br />
by the National File Format Technical Panel. NIMAS is a sub-set of Z39.86.<br />
“NIMAS is a technical standard used by publishers to produce source files (in XML) that may be used to develop<br />
multiple specialised formats (such as Braille or audio books) for students with print disabilities.<br />
The source files are prepared using Extensible Markup Language (XML) to mark up the structure of the original content<br />
and provide a means for presenting the content in a variety of ways and styles. For example, once a NIMAS fileset has<br />
been produced for printed materials, the XML and image source files may be used to create Braille, large print, HTML<br />
versions, DAISY talking books using human voice or text-to-speech, audio files derived from text-to-speech<br />
transformations, and more.<br />
The separation of content from presentation is an important feature of the NIMAS approach. In most cases, a human will<br />
need to enhance the source files to provide additional features needed by diverse learners.<br />
The various specialised formats created from NIMAS file sets may then be used to support a very diverse<br />
group of learners who qualify as students with print disabilities. It is important to note that most elementary<br />
and secondary educational publishers do not own all of the electronic rights to their textbooks and related<br />
core print materials and a copyright exemption allows them to deliver the electronic content of a textbook and<br />
the related core print materials to the NIMAC, a national repository which began operations on 12/3/06, as<br />
long as the publishers possess the print rights. NIMAS applies to instructional materials published on or after<br />
7/19/06. “ 59<br />
3.6.4 MathML<br />
Mathematical Markup Language (MathML) is an application of XML for describing mathematical notation and capturing<br />
both its structure and content. It aims at integrating mathematical formulas within other XML documents. It is a<br />
recommendation of the W3C math working group.<br />
3.7 Multi-type composite formats<br />
Those formats are especially important to the publishing industry, because they are widely used and part of the<br />
publishing process, In general we consider here:<br />
<strong>Education</strong>al multimedia documents<br />
<strong>Education</strong>al publishing may require a greater degree of flexibility than other forms of publishing<br />
because of the direct engagement with the persons using the publication. Typically it is more<br />
interactive (as it must engage the learner’s attention in a more demanding context). <strong>Education</strong> often<br />
involves direct learner participation, authoring, and other processes such as assessment. Content<br />
and processes must match the learner’s particular context and requirements as closely as possible<br />
because the closer the match the better the learning outcome. This is very demanding. Given the<br />
huge variety and individual nature of assistive technology and personal adaptations (such as<br />
cognitive adaptations) it is unrealistic to expect that content producers can tailor content for each<br />
circumstance or even that they can have the accessibility knowledge necessary to do so.<br />
<br />
Scientific, technical and medical documents (STM)<br />
The global market for English-language STM (scientific, technical and medical) journals is about $5<br />
billion. The industry employs 90,000 people globally, of which 40% or 36,000 are employed in the EU.<br />
Another 20– 30,000 full time employees are indirectly supported 60 One of the main typesetting<br />
58 A comprehensive set of guidelines for applying DTBook mark-up is available from the Daisy Consortium.<br />
(http://www.daisy.org/z3986/guidelines/sg-daisy3/structguide.htm). Other sources: Theory behind the DTBook DTD<br />
(http://www.daisy.org/publications/docs/theory_dtbook/theory_dtbook.html)<br />
59 From the NIMAS web site (http://nimas.cast.org/about/nimas/index.html):<br />
60 http://www.stm-assoc.org/storage/Scientific_Publishing_in_Transition_White_Paper.pdf<br />
24
CWA <strong>15778</strong>:2008 (E)<br />
3.7.1 PDF<br />
systems used is TeX, and the main document mark-up language and document preparation system<br />
is LaTex. It is widely used by mathematicians, scientists, philosophers, engineers, and scholars in<br />
academia and the commercial world, and by others as a primary or intermediate format (e.g.<br />
translating DocBook and other XML-based formats to PDF) because of the quality of typesetting<br />
achievable by TeX. It offers programmable desktop publishing features and extensive facilities for<br />
automating most aspects of typesetting and desktop publishing, including numbering and crossreferencing,<br />
tables and figures, page layout and bibliographies 61 .<br />
In the past, Adobe PDF files could be very inaccessible, especially to people using screen readers. When the PDF is<br />
made by attaching an image of a page to another it is still completely inaccessible as screen readers have no text to read.<br />
This began to change with Acrobat 5, when Adobe introduced the ability to tag PDF files for accessibility. Although PDF<br />
tags could not be manipulated as easily as HTML tags, they made the content more accessible to some users with<br />
screen readers. Adobe Reader 7 continues to improve the user's access to PDF files offering the possibility to customise<br />
preferences extensively. Additionally, Adobe has included a DRM (Digital Rights Management) mechanism into Acrobat<br />
Reader, but there are also several other DRM plug-ins that work with Acrobat Reader.<br />
Due to the major characteristic of PDF, the fact that it is rendered the same no matter on which viewer or operating<br />
system it is viewed on, PDF has become the most popular format used by publishers and increasingly among other<br />
content creators. With PDF the difference between an accessible and an inaccessible document depends on proper<br />
usage of the programmes used to create PDF files. Much more education and training in creating accessible PDF is still<br />
needed.<br />
3.7.2 QuarkXpress® file format<br />
Quark®, provides a layout software, called QuarkXpress®, that used for writing, editing, and typography with colour and<br />
pictures to produce rich final outputs for print and Web 62 . It is used by more than three million users worldwide. The<br />
software is used mainly for creative design and page layout. QuarkXpress Version 7, provides a Voluntary Product<br />
Accessibility Template 63 (VPAT) that details the accessibility features of Quark's product in order to help customers to<br />
determine its compliance with Section 508.<br />
With QuarkXPress users can import and export XML Documents. With Quark Digital Media Server content can be stored<br />
in a <strong>cen</strong>tral database. It then can be used in multiple forms according to the principles of multi-channel publishing. Quark<br />
XTensions software, which are plugins, can automate functions and eliminate repetitive steps with palettes, commands,<br />
tools, and menus. Tests with QuarkXPress 6.5 Passport (international Edition) showed that QuarkXPress was not able to<br />
import the TEI-DTD. To tag the text of the book, a new, flat DTD had to be written. With the new DTD the mapping from<br />
layout formats to XML tags was possible. The content then is exported into a XML file. This is the basic version for the<br />
accessibility work.<br />
3.7.3 In Design file format<br />
Adobe® markets InDesign® to produce professional page layouts 64 . Riche and complex documents and outputs for<br />
multiple media. Adobe InDesign CS3 software version, supports accessible cross-media publication, allowing export of<br />
InDesign documents into PDF, XHTML, and XML. Users can add tags and alternative text attributes to InDesign<br />
documents that support the production of accessible content in these exported formats. 65<br />
InDesign from Adobe Inc. is a desktop publishing application (DTP) which can work with XML files. It is possible to import<br />
XML into InDesign and then prepare the document for output e.g. printed book. This feature is an important step toward<br />
61 http://en.wikipedia.org/wiki/LaTeX<br />
62 http://www.quark.com/products/xpress/<br />
63 http://www.quark.com/products/xpress/pdf/VoluntaryProductAccessibilityTemplate.pdf<br />
64 http://www.adobe.com/products/indesign/<br />
65 http://www.adobe.com/accessibility/products/indesign/<br />
25
CWA <strong>15778</strong>:2008 (E)<br />
multi-channel and cross-media publishing. Tests with Adobe InDesign CS2 showed that it is possible to tag the text of the<br />
layout document. Further investigations are done to efficiently map layout to the structure. InDesign supports the<br />
mapping of text-formats to XML-Tags but the structure had to be added afterwards. The mapping feature can be used, if<br />
the text is in a proper layout. Otherwise the user has to mark the specific text area (e.g. one chapter) and then to assert<br />
the XML tag to the text.<br />
3.7.4 Digital Talking Book (DTB) documents<br />
For many years, "talking books" have been made available to print-disabled readers on analog media such as<br />
phonograph records and audiocassettes. These media served their users well in providing human-speech recordings of<br />
a wide array of print material in increasingly robust and cost-effective formats. However, analog media are limited in<br />
several respects when compared to a printed book. Firstly, they are by their nature linear presentations, which leave<br />
much to be desired when reading reference works, textbooks, magazines, and other materials that are often accessed<br />
randomly. In contrast, digital media offer readers the ability to move around in a book or magazine as freely as (and more<br />
efficiently than) a sighted reader flips through a print book. Secondly, analog recordings do not allow users to interact<br />
with the book by placing bookmarks or highlighting material. A DTB offers this capability, storing the bookmarks and<br />
highlights separate from, but associated with, the DTB itself. Thirdly, talking book users have since long complained that<br />
they do not have access to the spelling of the words they hear. As will be explained below, some DTBs will include a file<br />
containing the full text of the work, synchronised with the audio presentation, thereby allowing readers to locate specific<br />
words and hear them spelled. Finally, analog audio offers readers only one version of the document. If, for example, a<br />
book contains footnotes, they are either read where referenced, which burdens the casual reader with unwanted<br />
interruptions, or grouped at a location out of the flow of the text, making them difficult for interested readers to access. A<br />
DTB allows the user to easily skip over or read footnotes. The Digital Talking Book offers the print-disabled user a<br />
significantly enhanced reading experience -- one that is much closer to that of the sighted reader using a print book.<br />
The DTB goes far beyond the limits imposed on analog audio books because it can include not just the audio rendition of<br />
the work, but the full textual content and images as well. Because the textual content file is synchronised with the audio<br />
file, a DTB offers multiple sensory inputs to readers, a great benefit to, for example, learning-disabled readers or people<br />
with dyslexia. Some visually impaired readers may choose to listen to most of the book, but find that inspecting the<br />
images provides information not available in the narrative flow. Others may opt to skip the audio presentation altogether<br />
and instead view the text file via screen-enlarging software. Braille readers may prefer to read some parts or the entire<br />
document via a refreshable Braille display device connected to their DTB player and accessing the textual content file.<br />
Digital Talking Books are not tied to a single distribution medium. CD-ROMs will be used first but DTBs are portable to<br />
any digital distribution medium capable of handling the large files associated with digital audio recordings. Regardless of<br />
how a DTB is distributed, however, it will normally be in the context of a digital rights management system.<br />
One implementation of DTBs is based on the Daisy Standard (DAISY 2.02 / ANSI/NISO Z39.86-2005). Daisy books are<br />
described above in the sections 3.6.3.<br />
This standard for creating digital content in structured multimedia is developed and maintained by the DAISY Consortium.<br />
Using XML text files and MP3 audio files, the DAISY format can create a range of text only, fully synchronised text and<br />
audio and audio-only books that are fully accessible and navigable for blind and visually impaired users as well as people<br />
with other disabilities such as dyslexia. It allows up to 6 levels of structure (chapter, subchapter, paragraph, and so on) as<br />
opposed to the one-level structure of commercial audio CDs, which makes it suitable for complex books like educational<br />
materials.<br />
The DAISY standard has been adopted as the standard to be used by publishers in the United States of America to<br />
comply with the Instructional Materials Accessibility Act (2002). In Europe DAISY is also used by a wide range of<br />
alternative media publishers to create accessible material.<br />
26
CWA <strong>15778</strong>:2008 (E)<br />
4 Considerations for structuring documents<br />
Structured information is the first step in the accessible information process. A document whose internal structure can be<br />
defined and its elements isolated and classified, without losing sight of the overall structure of the information, is a<br />
document that can be navigated.<br />
Most adaptive technology allows the user to access a document, and to read it following the "outer" structure of the<br />
original. But if the same information has also an "inner" structure that allows the adaptive device to distinguish between a<br />
phrase and a measure, between a paragraph and a sentence, highlighting particular annotations, then the level of<br />
accessibility (and therefore usability) of the whole document will be greatly enhanced, allowing the user to move through<br />
it in the same way as those without impairments do when looking at a printed document, and following the same integral<br />
logic. In an ideal world, all documents made available in electronic formats should contain this internal structure that<br />
benefits everyone. Highly-structured documents are becoming more and more popular due to reasons that very seldom<br />
pertain to making them accessible to people with disabilities.<br />
The move to XML related formats and associated standards for metadata have provided an impetus for far greater<br />
document structuring than before. Whatever the reasons behind those decisions are, the use of highly-structured<br />
information is of great benefit to anybody accessing them for any purpose. In re<strong>cen</strong>t years, the market for accessibility<br />
and assistive technologies has started to gain recognition. It is clear that the integration of accessibility notions into<br />
mainstream technologies would provide previously unavailable opportunities in the provision of accessible multimedia<br />
information systems. It would open up modern information services and provide them to all types and levels of users, in<br />
both the software and the hardware domain. Additionally, new consumption and production devices and environments<br />
can be ad- dressed from such platforms and this would provide very useful information provision opportunities indeed,<br />
such as information on mobile devices with additional speech assistance.<br />
Structuring content is essential for the subsequent processing and transformation of it into accessible content.<br />
Document style sheets and guidelines need to be followed though out the content creation process, so to<br />
enable successful conversions into accessible content and eliminate costly post-editing tasks. For that reason,<br />
careful consideration of styles and usage <strong>agreement</strong>s with stakeholders are of special importance. The next<br />
sections present the steps that need to be introduced in publishing workflows to enable accessible content<br />
production. These workflow steps listed below are also part of the s<strong>cen</strong>arios introducing accessibility within publishing<br />
workflows (section 0) and given here for allowing the reader to refer back when reading this section.<br />
4.1 Define and use document style guidelines<br />
This activity basically defines style guidelines that will be used by all actors in the accessible content processing value<br />
chain. Introducing style guidelines requires an <strong>agreement</strong> between all actors in the publishing chain. These style<br />
guidelines can be used to tag specific content and map it into a specific conversion. The consistent use of style<br />
guidelines through the whole publishing chain enables efficient processing and automatic structuring of content and thus<br />
enhances significantly the accessibility of it. Actually it is not necessary for all actors use the same style guidelines but<br />
only if the mappings between the used style guidelines exist (which is not always the case!) and are well known.<br />
There are several style guidelines available, very commonly used are those based on the Chicago Manual of Style 66<br />
4.2 Define and use structure guidelines<br />
In this specific activity guidelines that define structure are agreed between the relevant actors. Structural guidelines are<br />
more abstract than style guidelines. They actually define how structure is tagged in the content. In the case of MS Word,<br />
structural guidelines will need to define how different headings are tagged. By consistently following the structural<br />
guidelines, automatic conversion can be achieved. Structure also deals with tagging of sections that contain graphics,<br />
images, drawings, other rich media, math, music etc. Structural guidelines also enable navigation on the structural<br />
elements and by doing so ease information retrieval and content consumption. A good point to start and examples on<br />
how to handle images or drawings can also be found in the structure guidelines for DAISY at<br />
http://www.daisy.org/z3986/structure/<br />
66 http://www.chicagomanualofstyle.org/home.html<br />
27
CWA <strong>15778</strong>:2008 (E)<br />
4.3 Edit / add structure where needed<br />
This activity deals with editing or adding structural elements, where either those were lost in the conversion process, or<br />
have not been tagged in the beginning because of inconsistent use of style guidelines. If it is a post-processing operation<br />
then it requires a lot of manual effort. Therefore, those tasks should be kept to a minimum.<br />
In order to edit or add structure in this task, guidelines on how to add structure by hand need to be developed and used<br />
consistently as well. A list of relevant guidelines has been compiled by the EUAIN project and can be found in the<br />
literature 67<br />
4.4 Edit DRM settings<br />
Unless very carefully handled, DRM has the potential to disable adaptation of content to make it accessible in another<br />
context. For example, a format may include text that is locked into the format, cannot be copied and is not accessible<br />
with assistive technology such as a screen-reader. The text may be only accessible as a bitmapped image, meaning that<br />
it cannot be simplified or rendered in some other format. Whatever the format and tool the content is produced with, it is<br />
unlikely that the designer will have considered and provided for all the contexts in which it may be used. For best results<br />
the separate modalities within material need to be available for other software and hardware tools that are able to make<br />
the material accessible for unforeseen contexts to use. This usually means leaving the text unlocked 68 . This topic is for<br />
expansion in further work.<br />
4.5 Adaptation<br />
Adaptation for a context typically will involve matching the content to the context and making changes to meet that<br />
context. This is much easier if the content exists in a form that can be taken apart (disaggregated), supplemented with<br />
other content (such as subtitling a video) then put together again. The reason for this is that because adaptations are<br />
often needed for only parts of the material and not for all of it. It may be that a visually-impaired person cannot use the<br />
diagrams in the content, for example, so that it is necessary to include alternatives for the diagrams in the content.<br />
In all cases it will be necessary to examine both the context AND the material to see if they match. If, in the production<br />
chain, this process takes place close to the user it is more likely that the materials will match the user’s needs. In some<br />
cases, such as described in s<strong>cen</strong>ario 11, the user’s requirements will be directly available. In others it will be necessary<br />
to incorporate many alternatives within the content (so that the learner can select the modalities she can use) but still to<br />
allow for the content to be adapted as late as possible to meet unanticipated user needs.<br />
When the user’s requirements are not directly available, as in a pre-sale publication process, it is useful to test the<br />
material against such checkpoints as are provided in sets of guidelines elsewhere in this document. For web content the<br />
Web Accessibility Initiative Web Content Accessibility Guidelines can form a useful base to do this and a number of<br />
automated tools that can assist with this process are available. It is important to remember that doing this is using virtual<br />
or average requirements and will not meet all circumstances but does help.<br />
67 http://wiki.euain.org/doku.php?id=wiki:guidelines_for_accessible_information_processing<br />
68 This topic was studied in the European SEDODEL project<br />
(http://canada.esat.kuleuven.be/docarchwebsite/show.jsp?page=projects&id=SEDODEL) but due to rapid<br />
changes in DRM technology, the work should be continuously updated.<br />
28
CWA <strong>15778</strong>:2008 (E)<br />
5 Conversion processes<br />
This list of conversion processes is not exhaustive but it should cover the most important conversions in the accessible<br />
content processing workflows. The conversions listed below are also part of the s<strong>cen</strong>arios introducing accessibility within<br />
publishing workflows (section 0) and given here for allowing the reader to refer back when reading this section.<br />
5.1 Convert Multimedia Material to structured Multimedia Material<br />
By multimedia material we mean all content that consists of different media types. Multimedia content can be a Microsoft<br />
Word document that consists of text and images but it can also be a video sequence. In most of the following s<strong>cen</strong>arios<br />
the term Multimedia Material is used for PDF files.<br />
Accessible PDF can be seen in some cases as an output format which is passed to the reader. Sometimes it might be an<br />
interchange format from which further transformations into other formats are performed.<br />
If the accessible PDF is intended as output format, creating accessible tagged PDF files will produce PDF files<br />
accessible to standard screen readers which support tagged PDF (like JAWS and Window Eyes).<br />
This circumvents the need for end users to learn how to use Adobe's embedded speech synthesiser.<br />
However, it is not always easy to make PDF files directly accessible to screen readers. Documents with complex layouts<br />
can be extremely difficult, if not impossible, to convert into an accessible PDF file, due to the fact that the content does<br />
not linearise correctly. It can also be very challenging to make documents with extensive charts or with embedded videos<br />
accessible.<br />
Converting PDF into accessible PDF may include the following steps (a detailed description of these steps can be found<br />
on the EUAIN Training Resource Centre 69 ):<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
Apply OCR to Image only PDF<br />
Converting Existing PDF Files into Tagged PDF Files<br />
Change Tag Type<br />
Add Alternate Text to Images<br />
Create New Tag<br />
Delete Tag<br />
Reorder Tags<br />
Reordering Tags Using the Order Tab<br />
Artefacts<br />
Adding Tags to Untagged PDF Files<br />
Add Tags to Documents Feature<br />
Add All Tags Manually<br />
TouchUp Reading Order Feature<br />
Adding Tags Using TouchUp Reading Order<br />
69 http://wiki.euain.org/doku.php?id=wiki:processes:conversions:html_to_xml<br />
29
CWA <strong>15778</strong>:2008 (E)<br />
5.2 Convert structured Multimedia Material to XML<br />
By structured Multimedia Material we mean multimedia content that has information on its structure included. This can be<br />
a fully navigable audio document, a Microsoft Word document that makes correct usage of heading etc or a fully<br />
accessible video document with subtitling. In most of the s<strong>cen</strong>arios we mean accessible PDFs when speaking of<br />
structured Multimedia Material.<br />
When Multimedia Material is structured it makes it possible to transform the Material into various output formats. In case<br />
of PDFs it is, if possible generally more cost effective to start from the original word processing documents. Quite often<br />
the original file used to create the PDF is unavailable. In that case you can create a XML file using Acrobat, but the file<br />
will probably be more complex and will require more post processing and restructuring work to make it accessible.<br />
If you have images, only the alternate description will be saved, but not the image, and there are no tables in the HTML<br />
file, even if the table was an appropriately-tagged data table in the original PDF file. More information can be found on<br />
the EUAIN Training Resource Centre.<br />
5.3 Convert Multimedia Material to XML<br />
PDF is widely recognised as a de facto standard for electronic end user documents. Public services, banks, insurance<br />
companies and others that distribute electronic documents to end users utilise the advantages of a format that is easily<br />
printed by the user and has a fixed unchangeable layout. It is also widely recognised that PDF is unsuitable for anything<br />
else than visual presentation. Information in a PDF-document is deliberately made un-flexible and is not easily retrieved.<br />
In spite of these facts, one of the most common tasks for national providers of accessible information is the conversion of<br />
PDF to an accessible format. At the same time, this particular conversion is both costly and time-consuming.<br />
There are two different approaches to extract information from a PDF-document. Since PDF can be compared to a highquality<br />
TIFF picture – the same picture that is the result of a scanning process – it is possible to use the same OCR<br />
processes with PDF-documents as the processes used with scanned paper-documents – and the results are comparable.<br />
Most of the OCR-software providers on the market are recommending this method. The other approach is to try to extract<br />
the text-information that is present in a PDF-document. This is done by software that reads both the text content and the<br />
visual information in a PDF-document and translates this information to the common attributes of a text document –<br />
paragraphs, headings, etc.<br />
The result of both processes can then be exported into an XML file which is the basis for further accessible information<br />
processing. Both of these methods require extensive use of human resources and are not easily automated. Especially<br />
rendering of visual components such as columns, tables, page-headers, unusual fonts, etc. have proved very difficult to<br />
automate. Considering the large amount of human resources employed in this type of conversion, it could be concluded<br />
that research efforts used on improving these methods would be very well spent. It should be noted that conversions to<br />
accessible formats earlier in the production chain – when the conversion process is still easily automated – would have<br />
similar results.<br />
5.4 Convert traditional print to XML<br />
This activity refers to the transformation of printed-paper into an electronic interchange format. Usually this is completed<br />
through scanning of printed pages.<br />
The result is a set of image files (one file per page unless the multiple page TIFF format is used). These image files can<br />
then be saved or be processed through optical character recognition (OCR) software. This software recognises the text<br />
in the images and transforms the images into a text file. Either by specific software or by hand this text can be converted<br />
into a structured XML file.<br />
This output will refer to structured content which is either being fed into an XML reader which accesses the content and<br />
the structural data which surrounds that content or the XML is being fed into the next stage of an XML production stream<br />
for further conversion.<br />
This process describes the first stage of a digitisation project where an analog format is being taken in and converted into<br />
a format (XML), which allows further conversion, storage or processing.<br />
30
CWA <strong>15778</strong>:2008 (E)<br />
5.5 Convert DTP to XML<br />
This, primarily multimedia, input will refer to a complex package of media files, which are held together by one governing<br />
structure. Well known DTP packages currently are QuarkXpress and Adobe InDesign. These documents also have the<br />
most complex formats, which can be used in accessible information processing, and the ability to process these formats<br />
forms the key to integrating accessibility with mainstream workflows and processes.<br />
In the case of Accessible Output from a Multimedia package, guidelines are often needed for describing images and<br />
other multimedia formats which cannot be directly translated into a suitable format.<br />
XML output will refer to structured content which is either being fed into an XML reader which accesses the content and<br />
the metadata which surrounds that content or the XML is being fed into the next stage of an XML production stream for<br />
further conversion.<br />
This process is likely to be taking some (or all) of the content in a multimedia package and processing it into a more<br />
generic format (XML) for conversion into further formats. More information can be found on the EUAIN Training Resource<br />
Centre 70 .<br />
5.6 Convert XML to Print<br />
XML to printed paper conversion processes (specific to accessible information processing) would refer to any process,<br />
which uses XML data to prepare content for accessible use based on a paper format.<br />
Primarily XML input will refer to part of an XML production stream where XML is used as the core interchange format.<br />
This being the case, it is likely that there will be some pre-processing, as content is rarely created by hand straight into<br />
XML. From XML, content can be converted to almost any format so it is a very good starting point in accessible<br />
information processes.<br />
The transformation itself is usually done automatically by software. It uses specific transformation templates (XSLT) to<br />
convert the XML data into a readable format. The transformation templates contain information on font sizes, colours,<br />
and also layout information.<br />
Printed-paper output refers to any hard copy, which represents the information. It is probably coming out of a printer or a<br />
photocopier. In terms of accessible information processing printed-paper output can be a clear print output without<br />
sophisticated lay outing of the text. This makes the text easier to read.<br />
This conversion process is likely to be one output branch of an XML production process that uses XML as its <strong>cen</strong>tral<br />
archiving and interchange format. This would be the output node for one particular format.<br />
5.7 Convert XML to Braille<br />
A common conversion process in accessible information processing is that of a generic XML format to a specialist format<br />
with specific niche user requirements. As many specialist organisations move towards an XML based production process,<br />
this process will become commonplace.<br />
Printed Braille output refers to any hard copy, which represents Braille. Braille can be created through various means.<br />
These means will probably involve a Braille Embosser or any other Braille printer. One should be aware that Braille<br />
printer files are machine dependent.<br />
The transformation itself is usually done automatically by software. It uses a specific transformation style guideline to<br />
convert the XML data into the format, which is fed into the Braille printer. The style guideline must do the pre-formatting<br />
for Braille output. This includes layout information like adding line-breaks and other Braille print specific information. One<br />
of the trickier transformations is the generation of language dependent contractions.<br />
This conversion process is likely to be one output branch of an XML production process that uses XML as its <strong>cen</strong>tral<br />
archiving and interchange format. This would be the output node for one particular format.<br />
70 http://wiki.euain.org/doku.php?id=wiki:processes:conversions:multimedia_to_xml<br />
31
CWA <strong>15778</strong>:2008 (E)<br />
5.8 Convert XML to Large Print<br />
XML to printed paper conversion processes (specific to accessible information processing) would refer to any process<br />
that uses XML data to prepare content for accessible use based on a paper format. This probably refers to large print<br />
representations.<br />
Primarily XML input will refer to part of an XML production stream where XML is used as the core interchange format.<br />
This being the case, it is likely that there will be some pre-processing, as content is rarely created by hand straight into<br />
XML. From XML content can be converted to almost any format, so it is a very good starting point in accessible<br />
information processes.<br />
The transformation itself is usually done automatically by software. It uses a specific transformation style guideline to<br />
convert the XML data into a readable format. The style guideline contains information on font sizes, colours, and also<br />
layout information. High-precision XML to print transformation generally will be done through the use of XSLT or XSL:FO<br />
style guidelines (cf. also below: XML->PDF transformation).<br />
Printed-paper output refers to any hard copy, which represents the information. It is probably coming out of a printer or a<br />
photocopier. In terms of accessible information processing printed-paper output is likely to be a large print representation.<br />
This is likely to be one output branch of an XML production process that uses XML as its <strong>cen</strong>tral archiving and<br />
interchange format. This would be the output node for one particular format.<br />
5.9 Convert XML to HTML<br />
XML to HTML conversion processes (specific to accessible information processing) would refer to any process, which<br />
uses XML data to prepare content for the use within web environments be it a web site or just for offline reading within a<br />
web browser.<br />
Since both the input format and the output format can contain high amounts of structural data or structured information,<br />
we can assume that these processes are quite modern and that they represent good practice in accessible information<br />
processing.<br />
In essence, XML input will refer to part of an XML production stream where XML is used as the core interchange format.<br />
This being the case, it is likely that there will be some pre-processing, as content is rarely created by hand straight into<br />
XML. XML content can be converted to almost any format (using e.g. XSLT or other processing systems such as Stilo-<br />
Omnimark), so it is a very good starting point for accessible information processes.<br />
An output medium of HTML suggests that the content is being prepared for the usage within web environments. This<br />
means that processing means should be aware of the relevant standards for accessible web content (e.g. W3C, WCAG<br />
1.0, WCAG 2.0).<br />
The HTML will possibly be published on the web or possibly as part of a Content Management System (CMS).<br />
This conversion process is likely to be one output branch of an XML production process that uses XML as its <strong>cen</strong>tral<br />
archiving and interchange format. This would be the output node for one particular format.<br />
5.10 Convert XML to structured Multimedia Material<br />
XML to structured Multimedia Material conversion processes refer to a process where for example XML data is<br />
transformed into the PDF format. PDF is a very common output format, it has the advantages that the content cannot be<br />
changed by the user. It is also a format that is platform independent, which means that a PDF document will look the<br />
same on any computer and is not dependent on any specific reading software.<br />
Though PDF output refers to an electronic document format it can also be used within accessible information processing<br />
for preparing documents for Large Print output. These Large Print PDF can either be read on a computer screen or, if<br />
allowed, be also printed out for use as Printed Paper output.<br />
The transformation itself is usually done automatically by software. It uses a specific transformation style guideline to<br />
convert the XML data into a readable format. This implies that processing means should be aware of the relevant PDF<br />
32
CWA <strong>15778</strong>:2008 (E)<br />
transformation (XSL:FO).The conversion style guideline contains information on font sizes, colours, and also layout<br />
information. XSL:FO processing requires additional software called an FO-processor 71 .<br />
In essence, XML input will refer to part of an XML production stream where XML is used as the core interchange format.<br />
This being the case, it is likely that there will be some pre-processing, as content is rarely created by hand straight into<br />
XML. XML content can be converted to almost any format, so it is a very good starting point for accessible information<br />
processes.<br />
5.11 Convert DTP to Multimedia Material<br />
This primarily multimedia input will refer to a complex package of media files, which are held together by one governing<br />
structure. Well known DTP packages currently are QuarkXpress and Adobe InDesign. These documents also have the<br />
most complex formats, which can be used in accessible information processing, and the ability to process these formats<br />
forms the key to integrating accessibility with mainstream workflows and processes.<br />
In the case of Accessible Output from a Multimedia package, guidelines are often needed for describing images and<br />
other multimedia formats which cannot be directly translated into a suitable format.<br />
Usually the PDF output from DTP software is used for submission to the printing house for print output. New production<br />
techniques and also improvements and new features in the DTP software now make it possible to create a basic PDF<br />
output version that can be used as input to other conversion processes such as the conversion of Multimedia Material to<br />
structured Multimedia Material.<br />
PDF as an output format is a very common one and it has the advantages that the content cannot be changed by the<br />
user. It is also a format that is platform independent, which means that a PDF document will look the same on any<br />
computer and is not dependent on any specific reading software.<br />
5.12 Convert Audio to structured Audio<br />
Audio files have been an integral part of specialist formats for some time. Currently all audio streams are available or<br />
easily convertible to digital formats (WAV, MP3, other MPEG based formats etc.)<br />
The conversion process includes the modification of the audio content and adding structural data to the audio content. In<br />
general, this implies cutting large files into smaller ones or producing a list of timing markers so that audio rendering can<br />
start at any desired point in a larger audio file.<br />
Structured Audio as an output format refers to audio content that is fully navigable (headings, chapters, paragraphs). This<br />
can be reached by using relevant standards or frameworks for handling structured audio content such as ANSI/NISO<br />
Z39.86 (DAISY) and NIMAS/DAISY. These frameworks can provide a combination of different output formats. In the case<br />
of DAISY this can be a combination of textual representation of the content combined with audio. A special case is the<br />
addition of time markers on human-read audio files so that they can be rendered in small pieces of audio. This process is<br />
part of the production chain of hybrid (text + audio) Daisy books.<br />
5.13 Convert XML to XML<br />
Primarily XML input will refer to part of an XML production stream where XML is used as the core interchange format.<br />
This being the case, it is likely that there will be some pre-processing, as content is rarely created by hand straight into<br />
XML. From XML content can be converted to almost any format, so it is a very good starting point in accessible<br />
information processes.<br />
XML output will refer to structured content which is either being fed into an XML reader which accesses the content and<br />
the metadata which surrounds that content or the XML is being fed into the next stage of an XML production stream for<br />
further conversion.<br />
In general this process is the core of many compound production processes. XML is being converted in some way. This<br />
could be an adaptation from one XML format to another (e.g. TEI to DAISY) or a change in presentation of an XML<br />
format, possibly through use of XSLT.<br />
71 An example of this is the Altova Stylevision processor, http://www.altova.com/downloadtrialstylevision.html<br />
33
CWA <strong>15778</strong>:2008 (E)<br />
Due to the fact that XML can also be used as storage format it is likely that different publishers may use their own XML<br />
format. For accessible information processing it might be necessary to transform one XML format into another to be able<br />
to apply the follow up conversions into the final delivery format. This might include a conversion from proprietary XML to<br />
an open standard such as TEI or DAISY.<br />
The transformation itself is usually done automatically by software. It uses a specific transformation style guideline<br />
(XSLT) to convert the XML data into another XML format. To be able to preserve all the structural information during the<br />
transformation process it is necessary for the people involved to be aware of the involved XML grammaticism.<br />
Technically this means that both input and output structuring definitions (DTD or Schemas) must be known.<br />
34
CWA <strong>15778</strong>:2008 (E)<br />
6 S<strong>cen</strong>arios introducing accessibility within publishing workflows<br />
The term "fully accessible" or "full accessibility" is always controversial. Accessibility is a mixture of two main factors –<br />
first, the type and degree of the print disability of the user, and second, the combination of "readability" and "navigability"<br />
of the information offered to the user. Some formats are only readable, but hardly navigable and some are both. Some<br />
users may have problems with colours and others with the size of the fonts used, and some with both.<br />
As far as formats go, a pure ASCII file is perfectly readable with most (if not all) adaptive devices, but it is hard to<br />
navigate as there is usually no structure at all, neither visual nor spatial. However, an XML text (or HTML, XHTML, etc.)<br />
are both readable and navigable, as they allow for different degrees of internal structure. Some text formats (ASCII,<br />
marked-up languages and word processor documents) allow for customisation of the characters shown in the screen<br />
(both in size and appearance) and the background colours. Some others may be readable and navigable but not<br />
customisable (like, for example, a tagged PDF document read with Acrobat Reader – here the size of the whole page<br />
can be changed but it does not allow for changing only parts of the text to the size or appearance that may be needed).<br />
According to this pure images can be put on one side of the scale and a DAISY book on the other side – a pure<br />
untagged PDF document is completely inaccessible for most print disable people (as much as a printed book is), while a<br />
full-audio full-text DAISY book includes all possible accessibility features that might be thought of. In between these two<br />
extremes the possibilities are endless. There are proprietary file formats that can create good accessible documents if<br />
only used properly (PDF, Word, etc.).<br />
But making documents accessible for customers does not mean that there is an obligation to satisfy everyone’s needs.<br />
That is never possible, not even with printed books – some people prefer smaller books, other people would like a<br />
slightly bigger font, while yet others prefer glossy to matte paper. To try to address as many customers as possible<br />
(according to economical, social, cultural and personal differences) publishers put in the market different editions of the<br />
same book – hardback editions, paperback editions, pocket editions… and they make as many translations of the book<br />
as needed to sell as many copies of it in as many countries as possible. All these different editions are considered to be<br />
in the same format – printed paper. But, in fact, they are "customised" versions of the same format, and some people<br />
may even consider them to be "different formats". And they all originate in the same electronic file, slightly or largely<br />
modified to meet the special characteristics of a certain edition. So publishers are already publishing the same book in<br />
different "formats" when needed.<br />
Formats that are only "readable" are only advisable if there is no other possibility of producing an accessible version of a<br />
book. These formats are, for instance, pure ASCII files or a continuous wav or mp3 file for an audio book. The book thus<br />
produced can be read, but the reading experience is far from satisfactory. Sometimes, if the book is not the type of book<br />
that is usually read sequentially, plain readable formats are of very little or no use at all.<br />
On the other side of what an accessible book may be stand DAISY books. They can reproduce the experience of moving<br />
and browsing through a printed book for those who cannot read print. Navigation can be taken as far as the word level<br />
when needed and both text and audio (when they are both present) are perfectly synchronised. DAISY books have many<br />
layers of navigability that go from full-audio and full-text (on the top of the scale) to fully structured text-only or audio-only<br />
books. DAISY books can be produced directly from properly created XML files with the appropriate DTD.<br />
35
CWA <strong>15778</strong>:2008 (E)<br />
6.1 S<strong>cen</strong>ario 1 - Delivering XML files<br />
We are an SME operating largely within one large EU country. All of our document design and structuring is out-sourced<br />
to different design agencies, depending on the type of materials we are publishing. We have been asked to provide XML<br />
source files to our national organisation for the blind in order to create accessible versions of the materials. Our material<br />
is largely educational and includes multimedia materials. How can we comply with this request?<br />
In this s<strong>cen</strong>ario the libraries have to be specific in their wishes. E.g. do they want source documents or XML?<br />
Publisher's structures (if not, structures will be lost when the material is exported)<br />
Guidelines need to be produced instructing actors what to ask for in specific situations.<br />
Actors<br />
<br />
<br />
<br />
National Publisher, <strong>Education</strong>al Publishing, Multimedia Publishing, SME<br />
Library for the blind<br />
Service Provider<br />
Conversions<br />
Multimedia to structured Multimedia [5.1]<br />
Structured Multimedia to XML [5.2]<br />
Multimedia to XML [5.3]<br />
XML to XML [5.13]<br />
S<strong>cen</strong>ario description<br />
There are three basic ways to create accessible information in this s<strong>cen</strong>ario:<br />
Convert the Multimedia files (e.g. PDFs) to XML and add then the structural information to the XML<br />
file and deliver those to the Library for the Blind<br />
Edit the the structure within the Multimedia document and convert it afterwards to XML. Deliver the<br />
XML to the Library for the Blind<br />
Convert a multimedia document into a structured multimedia document and then into XML. This can<br />
then be delivered to the Library for the Blind<br />
36
CWA <strong>15778</strong>:2008 (E)<br />
Figure 2 - S<strong>cen</strong>ario 1 Delivering XML files<br />
37
CWA <strong>15778</strong>:2008 (E)<br />
6.2 S<strong>cen</strong>ario 2 - Accessibility enhancement in general<br />
We are a large publisher operating in several markets across Europe and the US. We already have an enterprise content<br />
management system. To what extent can this be made more accessibility compliant so that we can create accessible<br />
output formats for national accessibility organisations in different countries? In some countries we are required by<br />
legislation to provide these materials, in others it is on a voluntary basis, but in each case we need to know how to do<br />
this.<br />
In this s<strong>cen</strong>ario, the national accessibility organisations need to express their requirements of formats and files that are<br />
needed (e.g. sources files or XML). Guidelines need to be produced instructing actors what to ask for in specific<br />
situations.<br />
Actors<br />
<br />
<br />
International Publisher, General, Multimedia Publishing<br />
Large national specialist providers<br />
Conversions<br />
Multimedia to XML [5.3]<br />
DTP to XML [5.5]<br />
DTP to Multimedia [5.11]<br />
S<strong>cen</strong>ario description<br />
Three possible ways:<br />
The DTP files are converted straight into a multimedia format (e.g. PDF) and then into XML, where<br />
the structure and accessibility information are added<br />
The DTP is directly converted into XML and the structure and accessibility information are added<br />
The structure is adden within the DTP document (e.g. through XML tagging). Then these documents<br />
can be converted into XML again<br />
38
CWA <strong>15778</strong>:2008 (E)<br />
Figure 3 - S<strong>cen</strong>ario 2 Accessibility enhancement in general<br />
39
CWA <strong>15778</strong>:2008 (E)<br />
6.3 S<strong>cen</strong>ario 3 - Increasing web accessibility<br />
We are a major scientific and technical publisher. Most of our material is already available online but we do not really pay<br />
attention to accessibility issues other than on a general level. Our customers tell us this is not enough, and much of our<br />
material remains inaccessible. How can we better distribute our materials over the web to our print impaired users?<br />
Guidelines need to be produced instructing actors what to ask for in specific situations.<br />
Actors<br />
<br />
<br />
International Multimedia STM Publisher<br />
Print impaired end users<br />
Conversions<br />
XML to HTML [5.9]<br />
S<strong>cen</strong>ario description<br />
In this case, the Publisher in question has a large body of content which they would like to make accessible on the web.<br />
This is a case where standards are increasingly important.<br />
There are several relevant standards in this case:<br />
WCAG 1.0 (Web Content Accessibility Guidelines 1.0)<br />
WCAG 2.0 (Web Content Accessibility Guidelines 2.0)<br />
There are several relevant organisations in this case:<br />
CEN/ISSS Workshop on Document Processing for Accessibility (WS/DPA)<br />
W3C (World Wide Web Consortium)<br />
WAI (Web Accessibility Initiative)<br />
In order to make the incorporation of these standards as streamlined as possible, it is important to use a CMS system<br />
suitable for the needs of both the target end users and the producing organisation. For non-XML documents the user<br />
might need access to information or transformation platforms to convert those documents also into HTML.<br />
40
CWA <strong>15778</strong>:2008 (E)<br />
Figure 4 - S<strong>cen</strong>ario 3 Increasing web accessibility<br />
41
CWA <strong>15778</strong>:2008 (E)<br />
6.4 S<strong>cen</strong>ario 4 - Accessibility policy<br />
We are a traditional print publisher but we have absolutely no experience in accessibility. We want to train our staff in<br />
making our materials accessible, what do we do?<br />
Guidelines need to be produced instructing actors what to ask for in specific situations.<br />
Actors<br />
<br />
<br />
International Multimedia STM Publisher<br />
Print impaired end users<br />
Conversions<br />
Traditional print to XML [5.4]<br />
S<strong>cen</strong>ario description<br />
This requires a plan to be put in place concerning an accessibility policy within the organisation. It is important that such<br />
a policy is aware of relevant standards, policies and legislation. These training materials are a good start, but<br />
communication with specialist organisations for the blind and visually impaired could also be useful.<br />
From a technical perspective, the possibility of incorporating accessibility within existing XML processing streams should<br />
be investigated.<br />
And for all new materials, publisher should develop author guidelines for digital publishing workflows to incorporate<br />
accessibility needs within the process.<br />
42
CWA <strong>15778</strong>:2008 (E)<br />
Figure 5 - S<strong>cen</strong>ario 4 Accessibility policy<br />
6.5 S<strong>cen</strong>ario 5 - Spoken documents for everyone<br />
We are a major player in the EU Spoken Book market. We are interested in the accessibility area, as we believe this<br />
could strengthen our hold on the market. How do we produce books that are accessible for everyone, especially people<br />
with dyslexia?<br />
Actors<br />
<br />
<br />
EU audio book publisher<br />
National Specialist provider<br />
Conversions<br />
Audio to structured Audio [5.12]<br />
S<strong>cen</strong>ario description<br />
It is likely that the solution to this s<strong>cen</strong>ario is very simple. With the assistance of the national specialist provider in<br />
question, it should be possibly to convert an audio book (WAV or MP3 based) into an audio book which contains<br />
structure for navigation. This would require the use of a standard such as Daisy.<br />
43
CWA <strong>15778</strong>:2008 (E)<br />
Figure 6 - S<strong>cen</strong>ario 5 Spoken documents for everyone<br />
44
CWA <strong>15778</strong>:2008 (E)<br />
6.6 S<strong>cen</strong>ario 6 - Accessible and protected PDFs<br />
We are a major publishing conglomerate. Much of our content is made available in PDF format exclusively. How do we<br />
make PDFs accessible but still protected from unauthorised use? Is there software I can use?<br />
Actors<br />
<br />
<br />
Major Publishing Conglomerate<br />
National Specialist Provider<br />
Conversions<br />
Multimedia to structured Multimedia [5.1]<br />
S<strong>cen</strong>ario Description<br />
Accessible PDFs are a hot item at the moment and as a result there is a lot of activity on the internet:<br />
Adobe Accessibility Resource Centre 72<br />
Creating Accessible PDF Documents with Adobe Acrobat 73<br />
PDF Universal Access working group 74<br />
In this document, PDF is considered to be a multimedia format which is a packaged set of files of different formats<br />
structured into a document. In the conversion process 5.1 a PDF document is converted through tagging of the PDF into<br />
a structured multimedia document. The DRM within these documents can be considered as another element with specific<br />
and often conflicting requirements. The technical requirements for conversion are much the same as those for any<br />
multimedia package.<br />
72 http://www.adobe.com/accessibility/<br />
73 http://www.adobe.com/enterprise/accessibility/pdfs/acro7_pg_ue.pdf<br />
74 http://www.aiim.org/standards.asp?ID=27861<br />
45
CWA <strong>15778</strong>:2008 (E)<br />
Figure 7 - S<strong>cen</strong>ario 6 Accessible and protected PDFs<br />
46
CWA <strong>15778</strong>:2008 (E)<br />
6.7 S<strong>cen</strong>ario 7 - Working hand in hand<br />
We are an organisation providing accessible format materials. We often receive unstructured (or poorly structured)<br />
publisher source files and convert them to (DAISY) XML and Braille. What specifications can we give to publishers so<br />
that they can better structure their source files themselves, as this will make the process easier?<br />
Actors<br />
<br />
<br />
Specialist organisation<br />
General National Publisher<br />
Conversions<br />
Multimedia to XML [5.3]<br />
XML to Braille [5.7]<br />
XML to XML [5.13]<br />
S<strong>cen</strong>ario description<br />
The communication of requirements between publishers and specialist organisations is difficult. In both directions, there<br />
can be complications because of the language used, priorities and different end user requirements. In order to make the<br />
communication process as seamless as possible, it is essential to start this process as early as possible, and build up a<br />
relationship of trust and knowledge of each other’s requirements. International Standards are also essential to ensure<br />
that there is a starting point and other work can be re-used. In order to create a solution that is as wide as possible it is<br />
important to make use of a standardised XML dialect for support of automatic transformation.<br />
47
CWA <strong>15778</strong>:2008 (E)<br />
Figure 8 - S<strong>cen</strong>ario 7 Working hand in hand<br />
48
CWA <strong>15778</strong>:2008 (E)<br />
6.8 S<strong>cen</strong>ario 8 - Accessible design<br />
We are a design agency working for several private and public organisations. They have asked us to build in accessibility<br />
in our products but how do we do this? We understand the principles of "Design For All" and web accessibility but how<br />
does this apply to the documents we design?<br />
Actors<br />
<br />
<br />
Design Agency<br />
Public and private service provider<br />
Conversions<br />
Multimedia to XML [5.3]<br />
XML to structured Multimedia [5.10]<br />
S<strong>cen</strong>ario description<br />
Multimedia documents are converted into XML. The information can then be structured and afterwards converted again<br />
in multiple output formats.<br />
Assuming the service provider is using modern software and modern XML production streams, it should be a case of<br />
adding new outputs to these streams for the major accessibility formats which are described in these training materials.<br />
49
CWA <strong>15778</strong>:2008 (E)<br />
Figure 9 - S<strong>cen</strong>ario 8 Accessible design<br />
50
CWA <strong>15778</strong>:2008 (E)<br />
6.9 S<strong>cen</strong>ario 9 - Accessibility on a large scale<br />
We are a national library and are responsible for archiving vast amounts of digital material. How can we be sure this<br />
material will be accessible? Should we follow some guidelines?<br />
Actors<br />
<br />
<br />
Library acting as a service provider<br />
National Specialist Organisation for the Blind<br />
Conversions<br />
XML to structured multimedia (5.10)<br />
S<strong>cen</strong>ario description<br />
Since there is an archive of digitised information, it is likely that an XML production stream has been used. If this is the<br />
case, then specific formats for impaired users in question need to be added to the output formats. If no XML processes<br />
have been put in place, it is likely that some manual intervention may be required to add metadata to the information in<br />
order to make it suitable for conversion to accessible formats. As a national Library, it is also important that there is a<br />
degree of communication with the National Library for the Blind in your country.<br />
Figure 10 - S<strong>cen</strong>ario 9 Accessibility on a large scale<br />
51
CWA <strong>15778</strong>:2008 (E)<br />
6.10 S<strong>cen</strong>ario 10 - What authors can do<br />
I am an author who publishes my own works/publishes as part of a collective. How do I make sure that everyone can<br />
read my books without me incurring too many costs?<br />
Actors<br />
<br />
<br />
Author<br />
Accessibility specialist<br />
Conversions<br />
Structured Multimedia to XML [5.2]<br />
Traditional print to XML [5.4]<br />
S<strong>cen</strong>ario description<br />
The author defines together with accessibility specialist guidelines that support the author to create<br />
accessible documents.<br />
On the other hand existing print material is converted into XML which can then be exported into<br />
various output formats.<br />
52
CWA <strong>15778</strong>:2008 (E)<br />
Figure 11 - S<strong>cen</strong>ario 10 - What authors can do<br />
53
CWA <strong>15778</strong>:2008 (E)<br />
6.11 S<strong>cen</strong>ario 11 - Repair and adaptation<br />
We are a disability support unit in a major university. Our staff members are experts in specific disabilities and our role is<br />
to support specific learners at the university by ensuring the multimedia content supplied for their learning is accessible<br />
to each learner, providing or recommending appropriate formats to match access modes available to that learner. How<br />
can we adapt learning materials? What are the processes and what standards are available to support the processes.<br />
Actors<br />
<br />
Disability Support Staff<br />
Conversions<br />
Multimedia to structured Multimedia [5.1]<br />
S<strong>cen</strong>ario description<br />
A Disability Support Staff member will work with the learner and the material assessing the adaptations required to make<br />
the material accessible for the learner’s context. The learner’s requirements for the context might be expressed in a<br />
functional description (e.g. IMS Accessibility for LIP 75 or ISO/IEC JTC 1/SC 36 24751-2 Personal Needs and Preferences<br />
Statement 76 ). There may also be environmental context requirements, such as might be expressed in a device profile.<br />
Content produced by the Educator will be in the form of an aggregation, which might be HTML or SCORM or an IMS<br />
Content Package or other aggregation usable by a Learning Management System. A piece of content may itself contain<br />
multiple aggregations and formats such as PDF files, MPEG videos, text, HTML etc. Determining whether the content<br />
matches the functional requirements and what the needs for repair are might involve examination of Metadata associated<br />
with the content (such as IMS AccessForAll Metadata 77<br />
or ISO/IEC JTC 1/SC 36 24751-3 Digital Resource<br />
Descriptions 78 ) or using automated software tools to determine accessibility properties of content.<br />
Appropriate repair assistance tools might output machine and human-readable statements in the language EARL 79 or<br />
properties related to WCAG 2.0 or Section 508 or other such Accessibility standards and also other properties.<br />
Repair might involve disassembling an aggregation (such as IMS Content Packaging 1.2 80 ) into its constituent parts,<br />
providing alternate/supplemental resources and re-assembling. Repair might also involve provision of offline materials<br />
(such as the notes in large font at some specific time) or online services or offline services to accompany the use of the<br />
material.<br />
75 http://www.imsglobal.org/accessibility/#acclip<br />
76<br />
http://www.iso.org/iso/en/CatalogueDetailPage.CatalogueDetail?CSNUMBER=43603&scopelist=PROGRAMM<br />
E<br />
77 http://www.imsglobal.org/accessibility/accmdv1p0/imsaccmd_oviewv1p0.html<br />
78<br />
http://www.iso.org/iso/en/CatalogueDetailPage.CatalogueDetail?CSNUMBER=43604&scopelist=PROGRAMM<br />
E<br />
79 http://www.w3.org/TR/EARL10-Schema/<br />
80 http://www.imsglobal.org/content/packaging/<br />
54
CWA <strong>15778</strong>:2008 (E)<br />
Figure 12 - S<strong>cen</strong>ario 11 Repair and adaptation<br />
55
CWA <strong>15778</strong>:2008 (E)<br />
6.12 Common s<strong>cen</strong>ario requirements<br />
As is shown in the possible workflow for each s<strong>cen</strong>ario, there are common tasks that need to be covered when<br />
introducing accessibility into the publishing workflow. Those tasks include:<br />
<br />
<br />
<br />
<br />
<br />
Definition of document style guidelines that are agreed and disseminated to the actors.<br />
Reference conversion of document’s references into tagged structured formats.<br />
Manual correction of conversion input/output (in the case where content creators did not follow style<br />
guidelines requirements).<br />
Extension of descriptions regarding visual data (images, videos) and other multimedia material.<br />
Subsequent on demand conversion into accessible formats.<br />
6.13 Specific s<strong>cen</strong>ario requirements<br />
In order to make documents accessible for a specific group, specific format conversions need to take place. It is<br />
important in these format conversions that the structure, annotation and metadata and intellectual property rights of the<br />
parent documents will be inherited by the child documents and not be lost. Therefore, attention must be paid to using the<br />
right conversion order and toolkits. One strategy that seems to be the most viable is to use a media rich (including<br />
annotations, and metadata) structured format as reference and then create the desired output operating directly on that<br />
format. An additional advantage of this strategy is that this format can ideally also be used for backups and can be stored<br />
in the publishing archive or also in libraries for long-term archiving.<br />
The next sections will further elaborate the requirements found for the possible s<strong>cen</strong>arios presented above. An analysis<br />
will be provided of the options for document style guidelines, reference conversion into a structured format, how to<br />
minimise manual correction and enhancement of the conversion output, how to extend descriptions of visual data and<br />
other multimedia material and how to convert the reference format into other accessible formats.<br />
7 Application-oriented s<strong>cen</strong>ario implementation<br />
7.1 Harry Potter and the RNIB<br />
Actors involved: International Publisher, General, Printed paper books, Large, International Publisher, General,<br />
Audiobooks, Large, Public blind Service Provider<br />
Conversions: Audio wav to ASCII, ASCII to printed Braille, Multimedia to printed Braille, Multimedia to printed paper,<br />
Multimedia to Multimedia<br />
This case study concerns the simultaneous release of standard and accessible versions of a popular work of fiction in the<br />
United Kingdom. The latest Harry Potter book (Harry Potter and the Half Blood Prince) was brought out simultaneously in<br />
normal print, large print, Braille, audio and DTB. The RNIB produced the Braille version; the publishers produced the<br />
normal print, large print and audio. A third actor produced the DTB.<br />
7.1.1 Print version<br />
Bloomsbury published both the standard print and large print versions of the book simultaneously. The large print version<br />
was produced in 16 point and was of good quality. A number of individuals seemed to encounter significant difficulties in<br />
securing large print copies of the book; this was despite the large print version being mentioned in press releases and<br />
being included on the Nielsen database (the primary tool for booksellers in identifying availability). Potential customers<br />
for large print were encouraged to pursue the issue with mainstream booksellers. The National Blind Children's Society<br />
also provided customised large print versions of the book to children, in a range of point sizes. They waited until the print<br />
version of the book was published and then scanned it. The customised large print service is only available to children<br />
and the print price is heavily subsidised by the charity. As with other accessible versions of the book (see below), the<br />
stated aim of the charities concerned is to provide the ‘same book, at the same time, at the same price to the print<br />
impaired end user. (List price of Standard Print Book £16.99, List price of large print Book £30.00, Price of NBCS book is<br />
£16.99 to individual children)<br />
56
CWA <strong>15778</strong>:2008 (E)<br />
7.1.2 Braille<br />
RNIB were required to undertake and pay for a security audit before the publication date. Once cleared, a representative<br />
from Bloomsbury arrived with the word file of the book on a CD. The CD was loaded onto a computer that was not<br />
networked. The word file was translated into a Braille file and a stock of hard copy Braille was produced. The artwork for<br />
the cover had been secured earlier than the text. The Braille version of the book was available at exactly the same time<br />
as the print version of the book. This was a major achievement, the cost of production was still heavily subsidised by<br />
RNIB. (Price of Braille copy of book: £16.99 available from RNIB)<br />
7.1.3 Analog Audio<br />
The audio rights for the book were owned not by Bloomsbury but by an individual, Helen Nicholl. The actor Stephen Fry<br />
narrated the book (as he had done with previous Harry Potter titles). The book was recorded using industry standard,<br />
professional, recording software (Pro tools/ Sadie). The digital master was then used to produce both a CDDA (Compact<br />
Disk Digital Audio) copy of the book (17 CDs long) and an audio-cassette version of the book. These audio versions of<br />
the book were available 6 weeks after the publication of the print book. This delay was apparently due to the work<br />
commitments of Stephen Fry. According to UK law you can not use a copyright exception to produce an accessible<br />
version of a book if there is already an equivalently accessible version commercially available. For this reason the<br />
Stephen Fry cassette and CD version of the book is the only one available (albeit at an incredibly high price) (List Price of<br />
analog audio £55.00, List price of CDDA £65.00)<br />
7.1.4 Daisy Audio<br />
Daisy audio is deemed a significantly different format to both analog audio and CDDA so RNIB was within the law to<br />
produce the book in Daisy. There were several options available:<br />
Use the electronic file (used to produce the Braille) to produce a synthesised voice version of the<br />
book<br />
<br />
<br />
Use an RNIB narrator to produce a real voice recording of the book<br />
Use the Stephen Fry recording to produce a Daisy version of the book<br />
The advantages of the first two options were that the Daisy version would be available at the same time as the print<br />
version of the book. However as previous Daisy versions of the series had all used the Stephen Fry audio version it was<br />
decided to go for continuity over ‘same day publication’. The rights owner provided a copy of the CDDA version of the<br />
book. This was used to produce wave files of the recording which were then converted into the Daisy book. The Daisy<br />
book is available for loan via the RNIB talking book service. It is also available for sale. The rights owner set an arbitrary<br />
limit of 250 copies (for sale). This will soon be reached and will need to be re-negotiated. Once again the Daisy version<br />
of the book was given the same price as the standard print price copy of the book. It was available for loan and sale, 4<br />
months after the original publication of the print book. (Daisy copy of book from RNIB £16.99)<br />
7.2 Magazine and Newspaper distribution in the Netherlands<br />
Actors involved: Public blind, Public partially sighted, Public dyslexics, National Publisher, General, Multimedia<br />
Publishing, Large<br />
Conversions: ASCII to XML, XML to audio, XML to Braille, XML to printed paper, XML to XML, HTML to XML, Multimedia<br />
to XML<br />
The Dutch Library for the Blind Dedicon converts 37 newspapers automatically to accessible XML formats, making them<br />
available at the same time if not before the printed editions. Magazines still require some intervention by skilled staff to<br />
convert files to a common format, which sometimes causes short delays between print publication and the availability of<br />
accessible versions.<br />
DEDICON, the Dutch library for print impaired readers distributes 37 newspapers and 60 magazines in a special XMLformat.<br />
The website Anderslezen.nl is used as a digital distribution platform, which reduces delay in delivery to a<br />
minimum. Most of the newspapers are available at the same time or even earlier than the printed versions. This is<br />
possible because conversion and distribution of the content is fully automated. Magazines cannot be delivered so quickly<br />
because some of the work still has to be done by hand. Production of the special “Dedicon” XML format means that<br />
delivery times cannot be guaranteed. Nevertheless delays are reasonable.<br />
57
CWA <strong>15778</strong>:2008 (E)<br />
7.2.1 The production of newspapers<br />
Publishers deliver their content in different XML formats. For example NewsMl, NITF, SGML and special internally<br />
developed XML formats. The files are transferred by FTP to a Dedicon server, where they are converted automatically:<br />
The conversion editor/post processor (Dedicon software) or XSLT software converts the file into Dedicon-XML and sorts<br />
the content in a logical order.<br />
A DMD-file (document metadata, Dedicon software) is created. It contains metadata about the publisher, the newspaper,<br />
the date and the size of the file. It supports the transport and storage of metadata to Anderslezen (website and<br />
distribution platform).<br />
<br />
<br />
<br />
<br />
<br />
<br />
The Dedicon-XML is encrypted and placed, together with a CSS and a XDF-file (the CSS supports<br />
the customer's reader software) in an EXD-file (encrypted XML document).<br />
EXD- and DMD-files are placed on the website Anderslezen (distribution platform).<br />
The costumer gets an automatically generated email with an attached (EXD-file) or a link.<br />
The EXD-file is decrypted and the file is opened by the Dedicon-reader.<br />
The costumer can navigate through the document with his mouse or the buttons of his keyboard.<br />
The customer can choose between different output formats: synthetic speech, Braille and large print,<br />
all of them produced locally out of the XML files.<br />
7.2.2 The production of magazines<br />
The content is not only delivered by FTP but also by CD or email. Dedicon receives many different formats, such as: PDF,<br />
Word, QuarkXpress, Indesign and HTML. It is not possible to convert all of those formats automatically.<br />
QuarkXpress is converted by hand in an Apple environment. The employee places the different parts of the content in the<br />
right order by using Textarch software. Automatic export would lead to an illogical content order.<br />
Indesign is converted in a Windows environment. With the assistance of FIX software (developed by Dedicon) the<br />
employee places the different parts of the content in the right order.<br />
The conversion editor/post processor (Dedicon software) converts the content into Dedicon XML.<br />
The employee controls the quality of the Dedicon XML. The content has to be well formed and valid. In case of an error<br />
the employee restores the content by hand.<br />
Any information which is missing and relevant is added to the DMD file (document metadata, Dedicon software) by the<br />
employee. The rest of the process is similar to the production process of newspapers.<br />
7.3 Time Warner and Dolphin Audio Publishing<br />
Actors involved: International Publisher, General, Audiobooks, Large Service Provider<br />
AFB Talking Books re<strong>cen</strong>tly introduced a new technology developed in partnership with Time Warner AudioBooks and<br />
Dolphin Audio Publishing, for the best-selling author James Patterson's new novel, The Jester. The Jester appeared as<br />
an audio e-book in 2003 and included an unabridged CD audio version of The Jester. The audio e-book, basically a<br />
digital talking book (DTB), was made possible through standards developed by libraries for people who are blind from<br />
around the world through the international DAISY (Digital Accessible Information System) Consortium.<br />
DTB technology allows large amounts of textual information and formatting to be stored, transcribed into a variety of<br />
formats, and easily navigated. The EaseReader software developed by Dolphin Audio Publishing synchronises audio to<br />
the text and plays/displays The Jester on desktop and laptop PCs. Readers can display the text of the book on the<br />
screen, fully synchronised with the audio of a professional narrator. Switching back and forth, or “toggling,” between print<br />
and audio versions of the same work, is also possible. Additionally, users can search both the entire text and audio for<br />
keywords and phrases. These features have a particularly broad appeal for travellers and commuters who may wish to<br />
read the text and listen to the audio independently or simultaneously, depending on their environment.<br />
Electronic hardware manufacturers are already responding to the innovation. In the near future, audio e-book technology<br />
will be integrated into hand-held Personal Data Assistants. Additionally, the Consumer Electronics Association is planning<br />
58
CWA <strong>15778</strong>:2008 (E)<br />
to integrate the DTB file format into CD player technology, allowing any CD player to access the audio portion of the<br />
audio e-book. This in itself would mark a significant advance, since an entire book’s worth of text and audio can fit onto<br />
one CD with the DTB file format.<br />
7.4 <strong>Education</strong>al publishing in Austria<br />
Actors involved: International Publisher - General - Printed paper books - SME, Service Provider - Public blind<br />
Conversions: Multimedia to XML, XML to HTML, XML to Multimedia. This case study outlines the co-operation between<br />
Austrian schoolbook publishers and service providers for people with special needs, to make books available in<br />
electronic formats.<br />
7.4.1 Situation in Austria<br />
In Austria, the Federal Ministry for Social Affairs and Generations is providing educational materials like schoolbooks and<br />
other materials for primary and secondary education. Blind and visually handicapped students - and hopefully soon in the<br />
future other print disabled students - can order books in accessible formats.<br />
Publishers, till this project, did not agree on handing over and distributing digital copies of books. The development of<br />
alternative formats starts from printed books with scanning, OCR or, when lots of graphics and/or formal structures like<br />
math are used, with typing. In this process structure was added to the book, headings were defined and lists and other<br />
structural elements were assigned to the text. This was a very time consuming process.<br />
This situation motivated to start a project which addressed the following issues:<br />
a minimum set of structural elements which documents from publishers have to contain to make<br />
them usable for the production of books in alternative formats (e.g. Braille, large print, ebooks)<br />
know-how and handouts for publishers on how to implement structured design with these elements<br />
using standard desktop publishing (DTP) systems (InDesign, QarkExpress)<br />
examples new books and redesigning existing books to learn how to do accessible document design<br />
in practice<br />
training materials, <strong>workshop</strong>s and seminars to transfer the developed know-how to other publishers<br />
and design agencies<br />
a general <strong>agreement</strong> which gives the right of transferring books in electronic format to students with<br />
disabilities<br />
a Document Rights Management System including to prevent the data to be misused in practice<br />
a workflow for the co-operation between schools/teachers, service providers, publishers and the<br />
ministry.<br />
Publishers were interested to take part as a) the new anti discrimination legislation will ask for accessibility of school<br />
books and b) they experience general problems in the publishing process when they want to use sources for different<br />
publishing purposes (e.g. print, online, CD, audio/multimedia). This convergence of interests led to a strong partnership<br />
for the project named.<br />
7.4.2 "Multi Channel Publishing"<br />
Five publishers take part in the project. Each of them is responsible for the designing or redesigning one of their books<br />
based on a predefined set of structural elements. This basic structural design defined in the project guarantees that the<br />
electronic version of the book can be used for the production of alternative formats. An analysis of the publishing process<br />
at publisher’s sites showed that service providers can only start from the final print ready version as the content, which is<br />
approved by public authorities, changes till this point. This final version today is most of the time a PDF generated from a<br />
DTP Tool (e.g. Adobe InDesign or QuarkXPress). Due to this, if electronic sources should be usable for services<br />
providers, structured design has to be implemented into the DTP work.<br />
7.4.3 Definition of structural elements for electronic versions of books<br />
To be able to collect the data of the source document and convert it into a XML File, we used the element set of the TEI-<br />
Standard, in particular the TEI Lite DTD. The TEI’s Guidelines for Electronic Text Encoding and Interchange were first<br />
published in April 1994. This set of metadata is widely known by publisher and guarantees compatibility or convertibility<br />
to other definitions in use like Daisy [Daisy 06]. Using TEI keeps the process close to the upcoming XML database<br />
59
CWA <strong>15778</strong>:2008 (E)<br />
schemes which publishers might use in the future using database structures for processing their documents. The TEI Lite<br />
DTD still consists of over 120 Elements for the tagging of books, most of them important for librarians. To simplify the<br />
work for all participating parties, a subset of those elements was selected. This subset consists of structural elements<br />
which are of general importance for structured document design and automatic content processing. This subset does not<br />
ask for special knowledge of accessible versions but can be seen as the basis for structured document design in general.<br />
Using this subset guarantees that the sources (or PDFs) can be used as a starting point for the production of accessible<br />
versions. In general this sub-set of the TEI Lite DTD comprises structural metadata elements for:<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
Headings<br />
Divisions / Subdivisions<br />
Images<br />
Tables<br />
Notes<br />
Page breaks<br />
References<br />
It also comprises administrative metadata elements (e.g. Edition, Year of Publishing, Author(s), Publisher, ...). The<br />
experience in the project showed, that this D.T.D. Subset is sufficient to structure the content of the schoolbooks.<br />
Publishers after a short training were able to do the work by themselves. This subset also proved to be in accordance<br />
with new publishing systems based on X.M.L. databases.<br />
7.4.4 Authoring Tools<br />
After the definition of the XML DTD, knowledge was developed how the authoring tools could support the efficient<br />
marking-up ofdocuments in the right way during the layout process. Further on routines for exporting the defined<br />
structure and layout data into XML were developed. The two most widely used authoring tools were examined in detail:<br />
7.4.5 Example Books<br />
The post-processing tasks are necessary, because, as mentioned before, the exported files in some cases have no<br />
structure and there are also parts of some books that could not be exported (e.g. graphics, made in the authoring<br />
systems). The post-processing tasks were:<br />
<br />
<br />
<br />
Adding Structure to the XML<br />
Revise elements, that were not exported properly<br />
Describe Images<br />
The result after the completion of the work is a valid XML version of the book. The next step is to convert the XML via<br />
style sheets into the target format. The style sheets for the conversion are freely available on the internet. They allow to<br />
convert the XML file into a HTML file with one/multiple pages and also to convert the XML file into a PDF file.<br />
7.4.6 Training materials, seminars and <strong>workshop</strong>s<br />
Training materials have been developed which are now used in <strong>workshop</strong>s and seminars to transfer the knowledge to as<br />
many publishers as well as design agencies as possible.<br />
7.4.7 DRM-System<br />
To make sure that the books are not used outside the designated user group a DRM System was customised. The<br />
system consists of a secure-reader-software and a USB dongle, which acts as the key. Every student gets a key and the<br />
software. The key has a code, which allows the student to read the book if the key is plugged into the computer. This<br />
system has the advantage that the user is not bound to one specific computer or piece of hardware. He can read the<br />
book for example at school but also in a learning group or at home. How the students get their books and a detailed<br />
workflow between publishers and the service providers is described in the next paragraph.<br />
7.4.8 Workflow<br />
To start the process, a teacher of a student with special needs orders a book in an accessible format. If the schoolbook<br />
service provider does already have the book in stock, it will be provided directly to the student. Otherwise, the service<br />
provider asks the publisher for the electronic version of the book. The publisher sends his TEI-XML file to the service<br />
60
CWA <strong>15778</strong>:2008 (E)<br />
Provider. The service Provider produces the accessible version of the book. Printed (Braille/enlarged) copies are sent by<br />
standard mail. If an electronic document is ordered, the service provider encodes the files with the DRM system using<br />
the data from the student’s USB dongle. The book is placed on a server, where the student can download the book.<br />
When the student has the reader software installed and the dongle plugged in, he can open the book and read it.<br />
7.4.9 Agreement between Publisher and Service Provider<br />
To ensure that the process works efficiently, an <strong>agreement</strong> between publishers and service providers has been drafted.<br />
The core articles of the <strong>agreement</strong> are:<br />
<br />
<br />
<br />
<br />
The publishers provide their electronic source documents<br />
It must be ensured, that the books are only given to people with a special need<br />
A DRM system must be used therefore<br />
It must be a "closed" system with registered users<br />
The <strong>agreement</strong> will be signed by every publisher and service provider. If a service provider needs a book from a<br />
publisher he can ask for it under the condition of the framework <strong>agreement</strong>.<br />
7.4.10 Conclusion<br />
The most important result of the project is the fact that handing over digital copies of print published documents is<br />
guaranteed in the future.<br />
The project showed that it is technically feasible to create XML versions of books by using the print ready version of a<br />
document. The experience also showed that the quality of the XML after just using the functions provided by the<br />
authoring tools is not good enough. A lot of work has to be done afterwards by cleaning and revising the XML document.<br />
The people who are performing this work will have to have some basic XML skills. It will also be a challenge to convince<br />
the publishers to create documents that can be exported into XML without a lot of additional effort. In some areas at the<br />
moment there are only limited possibilities to sources from publishers, especially in areas, where books consist mainly of<br />
pictures, graphics and other visual content. Another challenge is the integration of non-text content like mathematical or<br />
chemical expressions.<br />
The project made obvious that all publishers pass their layout data to the print office by using PDF. An important task for<br />
the future will be the development of a program to allow authoring systems to create PDF files that are either accessible<br />
or allow a conversion back into a useful format.<br />
In any case these are only first, but important steps towards multi channel publishing. More work is needed for a more<br />
efficient production of different versions of one source document.<br />
7.5 Best practice for distributing accessible content<br />
The “ideal accessible information network” could be a structured and collaborative network of organisations producing<br />
accessible information for print disabled people. To improve its efficiency, it should be a technical network with<br />
normalised tools and practices, with technical experts to keep working with publishers for innovating solutions.<br />
Reliable technical solutions should be set up to distribute accessible electronic documents to print disabled people.<br />
These solutions must guarantee intellectual property rights without restricting access to information. This ideal accessible<br />
information network should be a trust network, where the actors are well identified and work responsibly and accountably.<br />
Exchanges with publishers should be enhanced and organised by trusted intermediaries to discuss intellectual property<br />
rights and structured files provision. These intermediaries should be legally acknowledged as public authorities.<br />
Publishers should be provided with clear specifications on the file formats they can provide to trusted intermediaries. If<br />
necessary, guidelines or tools can be supplied to help publishers in integrating accessibility in their production chains.<br />
Publishers should introduce accessibility in the contract they sign with their subcontractors and service providers. They<br />
should also guide their authors in creating structured information using the prescribed authoring tools.<br />
61
CWA <strong>15778</strong>:2008 (E)<br />
When projects on electronic products are launched, accessibility should be introduced in the functional specifications and<br />
considered in the project design and realisation. 81<br />
7.5.1 Current examples of good practice<br />
These examples of current good practice cover the following areas:<br />
<br />
<br />
<br />
Technical books<br />
School books<br />
Electronic books<br />
7.5.2 Technical books<br />
These are examples of best practice in distributing accessible content in the form of technical books.<br />
7.5.2.1 O'Reilly Media Inc.<br />
Technical books and articles have changed considerably with the advances of new information technologies. The<br />
example of O'Reilly Media Inc. illustrates how technical information can be disseminated in both paper books and<br />
electronic documents. O'Reilly media was originally a technical writing consulting company. Today it has become one of<br />
the most famous publishers of books for software developers with its iconic “animal books” and “In a Nutshell” references.<br />
O'Reilly publications activities are not reduced to paper books; it offers many on-line services as its “Safari Books Online”<br />
82<br />
service, a web-based subscription service that offers a searchable reference library of computer books from different<br />
publishers. This on-line library allows subscribers to search across more than 3,000 books, parts or entire books can be<br />
read on-line, the catalogue can be browsed by category. Chapters of books can be downloaded for viewing off-line.<br />
O'Reilly has published a number of Open Books – books with various forms of “open” copyright. 83 These books can be<br />
out of print or written by authors who thought that their books had to be widely distributed under a particular open<br />
copyright. Through its Open Library project, the Internet Archive 84 is scanning and hosting PDF versions of O'Reilly open<br />
books.<br />
A number of the open books are also available as HTML or PDF e-books on the O'Reilly web site. These documents are<br />
structured in chapters, sub chapters and contain table of contents to access information. 85<br />
To create such a variety of products and services around paper books, O'Reilly has set up a complete publication<br />
process starting from authors to the final products.<br />
Authors are provided with very strict guidelines for the final book submission. The approved formats are: - Microsoft Word<br />
for PC or Mac, tagged to O'Reilly's paragraph and character style template, - XML tagged according to the DocBook Lite<br />
DTD, - Adobe FrameMaker tagged according to the paragraph and character style tags in O'Reilly's templates. Once the<br />
final draft is submitted by the authors and properly reviewed by a technical committee, the book is prepared for print by<br />
O'Reilly staff. Illustrations are re-done by graphic artists, the cover is designed.<br />
The camera and press ready material is then produced by the production group late in the production process. Adobe<br />
FrameMaker is used to prepare the final document that will be sent to press. 86<br />
81 http://wiki.euain.org/doku.php?id=wiki:distribution:distributing_content:best_practice<br />
82 http://safari.oreilly.com/<br />
83 http://www.oreilly.com/openbook/<br />
84 http://www.archive.org<br />
85 An example of an HTML open book can be found at:<br />
http://www.oreilly.com/catalog/debian/chapter/book/index.html<br />
86 http://www.oreilly.com/oreilly/author/<br />
62
CWA <strong>15778</strong>:2008 (E)<br />
This particular example shows a publisher who has created a rationalised multichannel publication process. Both paper<br />
books and digital publications are created from the authors submissions. This process is well documented. It involves<br />
authors in the early stages guiding them through the edition. Even if it takes place in a favourable context (authors are<br />
computing scientists and O'Reilly is specialised in technical publication), this example can be considered as encouraging<br />
for structured content publication and leading to accessibility.<br />
7.5.3 School books<br />
These are examples of best practice in distributing accessible content in the form of school books.<br />
7.5.3.1 Bordas-Nathan electronic school books<br />
The French publishers Bordas 87 and Nathan 88 are major school books publishers in France. In addition to a very large<br />
catalogue of paper books for pupils, teachers and pedagogues, these publishers create multimedia products as web sites<br />
and CDROMs for teachers and pupils. Since 2000, Bordas and Nathan have been also actively involved in electronic<br />
schoolbags projects.<br />
BrailleNet 89<br />
has studied the accessibility of an electronic History and Geography book for 15 years old pupils published<br />
by Bordas. The electronic book content is based on the paper version with extra multimedia documents as Macromedia<br />
Flash animations or audio and video sequences. The aim of this project for the publisher was also to explore<br />
multichannel publishing to deliver information to different target devices as personal digital assistants (PDA) and third<br />
generation cell phones.<br />
The final version of the electronic book for computer is a thick client application based on a modified version of Mozilla: -<br />
the book's static content is encoded in XHTML, - the user interface is described with XUL, - and the major part of the<br />
application mechanisms are developed in JavaScript. The electronic book for computer offers the following functionality: -<br />
read textual content, view pictures, play video and audio documents, - search content in the whole book using an<br />
integrated search engine, - browse the table of content of the book and move directly to a given part, chapter, lesson<br />
thanks to encoded links, - annotate images using a minimal editing toolkit containing a brush and a colour selector, -<br />
create and edit XHTML content for pupil’s personal homework.<br />
BrailleNet carried out accessibility tests with common assistive technologies such as screen readers and magnifier<br />
software. The results were negative because of compatibility issues between Mozilla and the screen readers. Even if<br />
these technical problems could have been solved, the use of Macromedia Flash was also another important barrier to<br />
access.<br />
However the book's high level of structure and the large amount of semantic information added to it was a good<br />
opportunity to improve its accessibility. It was decided to go deeper in the study, the publisher let BrailleNet access the<br />
build chain of the application and develop solutions to create an accessible version of the book.<br />
Both XHTML content and XUL content are automatically built from XML data. The source of the whole application is<br />
contained in a unique XML document following a DTD developed by the publisher. This DTD is divided in two parts: - a<br />
first subset is common to every electronic book produced by the publisher, it is close to XHTML, and describes<br />
information as paragraphs, divisions, and images; - a second subset is specific to a given book. It defines the grammar<br />
and the vocabulary of the book: which containers are used, how they can be nested. This part was particularly interesting<br />
because the publisher chose to structure information in a very semantic way in order to output media-specific structures<br />
later in the production process.<br />
The publisher provided BrailleNet with this DTD and the XML source file of the application. It was interesting to note that<br />
the DTD already contained the necessary structures to add textual alternatives to images. However, all these alternatives<br />
were empty in the XML document. But all the documents (textual documents, illustrative pictures, video and audio) were<br />
provided with a textual legend introducing and describing the content.<br />
87 http://www.nathan.fr/Multimedia/cartable/default.asp<br />
88 http://www.editions-bordas.com/<br />
89 http://www.braillenet.org/<br />
63
CWA <strong>15778</strong>:2008 (E)<br />
With the publisher's DTD and XML source files, BrailleNet was able to develop a set of XSL style sheets to convert the<br />
XML source document into XML dtbook and then to an accessible XHTML book where a user can easily navigate with<br />
tables of contents, many internal links, textual alternatives to images etc.<br />
This example is encouraging for the integration of accessibility in the publishing industry. It shows how a publisher was<br />
led to add structure to its content to create new innovative products for its customers. It also shows how easy it was to<br />
create an accessible version of the electronic book from the XML source files of the publisher. This demonstrates the<br />
technical convergence of publishers and accessibility in electronic documents, especially in the case of multichannel<br />
publishing. This technical convergence is also illustrated by the example of a mathematics book from the same publisher:<br />
it was decided that mathematical content will be stored in MathML instead of images in order to be able to convert them<br />
later in the building chain if necessary.<br />
7.5.4 Electronic books<br />
These are examples of best practice in distributing accessible content in the form of school books.<br />
7.5.4.1 Numilog<br />
Numilog is an electronic bookseller based in France and selling books for both French-speaking and English-speaking<br />
markets. The Numilog website proposes a large catalogue of titles for its customers (more than 23,000 available titles).<br />
Customers can choose the format they prefer between: - Mobipocket format (PRC) readable on a computer with<br />
Microsoft operating system, Palm, Pocket PC and smartphone; - Microsoft Reader format (LIT) also readable on all the<br />
above platforms; - PDF for PC and Mac. The catalogue is composed of novels, books for children, documentaries, nonfiction<br />
books (computing, management, biology law, economics, ...) and dictionaries. 90<br />
7.5.4.2 Relations with publishers<br />
Numilog has business relations with publishers to negotiate the rights to distribute their books on the Internet, the selling<br />
price and eventually source file provision. English and American publishers set the price Numilog must pay to obtain the<br />
rights to sell electronic books from their catalogue. Then Numilog is free to decide the selling price and so its profit<br />
margin. The French publishers collaborating with Numilog have decided to apply the policy they use with conventional<br />
book distributors: book prices are set by the publishers themselves. This follows the French law about book prices, called<br />
“loi Lang”. This law was passed in 1981 and prescribes that anyone publishing or importing a book has to define its<br />
selling price. This price must be respected by all distributors. This law does not include electronic books but the<br />
publishers have decided to apply it to this particular case. This means that Numilog has to negotiate with publishers the<br />
selling price in order to keep a de<strong>cen</strong>t profit margin. This price of the electronic version is always lower than the price of<br />
printed books. Most of the time, Numilog has to pay to obtain the electronic files from publishers, and the price fluctuates.<br />
Publishers almost always provide Numilog with files of their books. 90% of these files are prepress files in PDF, Quark<br />
XPress or Adobe inDesign. PDF files are always optimised for press, with cutting lines and very large pictures. The other<br />
10% are author’s files, usually in Microsoft Word. Sometimes Numilog must digitise books because publishers cannot<br />
provide digital files.<br />
Numilog has chosen Adobe Content Server 91<br />
to secure eBook distribution. Content Server is a Web-based system for<br />
publishers, distributors, libraries and booksellers. It automates the supply chain for eBooks and other media by providing:<br />
- an interface for eBook publishing, distribution and procurement; - a way to manage and protect digital rights; - a secure<br />
repository with encryption of eBooks and authentication of transactions; - a business to business transactions model,<br />
including selling eBooks to clients and procurement from vendors; - a lending model for online libraries ... Numilog has<br />
become an Adobe partner for this particular product and can sell li<strong>cen</strong>ces and offer to host this service for a customer.<br />
7.3.3. eBooks preparation<br />
Numilog has set strict quality requirements for eBooks: - file size must be optimised (finding a good ratio between quality<br />
and file size); - all the eBooks must have a cover page; - file textual content must not be provided as images (textual<br />
electronic books only) so that content can be searched, magnified, selected (but not copied for security reasons); -<br />
technical books must be structured and offer a convenient way to access information quickly; - footnotes and endnotes<br />
must be hyper-textual links as often as possible; - external references are checked to avoid broken links. To prepare the<br />
eBooks, Numilog employs people to rework the files provided by publishers.<br />
90 http://www.numilog.fr/<br />
91 http://www.adobe.com/products/contentserver/index.html<br />
64
CWA <strong>15778</strong>:2008 (E)<br />
7.5.4.3 Conclusion<br />
This experience shows again a convergence of interests between the publishing world and the accessible information<br />
world because Numilog's eBook quality requirements are close to accessibility requirements. Numilog faces the same<br />
issues the accessible information world currently faces, publishers files are not structured for electronic publication, and<br />
the additional costs to make structured information from unstructured files is prohibitive.<br />
65
CWA <strong>15778</strong>:2008 (E)<br />
8 Identified gaps and areas for further research<br />
In Section 1 we asked two fundamental questions:<br />
<br />
<br />
How do we describe the questions and problems of end users and content providers?<br />
How do we marry both within manageable and consistent frameworks?<br />
This <strong>workshop</strong> has sought to provide some pointers towards answering these questions. As can be seen, there are many<br />
ongoing initiatives and projects which seek to incorporate accessibility within publishing processes. Largely through<br />
necessity, most of these initiatives have been undertaken by specialist organisations supporting print impaired people. In<br />
some instances, and sometimes in collaboration with publishers and other content providers, this has resulted in<br />
innovative practices and far greater access to information.<br />
However, progress has been fragmented and often very slow. The provision of alternative format materials varies greatly<br />
from country to country (and within national boundaries), according to local conditions and according to the economic<br />
vagaries of provision and depending on different types of impairment. These problems are well-described elsewhere.<br />
8.1 Descriptions & Requirements<br />
Considerable work has been done by specialist organisations in establishing end user requirements and a number of<br />
preferred output formats are well-established. There is less consistency, however, in relating these requirements to reusable<br />
models for use within content processing environments. Combined with a historical tendency to separate<br />
alternative format production from mainstream production and to focus on separate specialised formats, this has lead to<br />
a fragmented approach to implementing accessibility within both specialist and mainstream processes. Similarly, there is<br />
very little consistency in the work undertaken by content providers in this area and there is a corresponding<br />
fragmentation of effort. Such work that has been undertaken tends to focus on web authoring issues and this only<br />
addresses one part of the accessible content processing chain. Indeed, an important outcome from this <strong>workshop</strong> is the<br />
realisation that generic processing models are required for this work to proceed with any degree of coherence. 92<br />
8.1.1 Further Research on Descriptions & Requirements<br />
Systematic descriptions of end user requirements remain problematic. Such requirements are constantly changing and<br />
there has been little effort made to capture the dynamic nature of these requirements.<br />
Further research s required to establish requirements for different types of impairments and to compare and examine<br />
where points of intersection might lead to collaborative efforts.<br />
8.2 Process & Content Modelling<br />
This <strong>workshop</strong> has sought to provide information about different s<strong>cen</strong>arios and to point towards ‘real-life’ examples which<br />
have proven successful. The motivation for content providers to create accessible information will always vary but there<br />
is a clear need for generic processing frameworks which can make this as straightforward as possible.<br />
It appears unrealistic to expect that any one format be accepted by all stakeholders and universally applied. Multimedia<br />
content processing can involve many different types of software and many different processes, thereby making it very<br />
difficult indeed to introduce accessible content processing at the right stages. No single input, representation or output<br />
format can contain these complexities. Given the general move towards distributed media, it is perhaps better to focus on<br />
building frameworks which enable accessible content processing, according to the local preferences and requirements of<br />
all the stakeholders. In short, accessibility is not a format or a product: it is a process.<br />
Our modern use of multimedia information requires that information and services accommodate different presentations<br />
and interaction designs design at the user interface level, on the basis of requirements that include user needs,<br />
preferences, personalisation, customisation, adaptation and constraints; characteristics of the tasks to be performed (e.g.<br />
repetitive, knowledge-intensive, collaborative); capabilities of available access devices; and contextual information. In<br />
92 See for example, the ongoing work of the ProAccess project (http://proaccess.euain.org), co-ordinated by<br />
the Italian Publishers Association and supported by the Federation of European Publishers.<br />
66
CWA <strong>15778</strong>:2008 (E)<br />
providing this level of usability, a fundamental accessibility can be achieved which allows users to interact with content at<br />
a much deeper level.<br />
This work may be at a relatively early stage, but most importantly it is a mainstream endeavour: the modelling and<br />
aggregation of content is a <strong>cen</strong>tral concern for all those in the private and public sector.<br />
The ever-increasing complexity of dealing with information from structured and unstructured media (images, sound, text,<br />
recordings, etc.), in several working modalities and in multitasking modes, makes adapting to context and content a<br />
necessity. Additionally, what is becoming more and more important is a universal, scalable, adaptive and customisable<br />
multidimensional interface to media content where appropriate media viewpoints/perspectives can be presented to the<br />
users adapted to their preferences, workflow constraints, and interaction models.<br />
Existing approaches to user-content interaction are characterised by a lack of a holistic view to the complex problem of<br />
designing accessibility of interaction and content and they fail to look at the process of authoring, managing and<br />
delivering the content as being highly inter-woven. Also, the majority of these approaches either look at accessibility,<br />
personalisation or context of use problems; they do not deal with the more complex issue of user interaction and content<br />
presentation as a whole and the open-ended and frequently changing real world environments.<br />
The various MPEG family members operate at different abstraction levels with some communication between these<br />
abstraction levels. The process of contriving a procedure to interface the various processing levels should be based on<br />
use. The difficulty lies in achieving a level of description of the user requirements that allows re-description in<br />
technological terms. This re-description ideally leads to specifications and ultimately implementations. These<br />
implementations ‘prove’ the viability of the concept: it is the proof of the hypothesis. The process of standardisation that<br />
runs in parallel with this ensures extraction of higher level descriptions and these are aggregated down to the earlier<br />
family members. Using this built-in feature to provide ‘slots’ for common and specialised accessibility requirements would<br />
create what we refer to as accessibility from scratch (see above). If embedded in the family tradition of the MPEG<br />
initiative, accessibility might become a commonly available feature instead of a workaround necessity.<br />
The representation of the interplay between the various user groups should always remain accessible. If all relevant<br />
entities in a representation system remain accessible, creating meaningful mappings is a matter of connecting the<br />
appropriate entities. For this reason, accessibility from scratch is of fundamental importance.<br />
8.2.1 Further research on process and content modelling<br />
There is a need to develop open source frameworks to bridge the gap between original content design heuristics and<br />
intuitive multimodal interfaces required for content and communication systems.<br />
Such frameworks would build in profile-based access to information, content and services, which not only bring together<br />
and extend state-of-the-art technologies for information access, but also conform to standards and guidelines available<br />
for accessibility, usability, scalability and adaptability.<br />
There is a need to conduct basic research to establish the nature of the interaction between people with cognitive<br />
impairments and multimedia information. A critical and guiding factor is that the supply of information should be<br />
determined by the end user from a <strong>cen</strong>tral content reserve, thereby allowing the end- user the freedom to explore the<br />
information as they see fit and to make their own choices regarding how the information is to be displayed, rather than<br />
through the sometimes discriminatory filtering processes of information gatekeepers.<br />
8.3 Introducing and using metadata for accessibility purposes<br />
People compress information. People decompress information. The compression procedure involves filtering out<br />
redundant information based on the perspective of the user. How do we decide which redundant data entities are<br />
relevant for the user? What to use? On what requirements are these redundant data entities based? Whose<br />
requirements? How do we marry the existence of these accessibility metadata entities with the requirements as<br />
described in “common” metadata entities? More importantly, how do we ensure a synchronised and therefore valid<br />
coupling between any kind of content with these metadata entities? How do we ensure that any metatags themselves<br />
remain accessible? What is the context of any accessibility metatags that are to be conceived?<br />
How then can we make sure that the context remains consistent? If we describe the knowledge that is applied to enable<br />
processes to exist in a digital system that parallels analogue organisational systems, knowledge is transferred from the<br />
individual participants to a shared information framework. The use of knowledge can be separated into three parts: the<br />
body of information that is contained inside knowledge structures; static information about the knowledge processing,<br />
which is also known as meta-information or metatags; dynamic information that is used to describe the processes and<br />
67
CWA <strong>15778</strong>:2008 (E)<br />
procedures to retrieve, transform or use the content. By introducing metatags that aim to address the needs for<br />
accessible information processing, it is mandatory to describe the procedures that will meaningfully interpret these meta<br />
tags to communicate the content in a way that enables every person to appreciate the content. Creating meaningful<br />
mappings between the static redundant information -the meta tags- and the dynamic processes<br />
Many people believe structure to be static: from a meta modelling perspective this is not the case. It is well known that if<br />
the representation of the information at hand is perceived by the system and mapped onto a framework, the information<br />
is then usable in a multitude of ways: and for this reason non-programmers will often promote the use of XML.<br />
However, this mark up and the set of tools that surround it are simply a set of tools which exist to achieve this objective. If<br />
the architecture of the system does not answer the wider range of needs, requirements and questions, the mark up<br />
cannot paper over the cracks. In order to build extensibility into a system, the architecture should be such that every<br />
element used for processing the information is adaptable. This can be achieved by building a representation layer which<br />
builds an object oriented structure from the information and which is free to adapt the meta relationships and hierarchies<br />
intrinsic in that data genus. This is defined by identifying the parameters upon which the structure is built, and ensuring<br />
they are interconnected in such a way that promotes future adaptability without degrading the system: which is to say,<br />
using the right parameters for accessible information processing.<br />
As noted in 8.1.1 above, the goal should be to anticipate the changes in user requirements. These changes can occur in<br />
the very nature of the requirements, such as new functional groups or in the definition of the existing requirements, such<br />
as additional details. These aims should be pursued by adding redundant information in the form of meta tags, thus<br />
augmenting the quality of the content. The content itself and the existing meta tag structures, including their mapping to<br />
the meta modelling domain, is not allowed to change. From a meta modelling perspective, this allows us to meet<br />
changing requirements for the future, because if the requirements demand additional detail in the form of features or<br />
metadata, we can unveil the metadata that is available.<br />
8.3.1 Further research on introducing and using metadata for accessibility purposes<br />
Further research is required to identify and investigate the ways in which metadata can help achieve efficient and futureproof<br />
solutions to accessibility.<br />
In order to make this perceived information useful, it must be represented within an architecture which allows the<br />
accessibility requirements to be questioned in more than one way. Such architecture must enable both the core system<br />
to adapt to new and changing representation requirements, and to allow (theoretically) infinite user requirements.<br />
8.4 Standards and personalisation of content<br />
Personalisation of media makes several demands on standards. In particular, it requires:<br />
that the modalities in content are identified<br />
that the modalities or adaptations of them that a user requires in the context are identified<br />
that the glue standards that enable these things to work exist<br />
We perceive the world partly by using our senses. Modalities are the aspects or components of media or system<br />
interfaces that correspond with those senses and enable us to perceive them. For example a video usually has a visual<br />
aspect or component and this corresponds with the sense of sight. Without at least some match between the modalities<br />
available in the media and the senses a user has available at the time or access modes a user can use there can be no<br />
perception or use of the media. Therefore it is very useful if the modalities available in media or provided in an interface<br />
can be described. Doing so permits matching to the modalities a user has available or the authoring of adaptations to<br />
enable that matching.<br />
There is not a simple precise match between modalities and senses because some senses are used in complex<br />
interactive ways – for example the sense of sight and reading – but there is a general broad matching.<br />
Traditionally, ways to describe media on computers has developed in ways suited to the needs of computer design and<br />
not so well suited to use by people. An example would be the Multipurpose Internet Mail Extensions standard for<br />
describing media formats (MIME-types) with email but widely used for other purposes. This does not describe modalities.<br />
Only now are appropriate standards supporting modality description emerging.<br />
ISO/IEC JTC 1/SC 36 Committee for Information Technology for Learning <strong>Education</strong> and Training has<br />
produced a standard for description of access modalities and adaptations for those. This is the Individualized<br />
68
CWA <strong>15778</strong>:2008 (E)<br />
Adaptability and Accessibility in E-learning, <strong>Education</strong> and Training. 24751. This was initially developed in IMS<br />
and then internationalised in ISO. It is planned to be publicly available at the end 2007.<br />
The standard provides for description and matching of resources to personal contexts/requirements. It has<br />
three parts – a Framework that shows how to use the other parts, A standard for description of Digital<br />
Resources (Digital Resource Description DRD) and a standard for description of functional learner<br />
requirements Personal Needs and Preferences (PNP) that enables matching of resources and adaptations to<br />
user requirements and specific contexts. Currently the vocabulary for modalities within the standard can be<br />
used to describe the modalities of visual, textual, auditory, tactile and olfactory. Vocabularies for description of<br />
adaptations for these (for example audio description) are more extensive as these extend to representation<br />
forms used on computers and with assistive technology. Several new parts to the standard are under<br />
construction including parts for description of requirements and components for offline media and services<br />
and and for places and events.<br />
8.4.1 Further Research on personalisation of content<br />
In order usefully to use the modalities within media to match to or adapt to user requirements a few glue standards are<br />
needed. This is an area of active development in the standards.<br />
To make it all work there is needed the development of best practices with particular media. This CWA presents some of<br />
those best practices. Even more progress towards providing truly personalisable media could be obtained with the<br />
development of more formally-described practices. Ontologies describing the use of different media components across<br />
different media types would be very useful here. For example, most media types contain alternatives that could be<br />
described and matched to a context as described above, but the media types all do it different ways. What is needed is<br />
some way to operate across media types with the same principles.<br />
IEEE Learning Technology Standards Committee is developing a standard currently called Resource Aggregation Model<br />
for Learning, <strong>Education</strong> and Training. This work is constructing a standard ontology that tackles this cross-media issue,<br />
though the precise requirements for adaptation for accessibility have not yet been addressed in that work. The work can<br />
be found on http://www.ieeeltsc.org/working-groups/wg11CMI/ramlet/<br />
Alternatives within media types are further described elsewhere in this CWA.<br />
8.5 Li<strong>cen</strong>sing and technical protection measures<br />
It is evident that neither at international level nor at European level is there any requirement to provide exceptions to<br />
copyright protection facilitating access to protected materials by visually impaired people. Further, the measures on anticircumvention<br />
of technical protection measures introduced by the WIPO Copyright Treaty (WCT) and the WIPO<br />
Performances and Phonograms Treaty (WPPT) in 1996 are not matched by any provision to accommodate exceptions to<br />
copyright protection. The European Union Copyright Directive does address this issue in article 6.4.1 but gives no<br />
indication how the conflict is to be resolved nor does it require resolution of the conflict for content made available via<br />
interactive digital services.<br />
The legislative provisions found at national level are diverse. The reasons for this state of affairs include the following:<br />
The variety of digital file formats used by publishers<br />
The complexities of format and structure conversion and the provision of the corresponding resource<br />
requirement<br />
The concerns of publishers regarding the release to third parties of digital text files<br />
Concerns by publishers that they may be impeded from collaborating by not having the requisite<br />
rights to authorise conversion into particular formats e.g. audio books<br />
Security is a vital issue for publishers and technical protection measures are a complex issue for most content holders.<br />
Every publisher’s content, client base and requirements are different, which often results in a personalised set of<br />
requirements for each case. As a result, approaches to li<strong>cen</strong>sing and <strong>agreement</strong>s on accessible formats are often<br />
negotiated on a case-by-case basis. Naturally, publishers have to be confident that any digital format is being delivered<br />
through secure gateways to only the people who are intended to receive it. For these reasons, there is a perception that<br />
the provision of digital files in alternative formats may compromise technical protection measures. Combined with a<br />
widespread belief that the provision of accessible format materials is expensive and time-consuming, only limited<br />
progress has thus far been made.<br />
69
CWA <strong>15778</strong>:2008 (E)<br />
Much of the discussion around DRM and Accessibility has necessarily focused on the right of access versus the need to<br />
protect content. However, points of common interest exist and the development of trusted intermediary concepts can<br />
offer real-world solutions.<br />
From a technical perspective, earlier problems relating to the digitisation of materials have been largely overcome and<br />
re<strong>cen</strong>t formats (such as XML, RDF, METS, MARC21 etc) provide a realistic basis for implementing the different aspects<br />
of this work. It is now possible to address the key concerns of content creators and providers and coherently to address<br />
issues such as: automation of document structuring, adherence to emerging standards, workflow support, digital rights<br />
management and secure distribution platforms.<br />
As the lifetime of a book gets shorter and shorter, publishers frequently have to offer access to digital versions of that<br />
book and taking this into account when constructing the layout brings us much closer to real accessibility in the wider<br />
sense. Indeed, it has been the accessibility community that has in many ways pioneered new structures for digital<br />
content, as these developments are often borne of need.<br />
Trusted intermediaries establish a personalised relationship between content holders and specialist organisations<br />
whereby publishers and agencies serving blind and partially sighted people work together in a secure and trusting<br />
environment to increase the quantity and timeliness of titles available in an accessible format. Within trusted intermediary<br />
frameworks, DRM is an enabler of controlled access. A number of different security methods are being developed or are<br />
already in use for making content available in this way.<br />
As far as security is concerned, the higher the level the more likely publishers are to allow content to be made available<br />
in accessible digital formats. At present, the security systems used are simple, they use basic encryption technologies<br />
with key exchange mechanisms. The potential for the release of content is considerable – although there are few<br />
recorded instances of such occurring. Once decrypted, content is available to anyone, authorised or not. The ability to<br />
attach content to particular devices, or better to provide access only to authorised users, requires a level of DRM<br />
sophistication that is not yet generally in place in services catering to the needs of visually impaired people.<br />
8.5.1 Further research on li<strong>cen</strong>sing and technical protection measures<br />
There is a need to examine and describe existing practice in this area, with particular focus on the implementation of<br />
trusted intermediary environments.<br />
Further research is required to examine accessibility in the wider sense and to examine the requirements for modelling<br />
accessibility and DRM within emerging multimedia environments.<br />
9 Conclusion and future work<br />
Accessible content started as a niche market mainly targeting people with visual impairments and dyslexic users.<br />
Publishers have re<strong>cen</strong>tly realised the potential market and are offering alternative book versions (audio, large print, etc.)<br />
of newly published editions together with the printed ones. Accessible content can be used in different situations and<br />
cover part of the changing requirements of the user population in the framework of ambient intelligence for content<br />
anytime, anywhere, and with any service. Usage examples include, users on the move, multitasking environments that<br />
require hands-free, eyes-free access and browsing, learning and instructional s<strong>cen</strong>arios, and many others.<br />
Despite this, the full potential of accessible content has not been released yet, due to a mismatch between existing<br />
standards regarding accessibility and their implementation in publishing processes. This CWA aims to reduce that gap<br />
and realise the full market potential of accessible content. Therefore, several real-life publishing s<strong>cen</strong>arios have been<br />
analysed and related actors, formats, conversions and standards presented.<br />
In this elaboration the following conclusions can be drawn:<br />
(a) There are many common tasks that need to be covered when introducing accessibility into the publishing workflow.<br />
Those tasks deal with the definition of document style sheets that are agreed and disseminated to the actors, manual<br />
correction of conversion input/output (in the case where content creators did not follow style sheet requirements),<br />
extension of descriptions regarding visual data (images, videos) and other multimedia material and lead to common<br />
requirements.<br />
(b) Specific format conversions need to take place in order to accommodate user needs. In these format conversions, the<br />
original structure, annotation and metadata and intellectual property rights of the parent documents need to be inherited<br />
to the child documents and not be lost. One of the strategies that seem to be most viable is to use a media rich (including<br />
70
CWA <strong>15778</strong>:2008 (E)<br />
annotations, and metadata) structured format as reference and then create the desired output operating direct on that<br />
format. An additional advantage of this strategy is that this format can be ideally also used for backups and to be stored<br />
in the publishing archive.<br />
c) Existing research, standards and technologies can be used in order to transform current publishing workflows into<br />
accessible content processing workflows. In order to be successful in this transformation, the barriers publishers face for<br />
deploying accessibility into their workflow must fall and the in<strong>cen</strong>tives for producing accessible content increased.<br />
Barriers will fall when automated format conversions and clear stylesheets and guidelines are followed. Although there is<br />
still a lot of research to be done in order to robust and reliable produce automatically accessible documents from any<br />
source, there are nowadays means to economically implement accessible content processing workflows. In<strong>cen</strong>tives for<br />
producing accessible content may come from opening new markets for publishers e.g. personalised information delivery,<br />
electronic distribution, efficient storage, preservation, search and retrieval of publishing titles, and therefore a greater<br />
economic return is expected.<br />
The analysis of accessible information provision showed also that there are some specific steps that need to<br />
be done to complement and extent the accessible information publishing process. Those are:<br />
<br />
<br />
<br />
<br />
For software developers and ICT researchers, to develop frameworks and tools to support actors in<br />
content production to fulfil the accessibility requirements, including automated conversion of both<br />
single and multi-type composite document formats into accessible documents, personalised<br />
presentation of content and adaptive content interfaces, li<strong>cen</strong>sing a technical protection measures.<br />
The focus should be to solutions that address specific sector requirements (eGovernment, eLearning,<br />
Medical Documents, Scientific documents, etc.)<br />
For accessibility researchers to look more clearly and define requirements of users with special<br />
needs in the information provision, and based on that, propose and/or adapt standards for the<br />
publishing and content management industry.<br />
For the content management industry to bring accessible information provision on its agenda and<br />
start proposing standards in this area, supporting open framework and standards development.<br />
For publishers and publishing associations to work on elaborating tools used by authors in the<br />
content authoring process, and systems used in content production and delivery with the aim to<br />
provide detailed input for on-going accessible format and accessibility software and standards<br />
development and be part of future development on process and content modelling efforts for<br />
accessible information provision<br />
It is clear from the above elaboration that accessible information provision requires interdisciplinary efforts in<br />
order to be realised. We look forward to this realisation process and hope to attract the best solutions from<br />
each area and to see barriers to access information reducing.<br />
71
CWA <strong>15778</strong>:2008 (E)<br />
Appendix A – Relevant standards<br />
Nr. Title Subject (if available) URL<br />
BP – PRD<br />
BP – PRO<br />
MPEG 21<br />
Open eBook Publication Structure<br />
specification<br />
ST – PRD<br />
The International Standard Text Code<br />
(ISTC)<br />
This document, derived from the guidelines for TEI Lite, provides an introduction<br />
to the recommendations of the Text Encoding Initiative (TEI), by describing a<br />
subset to, and extension of, the full TEI encoding scheme developed for marking<br />
up OUCS web pages and course documentation.<br />
I have prepared a set of XSLT specifications to transform TEI XML documents to<br />
HTML, and to XSL Formatting Objects. I have con<strong>cen</strong>trated on TEI Lite, but<br />
adding support for other modules is fairly easy, and I am working my way<br />
through the TEI as applications come along. In the main, the setup has been<br />
used on `new' documents, i.e. reports and web pages that I have authored from<br />
scratch, rather than traditional TEI-encoded existing material.<br />
MPEG-21 is developed within the International Standard Organisation (ISO) and<br />
aims at defining a normative open framework for multimedia delivery and<br />
consumption for use by all the players in the delivery and consumption chain<br />
The Text Encoding Initiative (TEI) Guidelines are an international and<br />
interdisciplinary standard that facilitates libraries, museums, publishers, and<br />
individual scholars represent a variety of literary and linguistic texts for online<br />
research, teaching, and preservation.<br />
The International Standard Text Code (ISTC) is developed by ISO Project 21047<br />
and aims at a unique, international identification of individual textual works.<br />
http://www.chiariglione.org/mpeg/standards/<br />
mpeg-21/mpeg-21.htm<br />
http://www.idpf.org/specs.htm<br />
http://www.tei-c.org/<br />
http://www.collectionscanada.ca/iso/tc46sc9<br />
/wg3.htm<br />
AEN/CTN 139<br />
AEN/CTN 153<br />
Authoring Tools<br />
Working Group<br />
(AUWG)<br />
Computer applications for people with<br />
disabilities. Computer accessibility<br />
requirements. Software<br />
Audio description for visually impaired<br />
people. Guidelines for audio description<br />
procedures and for the preparation of<br />
audio guides<br />
Authoring Tool Accessibility Guidelines<br />
1.0 (ATAG 1.0)<br />
http://www.aenor.es/desarrollo/inicio/home/<br />
home.asp<br />
http://www.aenor.es/desarrollo/inicio/home/<br />
home.asp<br />
http://www.oucs.ox.ac.uk/oucsweb/teioucs.xml<br />
http://xml.web.cern.ch/XML/www.teic.org/Stylesheets/teixsl.html<br />
http://www.w3.org/TR/2000/REC-ATAG10-<br />
20000203/<br />
72
CWA <strong>15778</strong>:2008 (E)<br />
Nr. Title Subject (if available) URL<br />
Authoring Tools<br />
Working Group<br />
(AUWG)<br />
BS 7000-6:2005<br />
Authoring Tool Accessibility Guidelines<br />
2.0 (ATAG 2.0)<br />
Design management systems. Managing<br />
inclusive design. Guide<br />
http://www.w3.org/TR/ATAG20/<br />
http://www.bsiglobal.com/Quality_management/Design/bs<br />
7000-6.xalter<br />
CAN/CSA-B659-01 Design for Aging http://www.csaintl.org/onlinestore/GetCatalogItemDetails.a<br />
sp?mat=000000000002012683<br />
Draft ISO/IEC 24751<br />
Individualised Adaptability and<br />
Accessibility in E-learning, <strong>Education</strong><br />
and Training<br />
The scope of this multi- part standard is to provide a common framework to<br />
facilitate matching of learner accessibility needs and preferences with<br />
appropriate learning resources and user interfaces.<br />
Part 1: Framework<br />
Part 2: Access For All Personal Needs and Preferences Statement<br />
Part 3: Access For All Digital Resource Description<br />
Part 1: http://jtc1sc36.org/doc/36N1139.pdf<br />
(temporary URL)<br />
Part 2: http://jtc1sc36.org/doc/36N1140.pdf<br />
(temporary URL)<br />
Part 3: http://jtc1sc36.org/doc/36N1141.pdf<br />
(temporary URL)<br />
Draft ISO/IEC 26513 Software and systems engineering –<br />
User documentation requirements for<br />
documentation evaluators and testers.<br />
Draft ISO/IEC 26514 Software and systems engineering –<br />
User documentation requirements for<br />
documentation designers and<br />
developers.<br />
http://www.jtc1-sc7.org/<br />
To be developed by ISO/IEC JTC 1/SC 7/WG 2 – Software and systems<br />
documentation.<br />
Should contain both requirements and recommendations on all aspects of<br />
documentation evaluation and testing.<br />
Being developed by ISO/IEC JTC 1/SC 7/WG 2 – Software and systems<br />
documentation. Contains both requirements and recommendations on all<br />
aspects of documentation including planning, design, production and<br />
maintenance. Several clauses provide guidance on accessible documentation,<br />
notably 12.5<br />
Estimated ISO publication in 2010.<br />
Estimated ISO publication in 2008.<br />
EN 1332-4<br />
Identification Card Systems - Manmachine<br />
interface - Part 4: Coding of<br />
user requirements for people with special<br />
needs<br />
Machine readable cards facilitate the provision of a growing variety of services<br />
across Europe. The purpose of EN 1332 is to increase the accessibility of these<br />
services for the benefit of consumers. This will be achieved by facilitating the<br />
inter-sector and cross border interpretability of machine-readable cards and to<br />
do so with the maximum possible degree of user-friendliness. EN 1332<br />
addresses the needs of all users, including people with special needs, for<br />
example the aged, minors, the disabled, the visually impaired, those with<br />
learning difficulties, first time users, those not conversant with the local<br />
language.<br />
http://www.bsi-<br />
global.com/en/Shop/Publication-<br />
Detail/?pid=000000000030009505<br />
73
CWA <strong>15778</strong>:2008 (E)<br />
Nr. Title Subject (if available) URL<br />
ETSI EG 202 416:<br />
User interfaces: Set up procedures for<br />
mobile terminals and services<br />
Being developed by ETSI HF STF 285 – Guidelines for set up procedures for<br />
mobile terminals and services.<br />
The ETSI Guidelines (EG) will include documentation and include<br />
recommendations for disabled and elderly users.<br />
Available from ETSI web site free-ofcharge.<br />
http://www.etsi.org/services_products/freest<br />
andard/home.htm<br />
ETSI EG 202 417<br />
User education guidelines for mobile<br />
terminals and e-services<br />
Available from ETSI web site free-ofcharge.<br />
http://www.etsi.org/services_products/freest<br />
andard/home.htm<br />
IEEE standard<br />
RAMlet – Reference model for resource<br />
aggregation.<br />
http://ieeeltsc.org/wg11CMI/ramlet/Pub/<br />
also:<br />
http://www.ieeeltsc.org/workinggroups/wg11CMI/ramlet/Pub/RAMLET_proj<br />
ect_description.pdf/view<br />
IMS standard<br />
IMS AccessForAll Meta-data<br />
Specification 1.0<br />
http://www.imsglobal.org/accessibility<br />
IMS standard<br />
IMS Learner Information Package<br />
Accessibility for LIP 1.0<br />
http://www.imsglobal.org/accessibility<br />
ISO 10075:1991<br />
Ergonomic principles related to mental<br />
work-load -- General terms and<br />
definitions<br />
http://www.iso.org/iso/en/CatalogueDetailPa<br />
ge.CatalogueDetail?CSNUMBER=18045&I<br />
CS1=13&ICS2=180&ICS3=<br />
ISO 14915-1:2002<br />
Software ergonomics for multimedia user<br />
interfaces -- Part 1: Design principles<br />
and framework<br />
http://www.iso.org/iso/en/CatalogueDetailPa<br />
ge.CatalogueDetail?CSNUMBER=25578&s<br />
copelist=ALL<br />
ISO 14915-2:2003<br />
Software ergonomics for multimedia user<br />
interfaces -- Part 2: Multimedia<br />
navigation and control<br />
http://www.iso.org/iso/en/CatalogueDetailPa<br />
ge.CatalogueDetail?CSNUMBER=28583&s<br />
copelist=ALL<br />
ISO 20282-1:2006 Ease of operation of everyday products -<br />
- Part 1: Context of use and user<br />
characteristics<br />
ISO 20282-1:2006 provides requirements and recommendations for the design<br />
of easy-to-operate everyday products, where ease of operation addresses a<br />
subset of the concept of usability concerned with the user interface by taking<br />
account of the relevant user characteristics and the context of use.<br />
http://www.iso.org/iso/en/CatalogueDetailPa<br />
ge.CatalogueDetail?CSNUMBER=34122&s<br />
copelist=PROGRAMME<br />
74
CWA <strong>15778</strong>:2008 (E)<br />
Nr. Title Subject (if available) URL<br />
ISO 2108:2005 Information and documentation --<br />
International standard book number<br />
(ISBN)<br />
ISO TS 20282-2:2006 Ease of operation of everyday products -<br />
- Part 2: Test method<br />
The purpose of ISO 2108:2005 is to establish the specifications for the<br />
International Standard Book Number (ISBN) as a unique international<br />
identification system for each product form or edition of a monographic<br />
publication published or produced by a specific publisher. It specifies the<br />
construction of an ISBN, the rules for its assignment and use, the metadata to be<br />
associated with the ISBN allocation, and the administration of the ISBN system.<br />
ISO 20282-2:2006 specifies a test method for measuring the ease of operation<br />
of "walk-up-and-use" products. The purpose of the test is to provide a basis for<br />
predicting the ease of operation of a walk-up-and-use product, including<br />
measures of its effectiveness and efficiency of operation, and the satisfaction of<br />
the intended user population in its expected context of use.<br />
http://www.isbninternational.org/en/index.html<br />
http://www.iso.org/iso/en/CatalogueDetailPa<br />
ge.CatalogueDetail?CSNUMBER=36452&s<br />
copelist=PROGRAMME<br />
ISO/AWI TR 22411<br />
ISO/CD 9241-20<br />
ISO/DIS 9241-151<br />
ISO/DIS 9241-171<br />
ISO/DIS 9241-300<br />
ISO/IEC 11581<br />
Ergonomic data and guidelines for the<br />
application of ISO/IEC Guide 71 in<br />
standards related to products and<br />
services to address the needs of older<br />
persons and persons with disabilities<br />
Ergonomics of human system interaction<br />
-- Accessibility guideline for information<br />
communication equipment and services -<br />
- Part 20: General guidelines<br />
Ergonomics of human-system interaction<br />
-- Part 151: Guidance on World Wide<br />
Web user interfaces<br />
Ergonomics of human-system interaction<br />
-- Part 171: Guidance on software<br />
accessibility<br />
Ergonomics of human-system interaction<br />
-- Part 300: Introduction to requirements<br />
and measurement techniques for<br />
electronic visual displays<br />
Information technology -- User system<br />
interfaces and symbols-part 1-6<br />
http://www.iso.org/iso/en/CatalogueDetailPa<br />
ge.CatalogueDetail?CSNUMBER=40933&s<br />
copelist=PROGRAMME<br />
http://www.iso.org/iso/en/CatalogueDetailPa<br />
ge.CatalogueDetail?CSNUMBER=40727&s<br />
copelist=PROGRAMME<br />
http://www.iso.org/iso/en/CatalogueDetailPa<br />
ge.CatalogueDetail?CSNUMBER=37031&s<br />
copelist=ALL<br />
http://www.iso.org/iso/en/CatalogueDetailPa<br />
ge.CatalogueDetail?CSNUMBER=39080&s<br />
copelist=ALL<br />
http://www.iso.org/iso/en/CatalogueDetailPa<br />
ge.CatalogueDetail?CSNUMBER=40096&s<br />
copelist=PROGRAMME<br />
http://www.iso.org/iso/en/CatalogueListPag<br />
e.CatalogueList?COMMID=4768&scopelist<br />
=ALL<br />
75
CWA <strong>15778</strong>:2008 (E)<br />
Nr. Title Subject (if available) URL<br />
ISO/IEC 18019:2004<br />
Guidelines for the design and<br />
preparation of user documentation for<br />
application software<br />
Provides guidelines for the design and preparation of user documentation for<br />
application software. It describes how to establish what information users need,<br />
how to determine the way in which that information should be presented to the<br />
users, and how then to prepare the information and make it available. Contains<br />
recommendations on implementing accessibility for documentation (clause<br />
4.2.6).<br />
ISO/IEC 18019:2004 Software and system engineering --<br />
Guidelines for the design and<br />
preparation of user documentation for<br />
application software<br />
http://www.iso.org/iso/en/CatalogueDetailPa<br />
ge.CatalogueDetail?CSNUMBER=30804&I<br />
CS1=35&ICS2=80&ICS3=<br />
ISO/IEC 26300:2006<br />
OASIS standard<br />
Information technology -- Open<br />
Document Format for Office Applications<br />
(OpenDocument) v1.0<br />
The OpenDocument specification defines an XML schema for office applications<br />
and its semantics. The schema is suitable for office documents, including text<br />
documents, spreadsheets, charts and graphical documents like drawings or<br />
presentations, but is not restricted to these kinds of documents.<br />
The schema provides for high-level information suitable for editing documents. It<br />
defines suitable XML structures for office documents and is friendly to<br />
transformations using XSLT or similar XML-based tools<br />
http://www.oasisopen.org/committees/tc_home.php?wg_abb<br />
rev=office,<br />
http://www.iso.org/iso/en/CatalogueDetailPa<br />
ge.CatalogueDetail?CSNUMBER=43485&s<br />
copelist=ALL<br />
ISO/IEC CD 24756<br />
Information technology -- Algorithmic<br />
framework for determining accessibility<br />
for individual users of interactive systems<br />
http://www.iso.org/iso/en/CatalogueListPag<br />
e.CatalogueList?COMMID=4768&scopelist<br />
=ALL<br />
ISO/IEC CD TR<br />
19766<br />
Information Technology -- Guidelines for<br />
the design of icons and symbols<br />
accessible to all users, including the<br />
elderly and persons with disabilities<br />
http://www.iso.org/iso/en/CatalogueListPag<br />
e.CatalogueList?COMMID=4768&scopelist<br />
=ALL<br />
ISO/IEC DTR 19765<br />
Information technology -- Survey of<br />
existing icons and symbols for elderly<br />
and disabled persons<br />
http://www.iso.org/iso/en/CatalogueListPag<br />
e.CatalogueList?COMMID=4768&scopelist<br />
=ALL<br />
ISO/IEC NP 24786-1<br />
Information Technology - User Interfaces<br />
- Accessible User Interface for<br />
Accessibility Setting on Information<br />
Devices -- Part 1: General and methods<br />
to start<br />
http://www.iso.org/iso/en/CatalogueDetailPa<br />
ge.CatalogueDetail?CSNUMBER=41556&s<br />
copelist=ALL<br />
ISO/IEC TR 19764<br />
Information technology -- Guidelines,<br />
methodology and reference criteria for<br />
cultural and linguistic adaptability in<br />
information technology products<br />
http://www.iso.org/iso/en/CatalogueListPag<br />
e.CatalogueList?COMMID=4768&scopelist<br />
=ALL<br />
76
CWA <strong>15778</strong>:2008 (E)<br />
Nr. Title Subject (if available) URL<br />
ISO/TR 16982:2002<br />
Ergonomics of human-system interaction<br />
-- Usability methods supporting human<strong>cen</strong>tred<br />
design<br />
http://www.iso.ch/iso/en/CatalogueDetailPa<br />
ge.CatalogueDetail?CSNUMBER=31176&s<br />
copelist=<br />
ISO/TS 16071:2003<br />
Ergonomics of human-system interaction<br />
-- Guidance on accessibility for humancomputer<br />
interfaces<br />
http://www.iso.org/iso/en/CatalogueDetailPa<br />
ge.CatalogueDetail?CSNUMBER=30858&s<br />
copelist=ALL<br />
Italian Government<br />
Law n. 4, January 9, 2004 - Provisions to<br />
support the access to information<br />
technologies for the disabled (also<br />
known as "The Stanca Act").<br />
http://www.pubbliaccesso.gov.it/english/ind<br />
ex.htm<br />
JIS X 8341-1<br />
Japanese Industrial Standards<br />
Committee.<br />
Guidelines for older persons and persons with disabilities -- Information and<br />
communications equipment, software and services -- Part 1: Common<br />
Guidelines<br />
http://www.webstore.jsa.or.jp/webstore/Com<br />
/FlowControl.jsp?lang=en&bunsyoId=JIS+X<br />
+8341-<br />
1%3A2004&dantaiCd=JIS&status=1&page<br />
No=0<br />
JISS0032<br />
Japanese Industrial Standards<br />
Committee<br />
Guidelines for the elderly and people with disabilities - Visual signs and displays<br />
- Estimation of minimum legible size for a Japanese single character<br />
http://www.webstore.jsa.or.jp/webstore/Com<br />
/FlowControl.jsp?lang=en&bunsyoId=JIS+S<br />
+0032%3A2003&dantaiCd=JIS&status=1&<br />
pageNo=0<br />
JISS0033<br />
Japanese Industrial Standards<br />
Committee<br />
Guidelines for the elderly and people with dasabilities - Visual signs and displays<br />
- A method for color combination based on categories of fundamental colors as a<br />
functionof age<br />
http://www.webstore.jsa.or.jp/webstore/Com<br />
/FlowControl.jsp?lang=en&bunsyoId=JIS+S<br />
+0033%3A2006&dantaiCd=JIS&status=1&<br />
pageNo=0<br />
JISZ8071<br />
Japanese Industrial Standards<br />
Committee<br />
Guilines for standards developers to address the needs of older persons and<br />
persons with disabilities<br />
http://www.webstore.jsa.or.jp/webstore/Com<br />
/FlowControl.jsp?lang=en&bunsyoId=JIS+Z<br />
+8071%3A2003&dantaiCd=JIS&status=1&<br />
pageNo=0<br />
Nordic Cooperation on<br />
Disability<br />
Nordic Guidelines for Computer<br />
Accessibility<br />
http://trace.wisc.edu/docs/nordic_guidelines<br />
/nordic_guidelines.htm<br />
OASIS standard OASIS DITA Language Specification The Darwin Information Typing Architecture (DITA) specification defines both a)<br />
a set of document types for authoring and organising topic-oriented information;<br />
and b) a set of mechanisms for combining and extending document types using<br />
a process called specialisation.<br />
http://www.oasisopen.org/committees/download.php/12091/<br />
cd2.zip<br />
77
CWA <strong>15778</strong>:2008 (E)<br />
Nr. Title Subject (if available) URL<br />
Standard from:<br />
EDItEUR jointly with<br />
Association of<br />
American Publishers,<br />
Book Industry<br />
Communication and<br />
the Book Industry<br />
Study Group.<br />
ONIX International<br />
ONIX International is the international standard for representing and<br />
communicating book industry product information in electronic form,<br />
incorporating the core content which has been specified in national initiatives<br />
such as BIC Basic and AAP’s ONIX Version 1'<br />
http://www.editeur.org/onix.html<br />
Swedish Government,<br />
Office of the Disability<br />
Ombudsman<br />
Guidelines for an accessible public<br />
administration<br />
http://www.tillganglig.se/start.asp?lang=en&<br />
sida=1450<br />
US section 508 US Section 508 , on the requirements for accessibility for public procurement.<br />
This act requires all federal agencies' electronic and information technology is<br />
accessible to people with disabilities.<br />
http://www.section508.gov/<br />
User Agent<br />
Accessibility<br />
Guidelines Working<br />
Group (UAWG)<br />
User Agent Accessibility Guidelines<br />
(UAAG) 1.0<br />
http://www.w3.org/TR/2002/REC-UAAG10-<br />
20021217/<br />
W3C<br />
Recommendation<br />
EARL (W3) , the Evaluation and Report<br />
Language.<br />
The Evaluation and Report Language is a standardized vocabulary to express<br />
test results.<br />
http://www.w3.org/TR/EARL10-Schema/<br />
W3C<br />
Recommendation<br />
Web Content Accessibility Guidelines 1.0<br />
WCAG 1.0 has 14 guidelines that are general principles of accessible design.<br />
Each guideline has one or more checkpoints that explain how the guideline<br />
applies in a specific area.<br />
http://www.w3.org/WAI/intro/wcag.php<br />
W3C<br />
Recommendation<br />
Web Content Accessibility Guidelines 2.0<br />
Following WCAG makes Web content more accessible to the vast majority of<br />
users, including people with disabilities and older users, using many different<br />
devices including a wide variety of assistive technology.<br />
http://www.w3.org/TR/2005/WD-WCAG20-<br />
20051123/<br />
W3C<br />
Recommendation<br />
W3C<br />
Recommendation<br />
XHTML (1.0) W3C (X)HTML Working group http://www.w3.org/TR/xhtml1/,<br />
Working group homepage:<br />
http://www.w3.org/MarkUp/<br />
XML standard W3C XML standards http://www.w3.org/TR/2006/REC-xml-<br />
20060816/<br />
Working group homepage:<br />
http://www.w3.org/XML/<br />
78
CWA <strong>15778</strong>:2008 (E)<br />
Appendix B – Relevant European organisations<br />
Title Subject URL<br />
AAP Open eBook Standards Project and AAP/Andersen<br />
Consulting eBook Study<br />
AXMEDIS (EC Project)<br />
BISG (Book Industry Study Group)<br />
The goal of the AAP Open ebook Standards Project was to recommend standards<br />
and requirements in the areas of Digital Rights Management [ref], Metadata and<br />
Numbering that will enable an open, competitive marketplace for ebook commerce<br />
on a large scale. The intention is to consider all aspects of the burgeoning ebook<br />
marketplace in developing standards and requirements to promote its growth.<br />
AXMEDIS is developing technologies to reduce the costs of digital content<br />
production, distribution and protections. It is an environment where digital content<br />
producers, aggregators and distributors can gain access to a wide range of digital<br />
contents.<br />
A membership-supported, not-for-profit research organisation comprised of<br />
organisations from every sector of the publishing community. Its goal is to provide<br />
accurate and current research information about the industry for its members and<br />
others. BISG produced jointly with the Association of American Publishers and<br />
Book Industry Communication the ONIX International standard.<br />
http://www.publishers.org/digital/index.cfm<br />
http://www.axmedis.org/<br />
http://www.bisg.org/<br />
BrailleNet, France: Digital Document Delivery for the Blind in France http://www.braillenet.org/<br />
DAISY Consortium<br />
DBK, De Braillekrant, Belgium<br />
The DAISY Consortium's mission is to develop the International Standard and<br />
implementation strategies for the production, exchange, and use of Digital Talking<br />
Books in both developed and developing countries, with special attention to<br />
integration with mainstream technology, to ensure access to information for people<br />
with print disabilities.<br />
Private foundation supported by the Katholieke Universiteit Leuven and Sensotec<br />
company, producing daily newspapers in Braille since 1993, in Daisy format since<br />
2003 and in audio format starting April 2007.<br />
http://www.daisy.org/<br />
http://www.braillekrant.be/<br />
DEDICON Netherlands: National Federation of Libraries for the Blind http://www.dedicon.nl/<br />
Digital Media Project<br />
On the policy and legal side, new policies should be determined and legacy policies<br />
should be revised<br />
On the technical side, a DRM platform should be designed offering the following<br />
main features<br />
http://www.chiariglione.org/project/<br />
Dolphin Audio Publishing, United Kingdom: Multimedia solutions for the adaptive technology industries http://www.yourdolphin.com/<br />
79
CWA <strong>15778</strong>:2008 (E)<br />
ENABLES Enhanced Network Accessibility for the Blind<br />
and Visually Impaired<br />
ETSI Task Force STF 286<br />
The specific objectives in this area include:<br />
Developing techniques that will convert existing non-accessible Web contents into<br />
accessible forms according to the user's need.<br />
Investigate multimodal representation for different contents, different applications<br />
and different user disabilities.<br />
Producing guidelines for creating multimodal representation and toolkits for<br />
developers to create accessible Web contents in general and images in particular.<br />
ETSI Human Factors designing access symbols to indicate special services for<br />
disabled users of ICT equipment<br />
http://www.enabledweb.org/AWC.htm<br />
http://www.etsi.org/pressroom/Previous/200<br />
5/2005_05_stf286.htm<br />
EUAIN (EC Project) The EUAIN project (European Accessible Information Network) aims to promote e-<br />
Inclusion as a core horizontal building block in the establishment of the Information<br />
Society by creating a European Accessible Information Network to bring together<br />
the different actors in the content creation and publishing industries around a<br />
common set of objectives relating to the provision of accessible information.<br />
http://www.euain.org/modules/wfsection/<br />
FEP, Belgium: Federation of European Publishers, Brussels. http://www.fep-fee.be/<br />
FORCE, Netherlands:<br />
Independent foundation for education and support for the print impaired in<br />
developing countries<br />
http://www.force-foundation.org.uk/<br />
Institute Integriert Studieren, Austria Austria-wide Institute for Information systems Supporting Print Disabled Students http://www.integriert-studieren.jku.at/<br />
International Digital Publishing Forum (IDPF),<br />
ISO/IEC JTC 1 Special Working Group on Accessibility<br />
standards<br />
National Council For the Blind of Ireland Media<br />
Conversion Service (NCB IMCS)<br />
The International Digital Publishing Forum (IDPF), formerly the Open eBook Forum<br />
(OeBF), is the trade and standards association for the digital publishing industry.<br />
Established by JTC 1 (10/2004) to:<br />
- establish user requirements for accessibility standards<br />
- prepare an inventory of existing accessibility standards & legislation<br />
- Identify gaps (and overlaps) in accessibility standardisation<br />
- Work with standards bodies (ISO, IEC, ITU, CEN, ETSI, etc) to prepare new<br />
standards<br />
The MCS converts information and documents into formats accessible to<br />
people with vision impairments for a range of public and private<br />
clients. The MCS provides 3 specific services:<br />
1. Conversion into Braille and audio formats<br />
2. Audio-description<br />
3. MCS Consultancy and QA Services<br />
4. Braille Training for the Pharmaceutical Sector<br />
http://www.idpf.org/<br />
http://www.jtc1access.org/<br />
www.ncbi.ie<br />
80
CWA <strong>15778</strong>:2008 (E)<br />
OASIS DITA Technical Committee<br />
The purpose of the OASIS DITA Technical Committee (TC) is to define and<br />
maintain the Darwin Information Typing Architecture (DITA) and to promote the use<br />
of the architecture for creating standard information types and domain-specific<br />
markup vocabularies.<br />
http://www.oasisopen.org/committees/tc_home.php?wg_abb<br />
rev=dita<br />
OASIS ODF Technical Committee http://www.oasisopen.org/committees/workgroup.php?wg_a<br />
bbrev=odf-adoption<br />
ONCE, Spain: Spanish National Organisation for the Blind http://www.once.es/<br />
Royal National Institute of the Blind (UK)<br />
New RNIB web site addressing software accessibility<br />
Royal National Institute of the Blind<br />
RNIB<br />
Society for Technical Communication (US)<br />
UK’s leading charity offering information, support and advice to over two million<br />
people with sight problems. The Web Access Centre provides web designers and<br />
managers with the tools and resources needed to plan, build and test accessible<br />
websites. RNIB also offers paid for web accessibility consultancy services including<br />
website audits, advice, presentations and seminars. The directory of accessible<br />
websites lists the sites that have passed the RNIB audit within the past year.<br />
The STC is a US-based international organisation and has a Special Interest<br />
Groups addressing the accessibility of documentation (A-SIG).<br />
http://www.rnib.org.uk/xpedio/groups/public/<br />
documents/publicwebsite/public_sachome.h<br />
csp<br />
www.rnib.org.uk<br />
http://www.stcsig.org/sn/index.shtml<br />
SUTII, Poland: University Technical Research Department<br />
TechDis (a JISC Advisory Service)<br />
Technologie-Zentrum Informatik, University Bremen,<br />
Germany:<br />
Web Accessibility Business Case Documents<br />
Web Standards Project<br />
The mission of the JISC TechDis Service is to support the education sector in<br />
achieving greater accessibility and inclusion by stimulating innovation and providing<br />
expert advice and guidance on disability and technology. The TechDis website<br />
features many aids for accessible document processing, including the Accessibility<br />
Essentials guides to inclusive use of Microsoft (R) Word and Adobe (R) PDFs.<br />
Institute at the University Bremen with the main aim to develop cutting edge<br />
technologies in computer science and engineering and transfer those into practice.<br />
Special focus on multimedia content accessibility and presentation<br />
The Web Accessibility Initiative (WAI) <strong>Education</strong> and Outreach Working Group<br />
(EOWG). "Developing a Web Accessibility Business Case for Your Organisation."<br />
describes social, technical, financial, legal and policy aspects of Web accessibility.<br />
It is designed to help organisations develop their own customised business case for<br />
Web accessibility. It provides text that can be used as is, as well as guidance on<br />
identifying the most relevant factors for a specific organisation.<br />
The Web Standards Project is a grassroots coalition fighting for standards that<br />
ensure simple, affordable access to web technologies for all.<br />
http://www.techdis.ac.uk<br />
http://www.tzi.de/<br />
http://www.w3.org/WAI/bcase/<br />
http://www.w3.org/WAI/<br />
http://www.webstandards.org/<br />
81
CWA <strong>15778</strong>:2008 (E)<br />
Appendix C – Sustainability: network of interested parties for ongoing support and further development<br />
Name Enterprise Country Function Email address<br />
P. Abrahams Bloor Research United Kingdom Practice leader Accessibility<br />
&Usability<br />
peter.abrahams@bloor-research.com<br />
Mr. A. Arch Online Accessibility Consulting Vision Australia Manager andrew.arch@nils.org.au<br />
Mr. H. Aspelund<br />
Directorate for Health and Social Affairs, The<br />
Delta Centre<br />
Norway Adviser ICT Haakon.Aspelund@shdir.no<br />
haa@shdir.no<br />
Mr. S. Ball JISC TechDis Service United Kingdom Senior Advisor simon@techdis.ac.uk<br />
Mrs. A. Bergman-Tahon Federation of European Publishers Belgium Director abergman@fep-fee.be<br />
Mr. H. Bjarnø Visual Impairment Knowledge Centre Denmark hbj@visinfo.dk<br />
Mrs. T. Bogner Institut Integriert Studieren Austria tanja.bogner@jku.at<br />
L. Bowick Ministere de l’Agriculture et de la Peche France Reseaux et telecommunications lisa.bowick@educagri.fr<br />
Mr. M. Brauer Sun Microsystems Germany Technical Architect Software<br />
Engineering<br />
Michael.brauer@sun.com<br />
Mrs. Lino Brundu Alitha of Milan Italy Chairman Lino.Brundu@libero.it<br />
Mr. Bruno bruno@vonniman.com<br />
Mr. D. Burger Association BrailleNet France Dominique.Burger@snv.jussieu.fr<br />
82
CWA <strong>15778</strong>:2008 (E)<br />
Name Enterprise Country Function Email address<br />
Mrs. J. Clark Jenny.Clark@rnib.org.uk<br />
Mr. M. Cooper Web Accessibility Specialist Web accessibility specialist cooper@w3.org<br />
Mr. D. Crombie DEDICON Netherlands Head International Projects dcrombie@dedicon.nl<br />
Mrs. J. Darzentas University of the Aegean Greece jennyd@aegean.gr<br />
Mr. D. Day OASIS DITA Technical Committee IBM Lead DITA Architect dond@us.ibm.com<br />
Mr. A. Egger University of Applied Sciences Austria alexander.egger@campus02.at<br />
Mr. J. Engelen Katholieke Universiteit Leuven Belgium jan.engelen@esat.kuleuven.be<br />
Mrs. B. Fanning AIIM USA Director BFANNING@AIIM.ORG<br />
Mr. M. Ford Martin Ford Consultancy Italy Consultant martin.ford@libero.it<br />
Mr. T. Fraser Monotype Imaging Ltd United Kingdom Finance Director timothy.fraser@monotypeimaging.co.uk<br />
Mr. dr. J. Friedrich IBM Germany Germany Program Manager ICT<br />
Standardization<br />
jochen@de.ibm.com<br />
Mr. J. Garner INCITS/Information Technology Industry Council USA jgarner@itic.org<br />
Cerys Giddings IBM UK Ltd UK cgiddin@uk.ibm.com<br />
Mrs. K. Grant Kate@ninetiles.com<br />
Mr. C. Gravenhorst claus.gravenhorst@ccs-gmbh.de<br />
83
CWA <strong>15778</strong>:2008 (E)<br />
Name Enterprise Country Function Email address<br />
Mr. A. Haffner Technische Universität Dresden Germany Professur Mensch-Maschine-<br />
Kommunikation<br />
alexander.haffner@inf.tu-dresden.de<br />
Mrs. Mayumi Handa RiverDocs Ireland Test Manager mayumi.handa@riverdocs.com<br />
A.K. Heath Axelrod Access For All United Kingdom andyheath@axelrod.plus.com<br />
Mr. S. Herramhof Sandor.Herramhof@jku.at<br />
Mr. R. Hodgkinson Institute of Scientific and Technical<br />
Communicators (UK)<br />
United Kingdom Consultant Richard_Hodgkinson@btinternet.com<br />
Mr. M. Horstmann TZI, University Bremen Germany mir@tzi.de<br />
Mr. A. Houser arh@groupwellesley.com<br />
Mr. Dr. G. Ioannidis IN2 and TZI - University. Bremen Germany Director gi@in-two.com and<br />
george.ioannidis@tzi.de<br />
Mr. H. Janczikowski hartmut.janczikowski@ccs-gmbh.de<br />
Mr. G. Kerscher kerscher@montana.com<br />
Mr. S. Klironomos FORTH/ ICS Greece Secretariat manager arh@groupwellesley.com<br />
Mr. M. Koettstorfer marco.koettstorfer@jku.at<br />
Ms. MIRA KOIVUSILTA STAKES, R&D Centre for Health and Welfare Finland Mira.Koivusilta@stakes.fi<br />
Mr. P. Korn Sun Microsystems United States Accessibility Architect peter.korn@sun.com<br />
84
CWA <strong>15778</strong>:2008 (E)<br />
Name Enterprise Country Function Email address<br />
Mr. N. Kovacs DIN Committee Information Technology Germany NIKOLAUS.KOVACS@DIN.DE<br />
Mr. F.J. Martinez-Calvo ONCE Spain fmc@once.es<br />
Dr. Thomas Kahlisch Deutsche Zentralbücherei für Blinde zu Leipzig Germany Director thomas.kahlisch@dzb.de<br />
K. Lindelien Standards Norway Norway KLi@standard.no<br />
Mrs. L McNamee Texthelp Systems Northern Ireland Marketing manager louise@texthelp.com<br />
Mr. S. McGrenery CTO Ireland seamus.mcgrenery@bigriverdocs.com<br />
Mr. N. McKenzie DEDICON Netherlands nmackenzie@dedicon.nl<br />
Mr. D. Mann RNIB david.mann@rnib.org.uk<br />
Mrs. M. McRae OASIS-OPEN mary.mcrae@oasis-open.org<br />
Mr. C. Menezes UNESCO France Senior programme specialist c.menezes@unesco.org<br />
Mr. F. Middelkoop DEDICON fmiddelkoop@fnb.nl<br />
Mr. Dr. K. Miesenberger University of Linz Austria klaus.miesenberger@jku.at<br />
Mr. J. O Connor NCBI (National Council For The Blind Of Ireland) Ireland Web Accessibility Consultant joshue.oconnor@ncbi.ie<br />
Mr. H. O’Neill Central Remedial Clinic Ireland Project Coordinator honeill@crc.ie<br />
Mr. R. Orme RNIB Richard.Orme@rnib.org.uk<br />
P. Permezel Cosmosbay-Vectis France Chef de project ppermezel@cosmosbay-vectis.com<br />
85
CWA <strong>15778</strong>:2008 (E)<br />
Name Enterprise Country Function Email address<br />
Ms. C. Pollitt National Library for the Blind United Kingdom carol.pollitt@nlbuk.org<br />
Mrs. F. Preteux francoise.preteux@int-evry.fr<br />
Mr. K. Richter klaus.richter@bam.de<br />
Mr. Roel van Gils roel@anysurfer.be<br />
Mr. Dr. R. Romero Fundacion SIDAR Spain rafael@rafaelromero.com<br />
Mr. R. Ruemer University of Linz Austria reinhard.ruemer@jku.at<br />
Mrs. A. Salaun European Commission Belgium anne.salaun@cec.eu.int<br />
S. Schotel DEDICON Netherlands sschotel@dedicon.nl<br />
Mr. Shadi Abou-Zahra W3C Web Accessibility Initiative France Web Accessibility Specialist for<br />
Europe<br />
shadi@w3.org<br />
Ms. S. Sollat C-LHISTOIRE Digital-Prod France Executive Producer dmz@c-lhistoire.com<br />
Mr. F. van Stek DEDICON fvstek@fnb.nl<br />
Mr. C. Stephan cstephan@wxs.nl<br />
Mr. M. Straat Adobe mstraat@adobe.com<br />
Mr. C. Strobbe K.U.Leuven - Departement of Electrical<br />
Engineering - Research Group on Document<br />
Architectures<br />
Belgium Christophe.Strobbe@esat.kuleuven.be<br />
86
CWA <strong>15778</strong>:2008 (E)<br />
Name Enterprise Country Function Email address<br />
Paivi Tahkokallio STAKES, R&D Centre for Health and Welfare Finland paivi.tahkokallio@stakes.fi,<br />
Mr. D. Taylor Lightning Source UK Ltd United Kingdom Managing Director David.Taylor@lightningsource.com<br />
Mr. Malte Timmermann Sun Microsystems GmbH Germany Technical Architect Software<br />
Engineering<br />
Malte.Timmermann@sun.com<br />
Mr. T. Tontchev toni.tontchev@dzb.de<br />
Mr. D. Tucker FORCE Foundation Netherlands dtucker@f-force.nl<br />
Mr. S. Tyler RNIB steve.tyler@rnib.org.uk<br />
Mr. L. Van den Berghe<br />
CEN/ISSS - Information Society Standardization<br />
System<br />
Belgium Workshop manager luc.vandenberghe@<strong>cen</strong>orm.be<br />
Mr. C. Walinn Danish National Library for the Blind Denmark <strong>cwa</strong>@dbb.dk<br />
Mrs. M. White mandy.white@rnib.org.uk<br />
Mr. R. Winiarsczyk Silesian University of Technology Poland head of research group Ryszard.Winiarczyk@polsl.pl<br />
Mr. T. Worthington tom.worthington@tomw.net<br />
Mr. J. Worsfold Dolphin Computer Access United Kingdom john.worsfold@dolphinuk.co.uk<br />
Mr. W. Wünschmann ww2@inf.tu-dresden.de<br />
Mr. Dr. J. Rietveld Netherlands Standardization institute Netherlands Standardization Consultant jan.rietveld@NEN.nl<br />
87
CWA <strong>15778</strong>:2008 (E)<br />
Appendix D – Abbreviations List<br />
Abbreviation<br />
Text<br />
AAATE<br />
Association for the Advancement of Assistive Technology in Europe<br />
AIIM<br />
The Enterprise Content Management Association<br />
AIP<br />
Accessible Information Processing<br />
ANSI<br />
American National Standards Institute<br />
ASCII<br />
American Standard Code for Information Interchange<br />
CDDA<br />
Compact Disk Digital Audio<br />
CD-ROM<br />
Compact Disc read-only memory<br />
CEN<br />
COMITÉ EUROPÉEN DE NORMALISATION<br />
CMS<br />
Content Management System<br />
CSS<br />
Cascading Style Sheets<br />
CWA<br />
CEN Workshop Agreement<br />
DAISY<br />
Digital Accessible Information System<br />
DATSCG<br />
Design for All and Assistive Technologies Standardization Co-ordination Group<br />
DEXA<br />
Database and Expert Systems Applications<br />
DITA<br />
Darwin Information Typing Architecture<br />
DPA<br />
Document Processing for Accessibility<br />
DRM<br />
Digital Rights Management<br />
DTB<br />
Digital Talking Book<br />
DTD<br />
Document Type Definitions<br />
DTP<br />
Desktop publishing<br />
EARL<br />
Evaluation and Report Language<br />
EC<br />
European Commission<br />
ECM<br />
Enterprise Content Management<br />
EDeAN<br />
European Design for All e-Accessibility Network<br />
EIDD<br />
European Institute for Design and Disability<br />
88
CWA <strong>15778</strong>:2008 (E)<br />
ETSI<br />
European Telecommunications standards Institute<br />
EU<br />
European Union<br />
EUAIN<br />
European accessible information network<br />
EXD<br />
Encrypted XML document<br />
GNOME<br />
GNU Object Model Environment<br />
GNU<br />
a recursive acronym for GNU's Not Unix<br />
HTML<br />
Hypertext Markup Language<br />
ICCHP<br />
International Conference on Computers Helping People with Special Needs<br />
ICS<br />
International Classification for Standards<br />
ICTSB<br />
Information Communication Technologies Standards Board<br />
IEC<br />
International Electrotechnical Commission<br />
IEEE<br />
Institute of Electrical and Electronics Engineers<br />
IMS<br />
IMS Global Learning Consortium<br />
ISO<br />
International Organization for Standardization<br />
ISSS<br />
Information Society Standardization System<br />
JAWS<br />
Job Access With Speech<br />
JTC 1 Joint Technical Committee 1<br />
KDE<br />
K Desktop Environment<br />
LaTeX<br />
LaTeX is a document preparation system for high-quality typesetting.<br />
LCNS<br />
Lecture Notes in Computer Science<br />
LIP<br />
Learner Information Package<br />
MARC21<br />
Machien-Readable Cataloging<br />
MathML<br />
Mathemtatical Markup Language<br />
METS<br />
Metadata Encoding and Transmission Standard<br />
MIME<br />
Multipurpose Internet Mail Extensions<br />
MP3 MPEG-1 Audio Layer 3<br />
MPEG<br />
Moving Picture Experts Group<br />
NIMAS<br />
National Instructional Materials Accessibility Standard<br />
89
CWA <strong>15778</strong>:2008 (E)<br />
NISO<br />
Navigation Control Centre<br />
OASIS<br />
Organization for the Advancement of Structured Information Standards<br />
OCR<br />
Optical character recognition<br />
ODF<br />
OpenDocument Format<br />
OECD<br />
Organisation for Ecomic Co-operation and Development<br />
PC<br />
Personal Computer<br />
PDF<br />
Portable Document Format<br />
RDF<br />
Resource Description Framework<br />
RNIB<br />
Royal National Institute of the Blind<br />
RTD<br />
Research and Technology Development<br />
RTF<br />
Rich Text Format<br />
SC<br />
Subcommittee<br />
SCORM<br />
Sharable Content Object Reference Model<br />
SGML<br />
Standard Generalised Markup Language<br />
SME<br />
Small and medium enterprises<br />
SMIL<br />
Synchronized Multimedia Integration Language<br />
STM<br />
Scientific Technical Medical<br />
TC<br />
Technical Committee<br />
TC/HF<br />
Technical committee Human Factors<br />
TEI<br />
Text Encoding Initiative<br />
TeX<br />
TeX is a typesetting language.<br />
US<br />
United States<br />
USB<br />
Umoversa; Serial Bus<br />
VHS<br />
Video Home System<br />
VPAT<br />
Voluntary Product Accessibility Template<br />
W3C<br />
World Wide Web Consortium<br />
WAI<br />
Web Accessibility Initiatieve<br />
WAV<br />
short for Waveform<br />
90
CWA <strong>15778</strong>:2008 (E)<br />
WCAG<br />
Web Content Accessibility Guidelines<br />
WCT<br />
WIPO Copyright Treaty<br />
WG<br />
Working Group<br />
WIPO<br />
World Intellectual Property Organization<br />
WPPT<br />
WIPO Performances and Phonograms Treaty<br />
XHTML<br />
Extensible Hypertext Markup Language<br />
XML<br />
Extensible Markup Language<br />
XSLT<br />
XSL Transformations<br />
91