07.08.2013 Views

C3.5.0 Integration with BHL Portal - OpenUp!

C3.5.0 Integration with BHL Portal - OpenUp!

C3.5.0 Integration with BHL Portal - OpenUp!

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Project Acronym: <strong>OpenUp</strong>!<br />

Grant Agreement No: 270890<br />

COMPONENT REPORT<br />

Project Title: Opening up the Natural History Heritage for Europeana<br />

<strong>C3.5.0</strong> <strong>Integration</strong> <strong>with</strong> <strong>BHL</strong> <strong>Portal</strong><br />

Revision: Version 1.1<br />

Authors (in alphabetical order):<br />

Benda Odo AIT Forschungsgesellschaft mbH<br />

Höller Astrid AIT Forschungsgesellschaft mbH<br />

Koch Gerda AIT Forschungsgesellschaft mbH<br />

Koch Walter AIT Forschungsgesellschaft mbH<br />

Malarky Gavin Natural History Museum London<br />

P Public<br />

Project co-funded by the European Commission <strong>with</strong>in the ICT Policy Support Programme<br />

Dissemination Level<br />

C Confidential, only for members of the consortium and the Commission Services x<br />

AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 1


Revision History<br />

Revision Date Author Organisation Description<br />

Draft 2013-03-11 G. Malarky NHM Investigation input<br />

Draft 2013-03-22 O. Benda AIT Draft and implementation<br />

Draft 2013-04-03 A. Höller AIT Draft and concept of document<br />

Draft 2013-04-04 A. Höller AIT <strong>Integration</strong> of <strong>BHL</strong> in Pentaho<br />

Draft 2013-04-05 A. Höller AIT Examples<br />

Draft 2013-04-22 G. Koch AIT Revision and Examples<br />

Draft 2013-04-22 A. Höller AIT References, Screenshots<br />

Draft 2013-04-22 W. Koch AIT Revision<br />

Version 1.0 2013-04-22 G. Koch AIT Finalization<br />

Version 1.1 2013-04-23 G. Koch AIT Page 1 update<br />

Statement of Originality<br />

This deliverable contains original unpublished work except where clearly<br />

indicated otherwise. Acknowledgement of previously published material and of<br />

the work of others has been made through appropriate citation, quotation or<br />

both.<br />

Distribution<br />

Recipient Date Version Accepted YES/NO<br />

TMG 2013-04-22<br />

2013-04-23<br />

Project Coordinator 2013-04-22<br />

2013-04-23<br />

AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 2<br />

1.0<br />

1.1<br />

1.0<br />

1.1


Table of Contents<br />

1 DESCRIPTION OF WORK ................................................................................................................... 4<br />

2 THE <strong>BHL</strong> WEB PORTAL ...................................................................................................................... 5<br />

2.1 Access to bibliographic information in the <strong>BHL</strong> portal .............................................. 6<br />

2.2 Contextualization <strong>with</strong> bibliographic information by “Bibliography by URL” .............. 10<br />

3 INTEGRATING “BIBLIOGRAPHY BY URL” INTO PENTAHO.................................................................. 10<br />

3.1 The Stylesheet ................................................................................................ 11<br />

4 EXAMPLE ....................................................................................................................................... 13<br />

5 LIST OF FIGURES ............................................................................................................................. 16<br />

6 LIST OF REFERENCES ....................................................................................................................... 16<br />

AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 3


1 DESCRIPTION OF WORK<br />

The aim of this document is to describe the contextualization of <strong>OpenUp</strong>! metadata <strong>with</strong> <strong>BHL</strong> bibliographic<br />

information data. Figure 1 shows the overall workflow of ingesting records into Europeana <strong>with</strong> the step<br />

covered in this document marked red.<br />

Figure 1 Ingesting records into Europeana (overall workflow)<br />

For the purpose of this report the possibilities of integration of <strong>OpenUp</strong>! metadata <strong>with</strong> the <strong>BHL</strong>-Europe web<br />

portal were evaluated. Investigations showed that the <strong>BHL</strong>-Europe web portal offers no API for data linkage.<br />

But <strong>BHL</strong>-Europe is part of the international <strong>BHL</strong> (Biodiversity Heritage Library) consortium<br />

http://biodivlib.wikispaces.com/<strong>BHL</strong>+Consortium+Membership and therefore the API “Bibliography by URL”<br />

provided by the Biodiversity Heritage Library (http://www.biodiversitylibrary.org/) serves very well to link the<br />

<strong>OpenUp</strong>! data to biodiversity heritage literature.<br />

For a better understanding this document starts <strong>with</strong> an overview of the <strong>BHL</strong> portal and its search functions.<br />

Then the API 1 of <strong>BHL</strong> called “Bibliography by URL” is introduced. This API is integrated in the Pentaho<br />

Transformation to create a new dc:relation element (see chapter 3).<br />

1 Application Programming Interface, http://en.wikipedia.org/wiki/Application_programming_interface 22 April 2013<br />

AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 4


2 THE <strong>BHL</strong> WEB PORTAL<br />

<strong>BHL</strong>, the Biodiversity Heritage Library, features a sustained human-accessible Web portal -<br />

http://www.biodiversitylibrary.org – giving access to biodiversity literature held in the storage system (see<br />

Figure 2).<br />

Figure 2 The <strong>BHL</strong> web portal<br />

The Biodiversity Heritage Library has implemented a new user interface on March 18, 2013 (see Figure 3 <strong>with</strong><br />

new features of the portal).<br />

AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 5


Figure 3 The new <strong>BHL</strong>-Europe web portal 2<br />

2.1 Access to bibliographic information in the <strong>BHL</strong> portal<br />

To search the <strong>BHL</strong> collections the General Search can be used by typing a term in the simple search box (see<br />

Figure 4 <strong>with</strong> the example “Parnassius apollo”).<br />

2 https://docs.google.com/document/d/1XXFm4MsAEX-3iZq7UlZDGvxkZHvFztf0gcINEfDU27s/pub# 22 April 2013.<br />

AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 6


Figure 4 General Search for “Parnassius apollo”<br />

By clicking on the magnifier symbol or by pressing enter the results will be shown (see Figure 5).<br />

Figure 5 Result of general search for “Parnassius apollo”<br />

The simple search is looking for matching results in the indexed fields of five different result types. The result<br />

overview shows the result types <strong>with</strong> the numbers of found records in brackets:<br />

In our example:<br />

Books/Journals (0)<br />

Articles/Chapters/Treatments (4)<br />

Authors (0)<br />

Subjects (0)<br />

Scientific Names (87)<br />

When clicking on the result type “Scientific Names” there are 87 Scientific Names found. Each name is listed<br />

<strong>with</strong> the number of its occurrence (see Figure 6).<br />

AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 7


Figure 6 Scientific Names found for “Parnassius apollo”<br />

By clicking on one of the result names (in this example “Parnassius apollo”) the complete bibliography of this<br />

term is shown <strong>with</strong> its Type, Title, Authors, Volume, Date and Page number (see Figure 7).<br />

AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 8


Figure 7 Bibliography for “Parnassius apollo”<br />

When clicking on the magnifier symbol at the right the bibliography result is shown in a new window (see Figure<br />

8).<br />

Figure 8 One of the bibliography results containing the scientific name “Parnassius apollo”<br />

AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 9


2.2 Contextualization <strong>with</strong> bibliographic information by “Bibliography by URL”<br />

<strong>BHL</strong> is building APIs (Application Programming Interfaces) to allow individual users and data providers to remix<br />

and reuse <strong>BHL</strong> content 3 and makes its metadata available for public use under the CC0 1.0 Universal (CC0 1.0)<br />

Public Domain Dedication license 4 .<br />

One of these APIs is “Bibliography by URL”. To easily link into a list of all pages containing a given scientific<br />

name, use the following URL:<br />

http://www.biodiversitylibrary.org/name/Scientific_name<br />

where Scientific_name is any uninomial, binomial, or trinomial. Replace spaces <strong>with</strong> the underscore<br />

( _ ) character 5 .<br />

In the previous example the link would be<br />

http://www.biodiversitylibrary.org/name/Parnassius_apollo<br />

When typing the link in the search bar the result shown in Figure 7 Bibliography for “Parnassius apollo” appears.<br />

3 INTEGRATING “BIBLIOGRAPHY BY URL” INTO PENTAHO<br />

The transformation of the <strong>OpenUp</strong>! metadata from the ABCD format to the ESE/EDM format is processed <strong>with</strong><br />

the Pentaho Kettle PDI tool.<br />

In the Transformation “ABCD206_to_ESE34” (see Figure 9) the step “ABCD to ESE XSL” (marked red) contains<br />

the stylesheet which is modified to integrate the <strong>BHL</strong> web portal data.<br />

3 http://biodivlib.wikispaces.com/Developer+Tools+and+API 22 April 2013.<br />

4 http://creativecommons.org/publicdomain/zero/1.0/ 22 April 2013.<br />

5 http://biodivlib.wikispaces.com/Developer+Tools+and+API 22 April 2013.<br />

AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 10


3.1 The Stylesheet<br />

Figure 9 Pentaho Transformation ABCD206_to_ESE34<br />

To integrate the <strong>BHL</strong> web portal to the Transformation in Pentaho, the XSL Stylesheet used in the Add Constants<br />

step “ABCD to ESE XSL” is extended (see Figure 10).<br />

AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 11


The following element is added:<br />

<br />

Figure 10 The XSL Stylesheet used in the Pentaho Transformation<br />

<br />

<br />

http://www.biodiversitylibrary.org/name/<br />

<br />

<br />

<br />

<br />

<br />

<br />

So every time a “FullScientificNameString” appears in a record the link<br />

http://www.biodiversitylibrary.org/name/{FullScientificNameString} is created and inserted in the dc:relation<br />

element in order to link to the available bibliography in the Biodiversity Heritage Library.<br />

AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 12


4 EXAMPLE<br />

After transforming a datasource <strong>with</strong> Pentaho the result can be viewed at the OAI-Provider platform. Figure 11<br />

shows one record of the butterfly “Parnassius apollo” including the <strong>BHL</strong>-link in the element “Relation”.<br />

Figure 11 Record on the OAI platform <strong>with</strong> <strong>BHL</strong>-link<br />

When clicking on the link the bibliography shown in Figure 12 (compare Figure 7) opens in a new window.<br />

AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 13


Figure 12 Bibliographic information for “Parnassius apollo”<br />

The contextualization of <strong>OpenUp</strong>! data <strong>with</strong> <strong>BHL</strong> content in the Europeana data display is demonstrated <strong>with</strong><br />

the butterfly “Sericinus montela Gray, 1852” in Figure 13.<br />

AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 14


Figure 13 Contextualization <strong>with</strong> <strong>BHL</strong> content<br />

On the top left hand side of the figure the record is shown in the Europeana portal <strong>with</strong> the <strong>BHL</strong> link in the<br />

metadata field “Relation”. When clicking on this link the bibliographic results of <strong>BHL</strong> will display (compare Figure<br />

12). Here all literature and the pages where the scientific name is quoted appear in the result list. Finally – when<br />

clicking on one of the bibliographic titles – the original book content appears.<br />

AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 15


5 LIST OF FIGURES<br />

Figure 1 Ingesting records into Europeana (overall workflow) .................................................... 4<br />

Figure 2 The <strong>BHL</strong> web portal ................................................................................................. 5<br />

Figure 3 The new <strong>BHL</strong>-Europe web portal ............................................................................... 6<br />

Figure 4 General Search for “Parnassius apollo” ....................................................................... 7<br />

Figure 5 Result of general search for “Parnassius apollo” .......................................................... 7<br />

Figure 6 Scientific Names found for “Parnassius apollo” ............................................................ 8<br />

Figure 7 Bibliography for “Parnassius apollo” ........................................................................... 9<br />

Figure 8 One of the bibliography results containing the scientific name “Parnassius apollo” ........... 9<br />

Figure 9 Pentaho Transformation ABCD206_to_ESE34 ........................................................... 11<br />

Figure 10 The XSL Stylesheet used in the Pentaho Transformation........................................... 12<br />

Figure 11 Record on the OAI platform <strong>with</strong> <strong>BHL</strong>-link ............................................................... 13<br />

Figure 12 Bibliographic information for “Parnassius apollo” ..................................................... 14<br />

Figure 13 Contextualization <strong>with</strong> <strong>BHL</strong> content ........................................................................ 15<br />

6 LIST OF REFERENCES<br />

ABCD - Access to Biological Collection Data. http://wiki.tdwg.org/ABCD 22 April 2013.<br />

ABCDEFG - Access to Biological Collection Databases Extended for Geosciences.<br />

http://wiki.tdwg.org/twiki/bin/view/ABCD/DesignAbcdExtensions 22 April 2013.<br />

<strong>BHL</strong> – Biodiversity Heritage Library, Developer Tools and API,<br />

http://biodivlib.wikispaces.com/Developer+Tools+and+API 22 April 2013.<br />

C3.2.1 Domain specific vocabularies for EUROPEANA – interim, <strong>OpenUp</strong>! component<br />

Europeana Data Model Mapping Guidelines. http://pro.europeana.eu/web/guest/edm-documentation 22 April<br />

2013.<br />

Europeana Data Model Definition v5.2.3. http://pro.europeana.eu/web/guest/edm-documentation 22 April<br />

2013.<br />

Europeana Semantic Elements (ESE) documentation http://pro.europeana.eu/web/guest/technicalrequirements<br />

22 April 2013.<br />

AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 16

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!