C3.5.0 Integration with BHL Portal - OpenUp!
C3.5.0 Integration with BHL Portal - OpenUp!
C3.5.0 Integration with BHL Portal - OpenUp!
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Project Acronym: <strong>OpenUp</strong>!<br />
Grant Agreement No: 270890<br />
COMPONENT REPORT<br />
Project Title: Opening up the Natural History Heritage for Europeana<br />
<strong>C3.5.0</strong> <strong>Integration</strong> <strong>with</strong> <strong>BHL</strong> <strong>Portal</strong><br />
Revision: Version 1.1<br />
Authors (in alphabetical order):<br />
Benda Odo AIT Forschungsgesellschaft mbH<br />
Höller Astrid AIT Forschungsgesellschaft mbH<br />
Koch Gerda AIT Forschungsgesellschaft mbH<br />
Koch Walter AIT Forschungsgesellschaft mbH<br />
Malarky Gavin Natural History Museum London<br />
P Public<br />
Project co-funded by the European Commission <strong>with</strong>in the ICT Policy Support Programme<br />
Dissemination Level<br />
C Confidential, only for members of the consortium and the Commission Services x<br />
AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 1
Revision History<br />
Revision Date Author Organisation Description<br />
Draft 2013-03-11 G. Malarky NHM Investigation input<br />
Draft 2013-03-22 O. Benda AIT Draft and implementation<br />
Draft 2013-04-03 A. Höller AIT Draft and concept of document<br />
Draft 2013-04-04 A. Höller AIT <strong>Integration</strong> of <strong>BHL</strong> in Pentaho<br />
Draft 2013-04-05 A. Höller AIT Examples<br />
Draft 2013-04-22 G. Koch AIT Revision and Examples<br />
Draft 2013-04-22 A. Höller AIT References, Screenshots<br />
Draft 2013-04-22 W. Koch AIT Revision<br />
Version 1.0 2013-04-22 G. Koch AIT Finalization<br />
Version 1.1 2013-04-23 G. Koch AIT Page 1 update<br />
Statement of Originality<br />
This deliverable contains original unpublished work except where clearly<br />
indicated otherwise. Acknowledgement of previously published material and of<br />
the work of others has been made through appropriate citation, quotation or<br />
both.<br />
Distribution<br />
Recipient Date Version Accepted YES/NO<br />
TMG 2013-04-22<br />
2013-04-23<br />
Project Coordinator 2013-04-22<br />
2013-04-23<br />
AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 2<br />
1.0<br />
1.1<br />
1.0<br />
1.1
Table of Contents<br />
1 DESCRIPTION OF WORK ................................................................................................................... 4<br />
2 THE <strong>BHL</strong> WEB PORTAL ...................................................................................................................... 5<br />
2.1 Access to bibliographic information in the <strong>BHL</strong> portal .............................................. 6<br />
2.2 Contextualization <strong>with</strong> bibliographic information by “Bibliography by URL” .............. 10<br />
3 INTEGRATING “BIBLIOGRAPHY BY URL” INTO PENTAHO.................................................................. 10<br />
3.1 The Stylesheet ................................................................................................ 11<br />
4 EXAMPLE ....................................................................................................................................... 13<br />
5 LIST OF FIGURES ............................................................................................................................. 16<br />
6 LIST OF REFERENCES ....................................................................................................................... 16<br />
AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 3
1 DESCRIPTION OF WORK<br />
The aim of this document is to describe the contextualization of <strong>OpenUp</strong>! metadata <strong>with</strong> <strong>BHL</strong> bibliographic<br />
information data. Figure 1 shows the overall workflow of ingesting records into Europeana <strong>with</strong> the step<br />
covered in this document marked red.<br />
Figure 1 Ingesting records into Europeana (overall workflow)<br />
For the purpose of this report the possibilities of integration of <strong>OpenUp</strong>! metadata <strong>with</strong> the <strong>BHL</strong>-Europe web<br />
portal were evaluated. Investigations showed that the <strong>BHL</strong>-Europe web portal offers no API for data linkage.<br />
But <strong>BHL</strong>-Europe is part of the international <strong>BHL</strong> (Biodiversity Heritage Library) consortium<br />
http://biodivlib.wikispaces.com/<strong>BHL</strong>+Consortium+Membership and therefore the API “Bibliography by URL”<br />
provided by the Biodiversity Heritage Library (http://www.biodiversitylibrary.org/) serves very well to link the<br />
<strong>OpenUp</strong>! data to biodiversity heritage literature.<br />
For a better understanding this document starts <strong>with</strong> an overview of the <strong>BHL</strong> portal and its search functions.<br />
Then the API 1 of <strong>BHL</strong> called “Bibliography by URL” is introduced. This API is integrated in the Pentaho<br />
Transformation to create a new dc:relation element (see chapter 3).<br />
1 Application Programming Interface, http://en.wikipedia.org/wiki/Application_programming_interface 22 April 2013<br />
AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 4
2 THE <strong>BHL</strong> WEB PORTAL<br />
<strong>BHL</strong>, the Biodiversity Heritage Library, features a sustained human-accessible Web portal -<br />
http://www.biodiversitylibrary.org – giving access to biodiversity literature held in the storage system (see<br />
Figure 2).<br />
Figure 2 The <strong>BHL</strong> web portal<br />
The Biodiversity Heritage Library has implemented a new user interface on March 18, 2013 (see Figure 3 <strong>with</strong><br />
new features of the portal).<br />
AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 5
Figure 3 The new <strong>BHL</strong>-Europe web portal 2<br />
2.1 Access to bibliographic information in the <strong>BHL</strong> portal<br />
To search the <strong>BHL</strong> collections the General Search can be used by typing a term in the simple search box (see<br />
Figure 4 <strong>with</strong> the example “Parnassius apollo”).<br />
2 https://docs.google.com/document/d/1XXFm4MsAEX-3iZq7UlZDGvxkZHvFztf0gcINEfDU27s/pub# 22 April 2013.<br />
AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 6
Figure 4 General Search for “Parnassius apollo”<br />
By clicking on the magnifier symbol or by pressing enter the results will be shown (see Figure 5).<br />
Figure 5 Result of general search for “Parnassius apollo”<br />
The simple search is looking for matching results in the indexed fields of five different result types. The result<br />
overview shows the result types <strong>with</strong> the numbers of found records in brackets:<br />
In our example:<br />
Books/Journals (0)<br />
Articles/Chapters/Treatments (4)<br />
Authors (0)<br />
Subjects (0)<br />
Scientific Names (87)<br />
When clicking on the result type “Scientific Names” there are 87 Scientific Names found. Each name is listed<br />
<strong>with</strong> the number of its occurrence (see Figure 6).<br />
AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 7
Figure 6 Scientific Names found for “Parnassius apollo”<br />
By clicking on one of the result names (in this example “Parnassius apollo”) the complete bibliography of this<br />
term is shown <strong>with</strong> its Type, Title, Authors, Volume, Date and Page number (see Figure 7).<br />
AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 8
Figure 7 Bibliography for “Parnassius apollo”<br />
When clicking on the magnifier symbol at the right the bibliography result is shown in a new window (see Figure<br />
8).<br />
Figure 8 One of the bibliography results containing the scientific name “Parnassius apollo”<br />
AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 9
2.2 Contextualization <strong>with</strong> bibliographic information by “Bibliography by URL”<br />
<strong>BHL</strong> is building APIs (Application Programming Interfaces) to allow individual users and data providers to remix<br />
and reuse <strong>BHL</strong> content 3 and makes its metadata available for public use under the CC0 1.0 Universal (CC0 1.0)<br />
Public Domain Dedication license 4 .<br />
One of these APIs is “Bibliography by URL”. To easily link into a list of all pages containing a given scientific<br />
name, use the following URL:<br />
http://www.biodiversitylibrary.org/name/Scientific_name<br />
where Scientific_name is any uninomial, binomial, or trinomial. Replace spaces <strong>with</strong> the underscore<br />
( _ ) character 5 .<br />
In the previous example the link would be<br />
http://www.biodiversitylibrary.org/name/Parnassius_apollo<br />
When typing the link in the search bar the result shown in Figure 7 Bibliography for “Parnassius apollo” appears.<br />
3 INTEGRATING “BIBLIOGRAPHY BY URL” INTO PENTAHO<br />
The transformation of the <strong>OpenUp</strong>! metadata from the ABCD format to the ESE/EDM format is processed <strong>with</strong><br />
the Pentaho Kettle PDI tool.<br />
In the Transformation “ABCD206_to_ESE34” (see Figure 9) the step “ABCD to ESE XSL” (marked red) contains<br />
the stylesheet which is modified to integrate the <strong>BHL</strong> web portal data.<br />
3 http://biodivlib.wikispaces.com/Developer+Tools+and+API 22 April 2013.<br />
4 http://creativecommons.org/publicdomain/zero/1.0/ 22 April 2013.<br />
5 http://biodivlib.wikispaces.com/Developer+Tools+and+API 22 April 2013.<br />
AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 10
3.1 The Stylesheet<br />
Figure 9 Pentaho Transformation ABCD206_to_ESE34<br />
To integrate the <strong>BHL</strong> web portal to the Transformation in Pentaho, the XSL Stylesheet used in the Add Constants<br />
step “ABCD to ESE XSL” is extended (see Figure 10).<br />
AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 11
The following element is added:<br />
<br />
Figure 10 The XSL Stylesheet used in the Pentaho Transformation<br />
<br />
<br />
http://www.biodiversitylibrary.org/name/<br />
<br />
<br />
<br />
<br />
<br />
<br />
So every time a “FullScientificNameString” appears in a record the link<br />
http://www.biodiversitylibrary.org/name/{FullScientificNameString} is created and inserted in the dc:relation<br />
element in order to link to the available bibliography in the Biodiversity Heritage Library.<br />
AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 12
4 EXAMPLE<br />
After transforming a datasource <strong>with</strong> Pentaho the result can be viewed at the OAI-Provider platform. Figure 11<br />
shows one record of the butterfly “Parnassius apollo” including the <strong>BHL</strong>-link in the element “Relation”.<br />
Figure 11 Record on the OAI platform <strong>with</strong> <strong>BHL</strong>-link<br />
When clicking on the link the bibliography shown in Figure 12 (compare Figure 7) opens in a new window.<br />
AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 13
Figure 12 Bibliographic information for “Parnassius apollo”<br />
The contextualization of <strong>OpenUp</strong>! data <strong>with</strong> <strong>BHL</strong> content in the Europeana data display is demonstrated <strong>with</strong><br />
the butterfly “Sericinus montela Gray, 1852” in Figure 13.<br />
AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 14
Figure 13 Contextualization <strong>with</strong> <strong>BHL</strong> content<br />
On the top left hand side of the figure the record is shown in the Europeana portal <strong>with</strong> the <strong>BHL</strong> link in the<br />
metadata field “Relation”. When clicking on this link the bibliographic results of <strong>BHL</strong> will display (compare Figure<br />
12). Here all literature and the pages where the scientific name is quoted appear in the result list. Finally – when<br />
clicking on one of the bibliographic titles – the original book content appears.<br />
AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 15
5 LIST OF FIGURES<br />
Figure 1 Ingesting records into Europeana (overall workflow) .................................................... 4<br />
Figure 2 The <strong>BHL</strong> web portal ................................................................................................. 5<br />
Figure 3 The new <strong>BHL</strong>-Europe web portal ............................................................................... 6<br />
Figure 4 General Search for “Parnassius apollo” ....................................................................... 7<br />
Figure 5 Result of general search for “Parnassius apollo” .......................................................... 7<br />
Figure 6 Scientific Names found for “Parnassius apollo” ............................................................ 8<br />
Figure 7 Bibliography for “Parnassius apollo” ........................................................................... 9<br />
Figure 8 One of the bibliography results containing the scientific name “Parnassius apollo” ........... 9<br />
Figure 9 Pentaho Transformation ABCD206_to_ESE34 ........................................................... 11<br />
Figure 10 The XSL Stylesheet used in the Pentaho Transformation........................................... 12<br />
Figure 11 Record on the OAI platform <strong>with</strong> <strong>BHL</strong>-link ............................................................... 13<br />
Figure 12 Bibliographic information for “Parnassius apollo” ..................................................... 14<br />
Figure 13 Contextualization <strong>with</strong> <strong>BHL</strong> content ........................................................................ 15<br />
6 LIST OF REFERENCES<br />
ABCD - Access to Biological Collection Data. http://wiki.tdwg.org/ABCD 22 April 2013.<br />
ABCDEFG - Access to Biological Collection Databases Extended for Geosciences.<br />
http://wiki.tdwg.org/twiki/bin/view/ABCD/DesignAbcdExtensions 22 April 2013.<br />
<strong>BHL</strong> – Biodiversity Heritage Library, Developer Tools and API,<br />
http://biodivlib.wikispaces.com/Developer+Tools+and+API 22 April 2013.<br />
C3.2.1 Domain specific vocabularies for EUROPEANA – interim, <strong>OpenUp</strong>! component<br />
Europeana Data Model Mapping Guidelines. http://pro.europeana.eu/web/guest/edm-documentation 22 April<br />
2013.<br />
Europeana Data Model Definition v5.2.3. http://pro.europeana.eu/web/guest/edm-documentation 22 April<br />
2013.<br />
Europeana Semantic Elements (ESE) documentation http://pro.europeana.eu/web/guest/technicalrequirements<br />
22 April 2013.<br />
AIT, 2013 <strong>C3.5.0</strong> v1.1 p. 16