18.02.2013 Views

Timing, hosts and locations of (grouped) events of NanoImpactNet

Timing, hosts and locations of (grouped) events of NanoImpactNet

Timing, hosts and locations of (grouped) events of NanoImpactNet

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

NanoSafetyCluster - Compendium 2012<br />

2 Summary<br />

NHECD is free access, robust <strong>and</strong> sustainable web based<br />

information system including a knowledge repository on the<br />

impact <strong>of</strong> nanoparticles on health, safety <strong>and</strong> the environment. It<br />

includes a robust content management system (CMS) as its<br />

backbone, to hold unstructured data (e.g., scientific papers <strong>and</strong><br />

other relevant publications). It also includes a mechanism for<br />

automatically updating its knowledge repository, thus enabling the<br />

creation <strong>of</strong> a large <strong>and</strong> developing collection <strong>of</strong> published data on<br />

environmental <strong>and</strong> health effects following exposure to<br />

nanoparticles.<br />

NHECD is based on text mining methods <strong>and</strong> algorithms that make<br />

possible the transition from metadata (such as author names,<br />

journals, keywords) to more sophisticated metadata (such as<br />

whether the paper contains graphs) <strong>and</strong> to additional information<br />

extracted from the scientific papers themself. These methods <strong>and</strong><br />

algorithms were implemented to specifically extract pertinent<br />

information from large amount <strong>of</strong> documents. NHECD created a<br />

systematic domain model <strong>of</strong> concepts <strong>and</strong> terms (i.e., a wide set <strong>of</strong><br />

domain taxonomies) to support the categorization <strong>of</strong> published<br />

papers <strong>and</strong> the information extraction process within this project.<br />

The unique features <strong>of</strong> NHECD allow different user groups -<br />

academics, industry, public institutions <strong>and</strong> the public at large - to<br />

easily access, locate <strong>and</strong> retrieve information relevant to their<br />

needs. The creation <strong>of</strong> the NHECD knowledge repository enriches<br />

public underst<strong>and</strong>ing <strong>of</strong> the impact <strong>of</strong> nanoparticles on health <strong>and</strong><br />

the environments; it supports a safe <strong>and</strong> responsible development<br />

<strong>and</strong> use <strong>of</strong> engineered nanoparticles; <strong>and</strong> represents a useful<br />

instrument for the implementation <strong>of</strong> relevant regulatory<br />

measures <strong>and</strong> law making.<br />

3 Background<br />

The potential <strong>of</strong> Nanotechnology to bring scientific advancement<br />

<strong>and</strong> different economic benefits is strictly dependent on the<br />

success <strong>of</strong> the approaches <strong>and</strong> strategies chosen to guarantee a<br />

safe <strong>and</strong> responsible development, production <strong>and</strong> use <strong>of</strong><br />

engineered nanoparticles <strong>and</strong> nano-technology-based materials<br />

<strong>and</strong> products.<br />

Research on the health, safety <strong>and</strong> environmental impact <strong>of</strong><br />

nanoparticles is currently rising due to the enlarged interest <strong>of</strong> the<br />

general public as well policy makers. On one h<strong>and</strong>, European<br />

industry <strong>and</strong> consumers want to learn more about these issues<br />

since a growing number <strong>of</strong> commercial products containing<br />

nanoparticles are currently advertised; on the other h<strong>and</strong>,<br />

environmental groups <strong>and</strong> ethical committees are asking for the<br />

implementation <strong>of</strong> clear <strong>and</strong> defined regulatory measures<br />

concerning the products development <strong>and</strong> research experiment<br />

involving Nanotechnology <strong>and</strong> especially nanoparticles.<br />

To meet this growing dem<strong>and</strong>, different institutions such as<br />

Universities <strong>and</strong> National Task groups have started launching<br />

electronic information repository, web portals etc. to provide to<br />

the public access to all sorts <strong>of</strong> documents on related topics.<br />

However, the majority <strong>of</strong> the existing databases <strong>and</strong> content<br />

management systems are operated manually, i.e. only a limited<br />

amount <strong>of</strong> data can be processed, <strong>and</strong> the taxonomy <strong>and</strong> ontology<br />

guiding the documents' categorization procedure .<br />

Due to the steadily increase <strong>of</strong> scientific papers <strong>and</strong> other types <strong>of</strong><br />

publications within this field, there is an urgent need for a new<br />

quality <strong>of</strong> information management capable <strong>of</strong> h<strong>and</strong>ling <strong>and</strong><br />

processing large amounts <strong>of</strong> different types <strong>of</strong> pertinent<br />

documents.<br />

4 NHECD Achievements To-Date<br />

NHECD, currently at the end <strong>of</strong> month 36, has by now achieved:<br />

• NHECD frontend, the interface <strong>of</strong> NHECD to the three<br />

communities targeted for it, namely nano-tox scientists, regulators<br />

<strong>and</strong> the public. NHECD frontend is a comprehensive solution<br />

designed to allow users to search for relevant information using a<br />

state-<strong>of</strong>-the-art graphical user interface (GUI) matching diverse<br />

types <strong>of</strong> users (regular users, sophisticated users <strong>and</strong> more). The<br />

GUI allows for taxonomic search, simple / advanced search, fulltext<br />

search, intelligent search (a unique method enables<br />

researchers to search for the information extracted from the<br />

scientific papers) <strong>and</strong> any combination <strong>of</strong> the above search<br />

methods.<br />

NHECD website is reachable at http://nhecd.jrc.ec.europa.eu<br />

• A backend system based on a robust content management<br />

system <strong>and</strong> its accompanying modules such as classification, full<br />

text search <strong>and</strong> more.<br />

• A crawling system intended to navigate selected websites in<br />

order to obtain (automatically) all the relevant published material<br />

related to NHECD.<br />

• A rich set <strong>of</strong> computer based taxonomies related to the NHECD<br />

target areas.<br />

• A body <strong>of</strong> classified knowledge consisting <strong>of</strong> scientific papers<br />

related to in-vivo/in-vitro, ecotox <strong>and</strong> occupational nanotoxicology.<br />

The corpus currently contains around 10,000 papers in<br />

NHECD subject.<br />

• Online validation tools (IE <strong>and</strong> crawler) used to train the system<br />

to extract information <strong>and</strong> enhance the quality <strong>of</strong> the results, as<br />

well as to build <strong>and</strong> maintain the above body <strong>of</strong> knowledge.<br />

• Information extraction (IE) algorithms <strong>and</strong> methods especially<br />

crafted to create a layer <strong>of</strong> comments (in the context <strong>of</strong> NHECD<br />

the comments are an information layer on top <strong>of</strong> the scientific<br />

paper itself) to enhance the knowledge found on NHECD's body <strong>of</strong><br />

knowledge. IE results satisfactory <strong>and</strong> maturing, yet there are<br />

some issues, such as relation extraction, for which the literature is<br />

still being developed. NHECD keeps working towards achieving<br />

optimal algorithms.<br />

• The infrastructure for the backend, frontend <strong>and</strong> all related<br />

components is established along with administration <strong>and</strong><br />

maintenance procedures.<br />

236 Compendium <strong>of</strong> Projects in the European NanoSafety Cluster

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!