15.11.2012 Views

icegov2012 proceedings

icegov2012 proceedings

icegov2012 proceedings

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

conclusions that might arise rom the data, when interpreted by<br />

external parties [9].<br />

Notwithstanding, despite having seemingly unambiguous<br />

permission to release data to the public, the City Development<br />

department is often in the dark as to where data actually resides.<br />

Even if the data location is fully known, it does not warrant<br />

release. Illustrative of this is a request for the locations of public<br />

transport data. While this data is controlled by City Development,<br />

the data is owned on a metropolitan regional level. As a result,<br />

requests for the data’s release needs to go to metropolitan level,<br />

even though the data location is known. Due to its ownership<br />

being external, it is subsequently unclear whether the data can be<br />

classified under the CC0 license mentioned above. This illustrates<br />

that defining the judicial aspects of data release does not translate<br />

to release. Practical or technical issues remain.<br />

Due to the product focussed work process present at City<br />

Development, the extreme decentralization is not conducive to an<br />

open data policy. Additionally, departments within City<br />

Development are controllers of datasets, but do not necessarily<br />

own them. This potentially creates uncertainty as to the<br />

applicability of the license, but also ambiguity as to who has the<br />

final word in releasing the data.<br />

Furthermore, while City Development has been very clear in<br />

taking care of the judicial side of open data, the data that is<br />

currently classified for re-use does not compromise all data. The<br />

real-time status of many parking garages in the city is such an<br />

example. This data, whilst known and shown in certain locations<br />

in the city, is not available to third parties.<br />

While City Development and City Works both deal for a big part<br />

with data in a digital format, the City Archive is historically much<br />

more focussed on physical artefacts. Data release by City Archive<br />

has up until now been limited to the opening up of the search<br />

Application Programming Interface (API), in order to access the<br />

object metadata. There is thus no raw, unedited data about<br />

archived objects such as high-resolution images, or historical<br />

maps available in digital form, for reasons discussed earlier.<br />

Table 1. Indicators of data release based on observations<br />

Themes City Development City Works Library City Archive<br />

Use of data Execute development policy. Maintain public<br />

objects, producing<br />

geographical products.<br />

Support core services,<br />

internally and<br />

externally<br />

Source of data Externally gathered Internally produced Internally produced<br />

through services<br />

processes and<br />

externally bought<br />

Enhance services,<br />

catalogue objects<br />

Metadata internally<br />

created, artefacts<br />

externally supplied<br />

Data storage Decentralized Central Central Central (meta data<br />

only)<br />

Suitability of data for Medium: combination of High: many sets related Low: data personally High: data not<br />

release personally identifiable to public objects, non- identifiable or subject personally identifiable<br />

information, live sensor data personal.<br />

to copyright<br />

and owned by the<br />

and object data.<br />

Archive<br />

Data released None Object data of public Book metadata Search API of archive<br />

artefacts<br />

metadata<br />

Biggest threshold Decentralized data storage No legal framework for Personally identifiable Costs associated with<br />

and internal data<br />

the supply of data, data, uncertain digitalization of<br />

management processes. commercial interests. organizational status,<br />

copyright.<br />

archive artefacts.<br />

10<br />

As mentioned before, the Library has more comprehensive digital<br />

service provision, due to the reliance on data to manage the<br />

collection and serve the public. However, the only set of<br />

information that is currently available for use contains book meta<br />

data. The use of this data currently falls into a grey zone:<br />

bibliographic information gets bought from a commercial<br />

organization, which has implications on its release as open data,<br />

while for education purposes, re-use would be allowed. While the<br />

library is also in possession of other data that is without copyright<br />

restrictions, this comprises information about book lending, or<br />

statistics about members. This data could theoretically be<br />

personally identifiable, which complicates its use as open data.<br />

Even in cases where data is anonymized it might still be possible<br />

to extract personally identifiable data, through methods discussed<br />

by [18].<br />

From these observations, we note that the judicial issues, which<br />

include privacy and copyright concerns, the way data is collected<br />

and how it is stored, play an important role in data release.<br />

Therefore, we distinguish the following indicators that may be<br />

important for data release:<br />

o Use of data, i.e., the way data is used by the department.<br />

o Source of data, i.e., how is a set of data obtained?<br />

o Data Storage, i.e., is data stored centrally, or decentralized?<br />

o Suitability of data for release, i.e., are there rules and<br />

regulations that determine whether a dataset may be released<br />

or not, such as privacy or copyright.<br />

In Table 1, we summarize how the several departments score on<br />

each of the indicators together with the results of dependent<br />

variable “data released”. Based on the research performed, we<br />

also name the biggest threshold to data release. These relate to<br />

indicators important for data release.<br />

This overview of the release of data for use within the education<br />

program provides an initial view to the barriers of data release as<br />

experienced by the various participating partners. In the following

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!