14.01.2014 Views

From archivist to digital archivist - Hull History Centre

From archivist to digital archivist - Hull History Centre

From archivist to digital archivist - Hull History Centre

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>From</strong> <strong>archivist</strong> <strong>to</strong> <strong>digital</strong> <strong>archivist</strong><br />

Simon Wilson, Digital Archivist (AIMS Project)


Previous experience (in Jan 2010)<br />

Used ICT with archives, but no previous experience<br />

with born-<strong>digital</strong> archives<br />

Digitisation experience, Mersey Gateway<br />

NOF project – creating <strong>digital</strong> content<br />

and metadata etc<br />

University had a <strong>digital</strong> reposi<strong>to</strong>ry (Fedora)<br />

in-place, but the Archives had no policies<br />

or procedures for born-<strong>digital</strong> collections<br />

<strong>From</strong> Archivist <strong>to</strong> Digital Archivist | 2nd Sep 2011 | 2


The AIMS Project<br />

An inter-Institutional Model for Stewardship funded by The<br />

Andrew W. Mellon foundation<br />

Each partner employed a <strong>digital</strong> <strong>archivist</strong> for two years <strong>to</strong><br />

process born-<strong>digital</strong> collections held by the partners<br />

Started with traditional archival theory/principles; identify<br />

commonality - not looking <strong>to</strong> create a single solution<br />

<strong>From</strong> Archivist <strong>to</strong> Digital Archivist | 2nd Sep 2011 | 3


What is already in your s<strong>to</strong>re?<br />

Born-<strong>digital</strong> media within paper collections – media has been<br />

accessioned (but not the contents..)<br />

Traditional survey skills:<br />

- identify media & content issues, level of cataloguing etc<br />

- quantity & range of media formats - inform plans/policies<br />

Screenshot of survey spreadsheet using MS Excel<br />

<strong>From</strong> Archivist <strong>to</strong> Digital Archivist | 2nd Sep 2011 | 4


Dealing with deposi<strong>to</strong>rs<br />

Existing skills, but relationship<br />

is more critical than ever before<br />

New questions <strong>to</strong> ask<br />

- hardware, software, passwords,<br />

social media, e-mail accounts etc<br />

Web-form <strong>to</strong> collect information<br />

from deposi<strong>to</strong>r; worked better as<br />

a framework for discussions<br />

Screenshot of web –based survey developed by the AIMS Project <strong>From</strong> Archivist <strong>to</strong> Digital Archivist | 2nd Sep 2011 | 5


Deposi<strong>to</strong>rs awareness and perceptions<br />

Some were very willing <strong>to</strong> work with us, allowed us <strong>to</strong> browse<br />

their server and identify material of interest (<strong>to</strong> us)<br />

Some were keen but then got cold feet; reluctant <strong>to</strong> help us<br />

identify current records likely <strong>to</strong> be<br />

of archival interest<br />

Emphasis on our role in managing<br />

information (regardless of format)<br />

Must explain why timeframe for<br />

action is so radically different<br />

<strong>From</strong> Archivist <strong>to</strong> Digital Archivist | 2nd Sep 2011 | 6


Accessioning<br />

Manifest of the material transferred<br />

(similar <strong>to</strong> initial box list on deposit)<br />

Use DROID <strong>to</strong> identify file types<br />

– but it doesn’t recognise everything!<br />

Transfer files from old/current media<br />

<strong>to</strong> network s<strong>to</strong>rage (easier <strong>to</strong> refresh)<br />

Update existing policies/procedures (not distinct policies)<br />

- integrate specific aspects relating <strong>to</strong> born-<strong>digital</strong> archives<br />

<strong>From</strong> Archivist <strong>to</strong> Digital Archivist | 2nd Sep 2011 | 7


New forensic <strong>to</strong>ols for accessioning<br />

Forensic workstation - re-purpose an old PC<br />

with floppy disk, CD drive and USB ports<br />

Engaged colleagues in ICT in our work –<br />

<strong>to</strong>ok the old PC and added internal zip drive<br />

New PC for hard drives & lap<strong>to</strong>ps – larger<br />

volume of files <strong>to</strong> capture/transfer<br />

Write-blocker allows safe capture of files<br />

from media/hard drives; use with FTK<br />

Imager software<br />

Tableau write-blocker<br />

<strong>From</strong> Archivist <strong>to</strong> Digital Archivist | 2nd Sep 2011 | 8


Arrangement and Description<br />

How do we arrange and describe born-<strong>digital</strong> archives?<br />

New challenges <strong>to</strong> be faced:<br />

– volume of material; is it practical <strong>to</strong><br />

describe each file?<br />

– material comes in an order; do we need<br />

<strong>to</strong> change this?<br />

– can you make an appraisal decision on<br />

a file you can’t view?<br />

– integrate material in<strong>to</strong> single finding aid<br />

Toad in the <strong>Hull</strong>, design for a <strong>to</strong>ad that was never built (Larkin with Toads archive)<br />

<strong>From</strong> Archivist <strong>to</strong> Digital Archivist | 2nd Sep 2011 | 9


Arrangement and Description<br />

AIMS <strong>digital</strong> <strong>archivist</strong>s identified gap relating <strong>to</strong> A&D of born<strong>digital</strong><br />

archives in Fedora; functionality shaped and refined by<br />

our experiences<br />

Developers at Stanford –<br />

proof of concept Hypatia <strong>to</strong>ol<br />

Use drag’n’drop <strong>to</strong> create the<br />

intellectual arrangement in<strong>to</strong><br />

Fedora sets (objects not moved)<br />

Assign rights & permissions <strong>to</strong><br />

a file, series or entire collection<br />

<strong>From</strong> Archivist <strong>to</strong> Digital Archivist | 2nd Sep 2011 | 10


Axiell<br />

Working with colleagues at LSE,<br />

Parliamentary Archives, Wellcome<br />

Library etc<br />

How CALM can exchange data<br />

(via an API) with a <strong>digital</strong><br />

reposi<strong>to</strong>ry – first version of this<br />

was included on 9.1 CD<br />

<strong>From</strong> Archivist <strong>to</strong> Digital Archivist | 2nd Sep 2011 | 11


User experience<br />

Paper archives offer a sense of discovery<br />

Notebooks - plot outlines, notes<br />

of meetings with producers,<br />

snippets of dialogue etc with<br />

different work all intermingled<br />

With born-<strong>digital</strong> material the same information is<br />

likely <strong>to</strong> be dispersed across separate files<br />

<strong>From</strong> Archivist <strong>to</strong> Digital Archivist | 2nd Sep 2011 | 12


User expectations<br />

Expectation of access and delivery online<br />

- confirming a user’s identity?<br />

- material collected much closer <strong>to</strong> creation - but issues<br />

relating <strong>to</strong> access <strong>to</strong> sensitive material remain unchanged<br />

New <strong>to</strong>ols:<br />

new opportunities<br />

for access/analysis<br />

- visualisation<br />

- word clouds<br />

<strong>From</strong> Archivist <strong>to</strong> Digital Archivist | 2nd Sep 2011 | 13


archives workflow...<br />

Quarantine<br />

Check for<br />

mould &<br />

infestation etc<br />

Archives<br />

Work room<br />

Initial review<br />

/ appraisal<br />

Donor survey<br />

talk <strong>to</strong> deposi<strong>to</strong>r<br />

Re-box the material in<strong>to</strong><br />

archive folders / boxes etc<br />

REPOSITORY<br />

Produce surrogate<br />

copies for user<br />

access (m/film)<br />

Cataloguing Room<br />

first serious look at material,<br />

appraisal, arrangement &<br />

description; identify any<br />

legal/access restrictions etc<br />

CALM<br />

catalogue<br />

original item produced<br />

in search room<br />

WEB PLATFORM – catalogue available online


AIMS, CALM & Fedora workflow...<br />

Digital Assets<br />

<br />

Forensic<br />

workstation<br />

Virus check,<br />

manifest etc<br />

Network<br />

s<strong>to</strong>rage<br />

Initial review<br />

/ appraisal<br />

Donor survey<br />

secure web-form<br />

on a SQL Lite database<br />

KEY<br />

Digital Asset<br />

Metadata<br />

Rubymatica<br />

virus check, checksum, create manifest<br />

(with copy of original manifest and<br />

Donor Survey for ref) creates SIP<br />

web<br />

services<br />

FEDORA<br />

(Technical metadata)<br />

(Access & Security)<br />

transformation layer –<br />

migrate file on <strong>to</strong> access<br />

format etc<br />

Hypatia (Arrangement<br />

& Description Tool)<br />

inc browse/view/delete files;<br />

set intellectual arrangement<br />

and align <strong>digital</strong> objects with<br />

this; set user permissions;<br />

create descriptive record etc<br />

web<br />

services<br />

EAD for paper/hybrid collection<br />

CALM<br />

(Descriptive metadata)<br />

catalogue<br />

Digital Objects<br />

access copy<br />

Solr index<br />

EAD Finding aid inc<br />

links <strong>to</strong> <strong>digital</strong> objects<br />

Solr index<br />

WEB PLATFORM (merged Solr indexes & Blacklight)


Scale of the task<br />

• Made considerable progress - document process, workflows<br />

and written Idiots Guides<br />

• We don’t have a complete solution in-place yet, already have<br />

40,000 born-<strong>digital</strong> files (33.5 GB) from just 4 collections<br />

• Expect <strong>to</strong> have 1m+ born-<strong>digital</strong> files<br />

in 5 years & backlog measured in TB<br />

• E-mail/databases <strong>to</strong> contend with<br />

<strong>From</strong> Archivist <strong>to</strong> Digital Archivist | 2nd Sep 2011 | 16


Conclusion<br />

• Nature (and format) of archives has changed dramatically,<br />

but most of our professional skills are still relevant<br />

• Some new <strong>to</strong>ols, new software, new terminology and more<br />

acronyms <strong>to</strong> learn (esp OAIS)<br />

• Can’t “keep everything” and hope Google create an<br />

algorithm <strong>to</strong> enable access<br />

• Confidence <strong>to</strong> make the transition from paper <strong>to</strong> <strong>digital</strong>;<br />

hands-on learning - play with sample files/media etc<br />

<strong>From</strong> Archivist <strong>to</strong> Digital Archivist | 2nd Sep 2011 | 17


Contact details<br />

Simon Wilson<br />

Digital Archivist (AIMS Project)<br />

<strong>Hull</strong> His<strong>to</strong>ry <strong>Centre</strong><br />

Tel 01482 317506<br />

Email s.wilson@hull.ac.uk<br />

www.hullhis<strong>to</strong>rycentre.org.uk<br />

AIMS Project blog – http://born-<strong>digital</strong>-archives.blogspot.com<br />

Portrait of Claude-Henri Watalet blogging, after Jean-Baptiste Greuze<br />

http://www.flickr.com/pho<strong>to</strong>s/notionscapital/2497196140/in/pho<strong>to</strong>stream/<br />

<strong>From</strong> Archivist <strong>to</strong> Digital Archivist | 2nd Sep 2011 | 18

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!