14.01.2014 Views

Born-Digital archives at Hull - Hull History Centre

Born-Digital archives at Hull - Hull History Centre

Born-Digital archives at Hull - Hull History Centre

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

<strong>Born</strong>-<strong>Digital</strong> <strong>archives</strong> <strong>at</strong> <strong>Hull</strong>:<br />

early steps & early lessons<br />

Simon Wilson, Senior Archivist


Overview<br />

Setting the scene<br />

- born-digital <strong>archives</strong><br />

- the AIMS Project & White Paper<br />

Steps taken<br />

- background research<br />

- practical steps<br />

- questions asked<br />

Lessons learnt<br />

- where are we now?<br />

Archives and Society | 6th Mar 2012 | 2


Setting the scene


<strong>Born</strong>-digital <strong>archives</strong><br />

The message and the medium are different<br />

Both are thre<strong>at</strong>ened by obsolescence<br />

Files are usually copies with cre<strong>at</strong>or<br />

keeping the originals (may still be in use)<br />

Not talking about digitis<strong>at</strong>ion where<br />

m<strong>at</strong>erial is converted into digital form<strong>at</strong><br />

Archives and Society | 6th Mar 2012 | 4


The AIMS Project<br />

An inter-Institutional Model for Stewardship funded by The<br />

Andrew W. Mellon found<strong>at</strong>ion<br />

Each partner employed a digital archivist for two years to<br />

process born-digital m<strong>at</strong>erial in their collections<br />

Started with traditional archival theory/principles; identify<br />

commonality - not looking to cre<strong>at</strong>e a single solution<br />

Archives and Society | 6th Mar 2012 | 5


AIMS Project – White Paper<br />

Good-practice, based on the partners shared experiences<br />

- written by archivists….for archivists<br />

- starts from ‘paper-based’ archival<br />

principles<br />

- not based on specific infrastructure<br />

or tools<br />

- technical and professional standards<br />

- build-upon work of other projects<br />

Archives and Society | 6th Mar 2012 | 6


AIMS Project – White Paper<br />

Framework split the workflow into four main areas:<br />

1. Collection Development<br />

2. Accessioning<br />

3. Arrangement & Description<br />

4. Discovery & Access<br />

Each section identifies key factors for success, pre-requisites,<br />

objectives, outcomes, tasks and decision points<br />

Appendices inc workflows, policies, case studies, templ<strong>at</strong>es etc<br />

http://www2.lib.virginia.edu/aims/whitepaper/<br />

Archives and Society | 6th Mar 2012 | 7


Steps taken


Background research<br />

With no previous experience<br />

- began to do some background research<br />

- articles, websites (DCC, DPC roadshows)<br />

- hardware changes, media form<strong>at</strong>s<br />

- software upd<strong>at</strong>es & backward comp<strong>at</strong>ibility<br />

- workflow, OAIS, tools, other projects etc<br />

Could easily have continued like this<br />

......for the entire project!<br />

Archives and Society | 6th Mar 2012 | 9


Collections survey<br />

Media already held amongst the paper collections<br />

– media has been accessioned (but not the contents...)<br />

Traditional survey skills:<br />

- identify media & content issues, level of c<strong>at</strong>aloguing etc<br />

- quantity & range of media form<strong>at</strong>s - inform plans/policies<br />

Screenshot of survey spreadsheet using MS Excel<br />

Archives and Society | 6th Mar 2012 | 10


Autom<strong>at</strong>ic file lists<br />

(Karen’s Directory Printer) free tool; retrieves key inform<strong>at</strong>ion<br />

about each file/folder inc d<strong>at</strong>e cre<strong>at</strong>ed, checksum etc<br />

Archives and/or depositor<br />

can cre<strong>at</strong>e a file manifest <strong>at</strong><br />

the time of transfer<br />

Use retrospectively for<br />

m<strong>at</strong>erial already deposited<br />

Screenshot of Karen’s Directory Printer<br />

Archives and Society | 6th Mar 2012 | 11


Photography of media<br />

We have also developed processes for<br />

taking photographs of the digital media<br />

– wanted a simple process, th<strong>at</strong> could be<br />

easily (and quickly) repe<strong>at</strong>ed; using a<br />

transparency sheet proved effective<br />

– don’t expect to keep the media forever<br />

– the labels may include important<br />

inform<strong>at</strong>ion for researchers<br />

Archives and Society | 6th Mar 2012 | 12


Forensic workst<strong>at</strong>ion<br />

Re-use an old PC<br />

- we were about to throw out<br />

- had floppy drive, CD drive and<br />

USB ports = options<br />

ICT provided clean disk image<br />

- also added internal zip drive<br />

Archives and Society | 6th Mar 2012 | 13


Forensic workst<strong>at</strong>ion<br />

Purchased a new PC<br />

- needed larger capacity to capture hard<br />

drives & laptops (both considered to be<br />

reasonable scenarios in the next 2 years)<br />

Write-blocker allows safe capture of files<br />

from media/hard drives; use with FTK<br />

Imager software<br />

Archives and Society | 6th Mar 2012 | 14


Dealing with depositors<br />

Existing skills, but rel<strong>at</strong>ionship<br />

is more critical than ever before<br />

New questions to ask<br />

- hardware, software, passwords,<br />

social media, e-mail accounts etc<br />

Web-form to collect inform<strong>at</strong>ion<br />

from depositor; worked better as<br />

a framework for discussions<br />

Screenshot of web –based survey developed by the AIMS Project<br />

Archives and Society | 6th Mar 2012 | 15


Arrangement<br />

How do we arrange born-digital <strong>archives</strong>?<br />

New challenges to be faced:<br />

– m<strong>at</strong>erial tends to come to us in an order<br />

do we (or should we) change this?<br />

– integr<strong>at</strong>e paper and born-digital m<strong>at</strong>erial<br />

into single finding aid<br />

– can you make an appraisal decision on<br />

a file you can’t view?<br />

Design for the Toad in the <strong>Hull</strong> (Larkin with Toads archive)<br />

Archives and Society | 6th Mar 2012 | 16


Description<br />

How do we describe born-digital <strong>archives</strong>?<br />

New challenges to be faced:<br />

– volume of m<strong>at</strong>erial<br />

– is it practical to describe each file?<br />

– should you describe a file you can’t view<br />

Is the archivist’s description as critical as it was?<br />

- if (via the repository) you can let users search the text of<br />

the archive itself - not just the archivist’s description of it<br />

Design for The LarKintoad (Larkin with Toads archive)<br />

Archives and Society | 6th Mar 2012 | 17


Arrangement and Description – missing tool?<br />

AIMS digital archivists identified gap rel<strong>at</strong>ing to A&D of borndigital<br />

<strong>archives</strong> in Fedora; functionality shaped and refined by<br />

our experiences<br />

Developers <strong>at</strong> Stanford –<br />

proof of concept Hyp<strong>at</strong>ia tool<br />

Use drag’n’drop to cre<strong>at</strong>e the<br />

intellectual arrangement into<br />

Fedora sets (objects not moved)<br />

Assign rights & permissions to<br />

a file, a series or entire collection<br />

Archives and Society | 6th Mar 2012 | 18


Lessons learnt


Doing something<br />

Familiarity with concepts & issues encountered in<br />

background research - gre<strong>at</strong>ly enhanced by practical work<br />

- items / files identified in the survey was also an easier<br />

starting point than a new digital-only accession<br />

- download free tools including DROID,<br />

Karen, FTK Imager etc<br />

- cre<strong>at</strong>e a set of files to play with<br />

- questions are good<br />

Archives and Society | 6th Mar 2012 | 20


Make friends<br />

With ICT colleagues with-in your institution<br />

- forensic workst<strong>at</strong>ion<br />

- digital repository<br />

With other <strong>archives</strong><br />

- share/exchange experiences<br />

- technical capabilities for specific media<br />

You are not alone<br />

Archives and Society | 6th Mar 2012 | 21


Depositors awareness and perceptions<br />

Some were very willing to work with us, allowed us to browse<br />

their server and identify m<strong>at</strong>erial of interest (to us)<br />

Some were keen but then got cold feet; reluctant to help us<br />

identify current records likely to be of archival interest<br />

We need to place emphasis on our role<br />

in managing inform<strong>at</strong>ion (regardless<br />

of form<strong>at</strong>)<br />

We must explain why timeframe for<br />

collecting/action is radically different<br />

Archives and Society | 6th Mar 2012 | 22


User experience<br />

Paper <strong>archives</strong> offer a sense of discovery...<br />

Notebooks - plot outlines, notes<br />

of meetings with producers,<br />

snippets of dialogue etc with<br />

different work all intermingled<br />

With born-digital m<strong>at</strong>erial the same inform<strong>at</strong>ion<br />

is likely to be dispersed across separ<strong>at</strong>e files<br />

Archives and Society | 6th Mar 2012 | 23


User expect<strong>at</strong>ions<br />

Expect<strong>at</strong>ion of access and delivery online<br />

- confirming a user’s identity?<br />

- m<strong>at</strong>erial collected much closer to cre<strong>at</strong>ion - but issues<br />

rel<strong>at</strong>ing to access to sensitive m<strong>at</strong>erial remain unchanged<br />

New tools:<br />

new opportunities<br />

for access/analysis<br />

- visualis<strong>at</strong>ion<br />

- word clouds<br />

Archives and Society | 6th Mar 2012 | 24


Where are we now..?<br />

We have 7 born-digital collections – 46,576 files (122 GB)<br />

– manifest (accession receipt list)<br />

– files are currently stored on interim network storage<br />

- confidence to ask each depositor about b-d <strong>archives</strong><br />

Institutional practice<br />

– documented workflow, tools used etc based on “playing”<br />

– policies include reference to born-digital m<strong>at</strong>erials<br />

<strong>Born</strong>-digital <strong>archives</strong> has a much higher profile<br />

– included in library’s str<strong>at</strong>egic aims and objectives<br />

Archives and Society | 6th Mar 2012 | 25


Still to tackle...?<br />

– bulk-ingest import 10,000+ files (ie a single accession)<br />

– A&D is largely dependant upon Hyp<strong>at</strong>ia (in development)<br />

– roll-out more active collecting of b-d in next few years<br />

- still looking <strong>at</strong> options for access<br />

- still form<strong>at</strong>s we haven’t tackled yet - e-mail & d<strong>at</strong>abases !<br />

Wh<strong>at</strong> is being cre<strong>at</strong>ed with-in the University?<br />

- hope to start working with departments<br />

- start with Comms & Marketing dept<br />

Archives and Society | 6th Mar 2012 | 26


Conclusion<br />

N<strong>at</strong>ure (and form<strong>at</strong>) of <strong>archives</strong> has changed dram<strong>at</strong>ically, but<br />

most of our professional skills are still relevant<br />

Some new tools, new software, new terminology and more<br />

acronyms to learn (esp OAIS)<br />

Some m<strong>at</strong>erial has already been “lost”<br />

– need to start doing something to minimise the impact<br />

Confidence to make the transition from paper to digital;<br />

hands-on learning - play with sample files/media etc<br />

Archives and Society | 6th Mar 2012 | 27


Contact details<br />

Simon Wilson<br />

Senior Archivist<br />

<strong>Hull</strong> <strong>History</strong> <strong>Centre</strong><br />

Tel 01482 317506<br />

Email s.wilson@hull.ac.uk<br />

www.hullhistorycentre.org.uk<br />

AIMS blog: http://born-digital-<strong>archives</strong>.blogspot.com<br />

White Paper: http://www2.lib.virginia.edu/aims/whitepaper/<br />

Portrait of Claude-Henri W<strong>at</strong>alet blogging, after Jean-Baptiste Greuze<br />

http://www.flickr.com/photos/notionscapital/2497196140/in/photostream/<br />

Archives and Society | 6th Mar 2012 | 28

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!