Born-Digital archives at Hull - Hull History Centre
Born-Digital archives at Hull:<br />
early steps & early lessons<br />
Simon Wilson, Senior Archivist
Overview<br />
Setting the scene<br />
- born-digital archives<br />
- the AIMS Project & White Paper<br />
Steps taken<br />
- background research<br />
- practical steps<br />
- questions asked<br />
Lessons learnt<br />
- where are we now?<br />
Archives and Society | 6th Mar 2012 | 2
Setting the scene
Born-digital archives<br />
The message and the medium are different<br />
Both are threatened by obsolescence<br />
Files are usually copies, with the creator<br />
keeping the originals (may still be in use)<br />
Not talking about digitisation, where<br />
material is converted into digital format<br />
The AIMS Project<br />
An Inter-Institutional Model for Stewardship (AIMS), funded by the<br />
Andrew W. Mellon Foundation
Each partner employed a digital archivist for two years to<br />
process born-digital material in their collections
Started with traditional archival theory/principles; identify<br />
commonality - not looking to create a single solution<br />
AIMS Project – White Paper<br />
Good practice, based on the partners' shared experiences
- written by archivists... for archivists<br />
- starts from ‘paper-based’ archival<br />
principles<br />
- not based on specific infrastructure<br />
or tools<br />
- technical and professional standards<br />
- builds upon the work of other projects<br />
AIMS Project – White Paper<br />
Framework split the workflow into four main areas:<br />
1. Collection Development<br />
2. Accessioning<br />
3. Arrangement & Description<br />
4. Discovery & Access<br />
Each section identifies key factors for success, pre-requisites,<br />
objectives, outcomes, tasks and decision points<br />
Appendices include workflows, policies, case studies, templates etc<br />
http://www2.lib.virginia.edu/aims/whitepaper/<br />
Steps taken
Background research<br />
With no previous experience<br />
- began to do some background research<br />
- articles, websites (DCC, DPC roadshows)<br />
- hardware changes, media formats<br />
- software updates & backward compatibility<br />
- workflow, OAIS, tools, other projects etc<br />
Could easily have continued like this<br />
......for the entire project!<br />
Collections survey<br />
Media already held amongst the paper collections<br />
– media has been accessioned (but not the contents...)<br />
Traditional survey skills:<br />
- identify media & content issues, level of cataloguing etc<br />
- quantity & range of media formats - inform plans/policies<br />
Screenshot of survey spreadsheet using MS Excel<br />
Automatic file lists<br />
Karen’s Directory Printer (free tool) retrieves key information<br />
about each file/folder, including date created, checksum etc<br />
Archives and/or depositor<br />
can create a file manifest at<br />
the time of transfer<br />
Use retrospectively for<br />
material already deposited<br />
Screenshot of Karen’s Directory Printer<br />
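The kind of file manifest described on this slide can be sketched in a few lines of Python. This is an illustration only, not how Karen's Directory Printer works internally: the function names, the CSV column names and the choice of SHA-256 are all assumptions, and the last-modified date stands in for "date created" because creation dates are not recorded consistently across operating systems.

```python
import csv
import hashlib
from datetime import datetime, timezone
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 65536) -> str:
    """Return the SHA-256 checksum of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root: Path) -> list[dict]:
    """List every file under root with its size, modified date and checksum."""
    rows = []
    for path in sorted(p for p in root.rglob("*") if p.is_file()):
        stat = path.stat()
        modified = datetime.fromtimestamp(stat.st_mtime, tz=timezone.utc)
        rows.append({
            "path": path.relative_to(root).as_posix(),
            "size_bytes": stat.st_size,
            "modified_utc": modified.isoformat(),
            "sha256": sha256_of(path),
        })
    return rows


def write_manifest(rows: list[dict], out_file: Path) -> None:
    """Write the manifest rows to a CSV file (hypothetical column layout)."""
    fields = ["path", "size_bytes", "modified_utc", "sha256"]
    with out_file.open("w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=fields)
        writer.writeheader()
        writer.writerows(rows)
```

Running `build_manifest` at the time of transfer and again later, then comparing the checksums, is one way to confirm that files have not changed in the meantime.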
Photography of media<br />
We have also developed processes for<br />
taking photographs of the digital media<br />
– wanted a simple process that could be<br />
easily (and quickly) repeated; using a<br />
transparency sheet proved effective<br />
– don’t expect to keep the media forever<br />
– the labels may include important<br />
information for researchers<br />
Forensic workstation<br />
Re-use an old PC<br />
- we were about to throw out<br />
- had floppy drive, CD drive and<br />
USB ports = options<br />
ICT provided clean disk image<br />
- also added internal zip drive<br />
Forensic workstation<br />
Purchased a new PC<br />
- needed larger capacity to capture hard<br />
drives & laptops (both considered to be<br />
reasonable scenarios in the next 2 years)<br />
Write-blocker allows safe capture of files<br />
from media/hard drives; use with FTK<br />
Imager software<br />
Dealing with depositors<br />
Existing skills, but relationship<br />
is more critical than ever before<br />
New questions to ask<br />
- hardware, software, passwords,<br />
social media, e-mail accounts etc<br />
Web-form to collect information<br />
from depositor; worked better as<br />
a framework for discussions<br />
Screenshot of web-based survey developed by the AIMS Project
Arrangement<br />
How do we arrange born-digital archives?<br />
New challenges to be faced:<br />
– material tends to come to us in an order;<br />
do we (or should we) change this?<br />
– integrate paper and born-digital material<br />
into single finding aid<br />
– can you make an appraisal decision on<br />
a file you can’t view?<br />
Design for the Toad in the Hull (Larkin with Toads archive)
Description<br />
How do we describe born-digital archives?<br />
New challenges to be faced:<br />
– volume of material<br />
– is it practical to describe each file?<br />
– should you describe a file you can’t view?<br />
Is the archivist’s description as critical as it was?<br />
- if (via the repository) you can let users search the text of<br />
the archive itself - not just the archivist’s description of it<br />
Design for The LarKintoad (Larkin with Toads archive)
Arrangement and Description – missing tool?<br />
AIMS digital archivists identified a gap relating to arrangement and description (A&D) of born-digital<br />
archives in Fedora; functionality shaped and refined by<br />
our experiences<br />
Developers at Stanford –<br />
proof of concept Hypatia tool<br />
Use drag’n’drop to create the<br />
intellectual arrangement into<br />
Fedora sets (objects not moved)<br />
Assign rights & permissions to<br />
a file, a series or entire collection<br />
Lessons learnt
Doing something<br />
Familiarity with concepts & issues encountered in<br />
background research - greatly enhanced by practical work<br />
- items / files identified in the survey were also an easier<br />
starting point than a new digital-only accession<br />
- download free tools including DROID,<br />
Karen’s Directory Printer, FTK Imager etc<br />
- create a set of files to play with<br />
- questions are good<br />
Make friends<br />
With ICT colleagues within your institution<br />
- forensic workst<strong>at</strong>ion<br />
- digital repository<br />
With other archives<br />
- share/exchange experiences<br />
- technical capabilities for specific media<br />
You are not alone<br />
Depositors’ awareness and perceptions<br />
Some were very willing to work with us, allowed us to browse<br />
their server and identify material of interest (to us)<br />
Some were keen but then got cold feet; reluctant to help us<br />
identify current records likely to be of archival interest<br />
We need to place emphasis on our role<br />
in managing information (regardless<br />
of format)<br />
We must explain why the timeframe for<br />
collecting/action is radically different<br />
User experience<br />
Paper archives offer a sense of discovery...<br />
Notebooks - plot outlines, notes<br />
of meetings with producers,<br />
snippets of dialogue etc with<br />
different work all intermingled<br />
With born-digital material the same information<br />
is likely to be dispersed across separate files<br />
User expectations<br />
Expectation of access and delivery online<br />
- confirming a user’s identity?<br />
- material collected much closer to creation - but issues<br />
relating to access to sensitive material remain unchanged<br />
New tools:<br />
new opportunities<br />
for access/analysis<br />
- visualisation<br />
- word clouds<br />
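As a rough illustration of the analysis behind a word cloud, the sketch below counts how often each word appears across the text of a set of files. It is not a tool used in the project: the function name, the tokenising rule and the stop-word list are all assumptions made for the example.

```python
import re
from collections import Counter

# Hypothetical, deliberately short stop-word list for the example.
STOP_WORDS = {"the", "and", "of", "a", "to", "in", "is", "it", "with"}


def word_frequencies(texts, top_n=10):
    """Return the top_n most frequent words across the given texts."""
    counts = Counter()
    for text in texts:
        # Lower-case the text and keep runs of letters and apostrophes.
        words = re.findall(r"[a-z']+", text.lower())
        counts.update(w for w in words if w not in STOP_WORDS and len(w) > 2)
    return counts.most_common(top_n)
```

The resulting (word, count) pairs are the usual input to a word-cloud renderer, where each count determines the display size of the word.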
Where are we now..?<br />
We have 7 born-digital collections – 46,576 files (122 GB)<br />
– manifest (accession receipt list)<br />
– files are currently stored on interim network storage<br />
- confidence to ask each depositor about born-digital archives<br />
Institutional practice<br />
– documented workflow, tools used etc based on “playing”<br />
– policies include reference to born-digital m<strong>at</strong>erials<br />
Born-digital archives have a much higher profile<br />
– included in library’s strategic aims and objectives<br />
Still to tackle...?<br />
– bulk ingest of 10,000+ files (ie a single accession)<br />
– A&D is largely dependent upon Hypatia (in development)<br />
– roll out more active collecting of born-digital material in next few years<br />
- still looking at options for access<br />
- still formats we haven’t tackled yet - e-mail & databases!<br />
What is being created within the University?<br />
- hope to start working with departments<br />
- start with Comms & Marketing dept<br />
Conclusion<br />
Nature (and format) of archives has changed dramatically, but<br />
most of our professional skills are still relevant<br />
Some new tools, new software, new terminology and more<br />
acronyms to learn (esp OAIS)<br />
Some material has already been “lost”<br />
– need to start doing something to minimise the impact<br />
Confidence to make the transition from paper to digital;<br />
hands-on learning - play with sample files/media etc<br />
Contact details<br />
Simon Wilson<br />
Senior Archivist<br />
Hull History Centre<br />
Tel 01482 317506<br />
Email s.wilson@hull.ac.uk<br />
www.hullhistorycentre.org.uk<br />
AIMS blog: http://born-digital-archives.blogspot.com<br />
White Paper: http://www2.lib.virginia.edu/aims/whitepaper/<br />
Portrait of Claude-Henri Watalet blogging, after Jean-Baptiste Greuze<br />
http://www.flickr.com/photos/notionscapital/2497196140/in/photostream/<br />