From archivist to digital archivist - Hull History Centre
From archivist to digital archivist
Simon Wilson, Digital Archivist (AIMS Project)
Previous experience (in Jan 2010)
Used ICT with archives, but no previous experience with born-digital archives
Digitisation experience, Mersey Gateway NOF project – creating digital content and metadata etc
University had a digital repository (Fedora) in place, but the Archives had no policies or procedures for born-digital collections
From Archivist to Digital Archivist | 2nd Sep 2011 | 2
The AIMS Project
An Inter-Institutional Model for Stewardship, funded by the Andrew W. Mellon Foundation
Each partner employed a digital archivist for two years to process born-digital collections held by the partners
Started with traditional archival theory/principles; identify commonality - not looking to create a single solution
What is already in your store?
Born-digital media within paper collections – media has been accessioned (but not the contents)
Traditional survey skills:
- identify media & content issues, level of cataloguing etc
- quantity & range of media formats - inform plans/policies
Screenshot of survey spreadsheet using MS Excel
Dealing with depositors
Existing skills, but relationship is more critical than ever before
New questions to ask
- hardware, software, passwords, social media, e-mail accounts etc
Web-form to collect information from depositor; worked better as a framework for discussions
Screenshot of web-based survey developed by the AIMS Project
Depositors' awareness and perceptions
Some were very willing to work with us, allowed us to browse their server and identify material of interest (to us)
Some were keen but then got cold feet; reluctant to help us identify current records likely to be of archival interest
Emphasis on our role in managing information (regardless of format)
Must explain why timeframe for action is so radically different
Accessioning
Manifest of the material transferred (similar to initial box list on deposit)
Use DROID to identify file types – but it doesn't recognise everything!
Transfer files from old/current media to network storage (easier to refresh)
Update existing policies/procedures (not distinct policies)
- integrate specific aspects relating to born-digital archives
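A minimal sketch of the manifest step above, in Python. It is not DROID and does not use DROID's signature registry: the short SIGNATURES table and the function names (guess_format, sha256_of, build_manifest, write_manifest) are assumptions made for illustration. Files whose leading bytes match nothing in the table are labelled "unknown", which mirrors the point that identification tools do not recognise everything.

```python
import csv
import hashlib
from pathlib import Path

# A few well-known file signatures ("magic numbers"). Illustrative only:
# far smaller than the registry a tool such as DROID relies on.
SIGNATURES = {
    b"%PDF": "PDF",
    b"\x89PNG\r\n\x1a\n": "PNG",
    b"PK\x03\x04": "ZIP container",
    b"GIF8": "GIF",
    b"\xff\xd8\xff": "JPEG",
}


def guess_format(header: bytes) -> str:
    """Return a format label from the leading bytes, or 'unknown'."""
    for magic, label in SIGNATURES.items():
        if header.startswith(magic):
            return label
    return "unknown"


def sha256_of(path: Path) -> str:
    """Checksum a file in chunks so large files do not fill memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root: Path) -> list[dict]:
    """List every file under root with size, checksum and format guess."""
    rows = []
    for path in sorted(p for p in root.rglob("*") if p.is_file()):
        with path.open("rb") as handle:
            header = handle.read(16)
        rows.append({
            "path": path.relative_to(root).as_posix(),
            "bytes": path.stat().st_size,
            "sha256": sha256_of(path),
            "format": guess_format(header),
        })
    return rows


def write_manifest(rows: list[dict], out_file: Path) -> None:
    """Save the manifest as CSV, one row per transferred file."""
    fields = ["path", "bytes", "sha256", "format"]
    with out_file.open("w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=fields)
        writer.writeheader()
        writer.writerows(rows)
```

Running build_manifest on the folder copied from the media, and again after moving it to network storage, gives two lists whose checksums can be compared to confirm nothing changed in transfer.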
New forensic tools for accessioning
Forensic workstation - re-purpose an old PC with floppy disk, CD drive and USB ports
Engaged colleagues in ICT in our work – took the old PC and added internal zip drive
New PC for hard drives & laptops – larger volume of files to capture/transfer
Write-blocker allows safe capture of files from media/hard drives; use with FTK Imager software
Tableau write-blocker
Arrangement and Description
How do we arrange and describe born-digital archives?
New challenges to be faced:
– volume of material; is it practical to describe each file?
– material comes in an order; do we need to change this?
– can you make an appraisal decision on a file you can’t view?
– integrate material into single finding aid
Toad in the Hull, design for a toad that was never built (Larkin with Toads archive)
Arrangement and Description
AIMS digital archivists identified gap relating to A&D of born-digital archives in Fedora; functionality shaped and refined by our experiences
Developers at Stanford – proof of concept Hypatia tool
Use drag’n’drop to create the intellectual arrangement into Fedora sets (objects not moved)
Assign rights & permissions to a file, series or entire collection
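One way to picture arranging material without moving the objects, as a Python sketch. This is not Hypatia's or Fedora's data model: the class names, the access labels and the rule that the most specific setting wins (file, then series, then collection) are all assumptions for illustration.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class DigitalObject:
    object_id: str                 # hypothetical repository identifier
    storage_path: str              # where the file sits; never changed here
    access: Optional[str] = None   # None means "inherit from series/collection"


@dataclass
class Series:
    title: str
    access: Optional[str] = None
    member_ids: list[str] = field(default_factory=list)


@dataclass
class Collection:
    title: str
    access: str                    # collection-level default
    objects: dict[str, DigitalObject] = field(default_factory=dict)
    series: dict[str, Series] = field(default_factory=dict)

    def assign(self, object_id: str, series_title: str) -> None:
        """Place an object in a series by reference; the file is not moved."""
        for existing in self.series.values():
            if object_id in existing.member_ids:
                existing.member_ids.remove(object_id)
        target = self.series.setdefault(series_title, Series(series_title))
        target.member_ids.append(object_id)

    def effective_access(self, object_id: str) -> str:
        """Most specific rule wins: file, then series, then collection."""
        obj = self.objects[object_id]
        if obj.access is not None:
            return obj.access
        for existing in self.series.values():
            if object_id in existing.member_ids and existing.access is not None:
                return existing.access
        return self.access
```

Because a series only holds identifiers, re-arranging is a change to the lists, and storage_path stays the same throughout.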
Axiell
Working with colleagues at LSE, Parliamentary Archives, Wellcome Library etc
How CALM can exchange data (via an API) with a digital repository – first version of this was included on 9.1 CD
User experience
Paper archives offer a sense of discovery
Notebooks - plot outlines, notes of meetings with producers, snippets of dialogue etc with different work all intermingled
With born-digital material the same information is likely to be dispersed across separate files
User expectations
Expectation of access and delivery online
- confirming a user’s identity?
- material collected much closer to creation - but issues relating to access to sensitive material remain unchanged
New tools: new opportunities for access/analysis
- visualisation
- word clouds
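Word clouds rest on simple word counts. A minimal Python sketch, assuming text has already been extracted from the files; the stop-word list and the function name word_frequencies are illustrative, not something the project describes.

```python
import re
from collections import Counter

# Small illustrative stop-word list; a real analysis would use a fuller one.
STOP_WORDS = {"the", "a", "an", "and", "of", "to", "in", "with", "for", "is"}


def word_frequencies(texts, top_n=10):
    """Count the most common words across a set of extracted texts."""
    counts = Counter()
    for text in texts:
        words = re.findall(r"[a-z']+", text.lower())
        # Skip stop words and very short tokens before counting.
        counts.update(w for w in words if w not in STOP_WORDS and len(w) > 2)
    return counts.most_common(top_n)
```

The resulting (word, count) pairs are what a word-cloud display would scale its type sizes from.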
Archives workflow...
Quarantine: check for mould & infestation etc
Archives work room: initial review / appraisal; donor survey (talk to depositor); re-box the material into archive folders / boxes etc
REPOSITORY
Produce surrogate copies for user access (m/film)
Cataloguing room: first serious look at material, appraisal, arrangement & description; identify any legal/access restrictions etc
CALM catalogue
Original item produced in search room
WEB PLATFORM – catalogue available online
AIMS, CALM & Fedora workflow...
KEY: Digital Asset; Metadata
Digital Assets
Forensic workstation: virus check, manifest etc
Network storage
Initial review / appraisal
Donor survey: secure web-form on a SQLite database
Rubymatica: virus check, checksum, create manifest (with copy of original manifest and Donor Survey for ref); creates SIP
web services
FEDORA (Technical metadata) (Access & Security)
transformation layer – migrate file on to access format etc
Hypatia (Arrangement & Description Tool): inc browse/view/delete files; set intellectual arrangement and align digital objects with this; set user permissions; create descriptive record etc
web services
EAD for paper/hybrid collection
CALM (Descriptive metadata) catalogue
Digital Objects: access copy
Solr index
EAD Finding aid inc links to digital objects
Solr index
WEB PLATFORM (merged Solr indexes & Blacklight)
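The donor survey step in the workflow above sits on an SQLite database. The slides do not show its structure, so the table and column names below are hypothetical; the sketch only shows, using Python's standard sqlite3 module, how survey answers could be recorded and read back.

```python
import sqlite3


def open_survey_db(path=":memory:"):
    """Create the (hypothetical) donor survey table if it does not exist."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS donor_survey (
               id INTEGER PRIMARY KEY,
               depositor TEXT NOT NULL,
               question TEXT NOT NULL,
               answer TEXT
           )"""
    )
    return conn


def record_answer(conn, depositor, question, answer):
    """Store one answer; values are bound as parameters, not pasted into SQL."""
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT INTO donor_survey (depositor, question, answer) VALUES (?, ?, ?)",
            (depositor, question, answer),
        )


def answers_for(conn, depositor):
    """Return a depositor's answers in the order they were recorded."""
    cursor = conn.execute(
        "SELECT question, answer FROM donor_survey WHERE depositor = ? ORDER BY id",
        (depositor,),
    )
    return cursor.fetchall()
```

Keeping one row per question makes it easy to carry the answers forward alongside the manifest for reference.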
Scale of the task
• Made considerable progress - documented processes and workflows, and written Idiots Guides
• We don’t have a complete solution in place yet, but already have 40,000 born-digital files (33.5 GB) from just 4 collections
• Expect to have 1m+ born-digital files in 5 years & backlog measured in TB
• E-mail/databases to contend with
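A rough calculation from the figures above, as a Python sketch. It assumes decimal units (1 GB = 1,000 MB), treats "1m+" as exactly 1,000,000 files, and assumes the average file size stays the same, which e-mail and databases may well change.

```python
FILES_SO_FAR = 40_000        # from the slide
GIGABYTES_SO_FAR = 33.5      # from the slide
FILES_EXPECTED = 1_000_000   # "1m+" on the slide, taken as exactly 1m


def average_file_mb(files, gigabytes):
    """Average file size in MB, using 1 GB = 1,000 MB."""
    return gigabytes * 1000 / files


def projected_tb(files_expected, avg_mb):
    """Total volume in TB if every file were of average size."""
    return files_expected * avg_mb / 1_000_000


avg_mb = average_file_mb(FILES_SO_FAR, GIGABYTES_SO_FAR)
total_tb = projected_tb(FILES_EXPECTED, avg_mb)
print(f"average file size: {avg_mb:.4f} MB")
print(f"projected volume for 1m files: {total_tb:.4f} TB")
```

On those assumptions the average file is about 0.84 MB and 1m files come to about 0.84 TB, so a backlog measured in TB would need more files, larger files, or both.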
Conclusion
• Nature (and format) of archives has changed dramatically, but most of our professional skills are still relevant
• Some new tools, new software, new terminology and more acronyms to learn (esp OAIS)
• Can’t “keep everything” and hope Google creates an algorithm to enable access
• Confidence to make the transition from paper to digital; hands-on learning - play with sample files/media etc
Contact details
Simon Wilson
Digital Archivist (AIMS Project)
Hull History Centre
Tel 01482 317506
Email s.wilson@hull.ac.uk
www.hullhistorycentre.org.uk
AIMS Project blog – http://born-digital-archives.blogspot.com
Portrait of Claude-Henri Watalet blogging, after Jean-Baptiste Greuze
http://www.flickr.com/photos/notionscapital/2497196140/in/photostream/