WSEAS TRANSACTIONS on INFORMATION SCIENCE and APPLICATIONS
Giulio Concas, Filippo Eros Pani, Maria Ilaria Lunesu

may not be appropriate or convenient. Most archives use Qualified DC as the main schema for indexing and displaying metadata, and Simple DC to expose them through the OAI-PMH standard. Therefore, the adoption of Dublin Core must be thoroughly evaluated when an archive needs to comply with the interoperability principles required by OAI.

Out of the four criteria listed in Section 4.3, the most suitable technique for the case study is a hybrid of the second (mapping native metadata onto DC elements and creating new customized qualifiers for DC elements) and the third (creating a customized metadata schema identical to the native metadata set). The third criterion is more convenient for linguistic annotations, since a dedicated metadata schema can be created to preserve their granularity, while the second criterion is best suited for all other metadata, because it combines the granularity provided by qualifiers with the interoperability provided by DC metadata.

5.6 Application Profile for the Analytical Sound Archive of Sardinia

In creating a specific application profile for the ASAS, a "conservative" approach was taken towards the original Qualified DC elements and qualifiers, in order to use as many of them as possible for the formalization of descriptive and relational metadata. A special schema, identified by the prefix "asas", was created instead for annotations.
Its metadata were entered into the DC application profile as outlined below.

Metainformation or ASAS Annotation  DC Application Profile Metadata
Title                               dc.title
Author                              dc.creator
Publisher                           dc.publisher
Object                              dc.type
Description                         dc.type.category
Contributor                         dc.contributor
Annotator                           dc.contributor.annotatore
Location                            dc.coverage.spatial
Date                                dc.date.created
Occasion                            dc.subject
Source Document                     dc.relation.isbasedon
Accessibility                       dc.rights
Performer                           dc.contributor.sperakerPerformer
Performer's Age                     dc.description.speakerPerformer
Performer's Place of Origin         dc.description.speakerPerformer
Language                            dc.language
Source Completeness                 dc.description.integrità
Source No.                          dc.relation.ispartofseries
Source Section No.                  dc.relation.ispartofseries
Document Type                       dc.format.audioVideo
Format                              dc.format.medium
Acquisition Method                  dc.format.modoAcquisizione
Reading Type                        dc.type.lettura
Interview Type                      dc.type.intervista
Monody Type                         dc.type.monodia
Unison / Heterophony                dc.type.unisonoEterofonia
Accompaniment Type                  dc.type.monodiaAccompagnamento
Polyphony Type                      dc.type.polifonia
Instrumental                        dc.type.strumentale
Instrument                          dc.type.strumento
Singing Type                        dc.type.tipoCanto
Other                               dc.description
Syllable                            asas.annotazione.sillaba
Tone                                asas.annotazione.toni
Morpheme                            asas.annotazione.morfema
Phone                               asas.annotazione.fono
Word                                asas.annotazione.parola
Part of Speech                      asas.annotazione.pos
Syntagm                             asas.annotazione.sintagma
Sentence                            asas.annotazione.frase
Information Structure               asas.annotazione.strutturaInformativa
TurnPerf                            asas.annotazione.turnPerf
Musical Syllable                    asas.annotazione.sillabaMusicale
Metric Segment                      asas.annotazione.segmentoMetrico
Musical Segment                     asas.annotazione.segmentoMusicale
Tonal Centre                        asas.annotazione.centroTonale
Notation                            asas.annotazione.notazione
Ornamentation                       asas.annotazione.ornamentazione
Accents                             asas.annotazione.accenti
Melismatic Syllable                 asas.annotazione.sillabaMelismatica
ADD1                                asas.annotazione.annotazioneLibera

Table 1: Application profile for the ASAS

E-ISSN: 2224-3402 144 Issue 5, Volume 10, May 2013
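To illustrate how the profile in Table 1 relates to the Simple DC view exposed through OAI-PMH, the following minimal Python sketch splits profile field names into schema, element and qualifier, and then reduces a record to unqualified DC by dropping the qualifiers. Only a handful of rows from Table 1 are reproduced. The names `PROFILE`, `parse_field` and `to_simple_dc` and the sample values are hypothetical, and leaving the "asas" fields out of the Simple DC view is an assumption, not a documented behaviour of the archive.

```python
from collections import defaultdict
from typing import Dict, List, NamedTuple, Optional


class Field(NamedTuple):
    schema: str
    element: str
    qualifier: Optional[str]


def parse_field(name: str) -> Field:
    """Split 'schema.element[.qualifier]' into its parts."""
    parts = name.split(".")
    if len(parts) == 2:
        return Field(parts[0], parts[1], None)
    if len(parts) == 3:
        return Field(parts[0], parts[1], parts[2])
    raise ValueError(f"unexpected field name: {name!r}")


# A few rows of Table 1 (native label -> application profile field).
PROFILE: Dict[str, str] = {
    "Title": "dc.title",
    "Annotator": "dc.contributor.annotatore",
    "Location": "dc.coverage.spatial",
    "Reading Type": "dc.type.lettura",
    "Syllable": "asas.annotazione.sillaba",
}


def to_simple_dc(record: Dict[str, List[str]]) -> Dict[str, List[str]]:
    """Reduce a record keyed by native labels to unqualified DC elements.

    Qualifiers are dropped; fields outside the 'dc' schema are skipped
    (assumption: the 'asas' annotations have no Simple DC counterpart).
    """
    simple: Dict[str, List[str]] = defaultdict(list)
    for label, values in record.items():
        field = parse_field(PROFILE[label])
        if field.schema != "dc":
            continue
        simple[field.element].extend(values)
    return dict(simple)


# Hypothetical sample record.
record = {
    "Title": ["Sample clip"],
    "Annotator": ["A. Annotator"],
    "Reading Type": ["sample reading"],
    "Syllable": ["la", "la"],
}
simple_record = to_simple_dc(record)
print(simple_record)
```

In this sketch the annotator ends up under the plain `contributor` element, which shows the loss of granularity that motivates keeping Qualified DC for indexing and display.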
The next step is to enter the metadata into the knowledge management system: once the metainformation has been organized and structured, the KMS is configured to fit the selected metadata schema.

5.7 Choice and Customization of the KMS

DSpace, an open source software package developed in 2000 in a joint project between the Massachusetts Institute of Technology and Hewlett-Packard, provides all the tools necessary for creating and managing an IR based on the Open Access model [1]. Such an IR can collect, store, index, preserve and make accessible the information output created by universities and research institutes in digital format.

DSpace is designed as a central storage facility able to collect all kinds of content from the community relating to the institution, through a user interface that is as simple and intuitive as possible. It can collect various types of digital resources, including text, images, video, audio, articles and preprints, technical reports, working papers, datasets, and learning objects, directly from their creators.

DSpace was chosen to implement the Analytic Sound Archive of Sardinia because it fulfills all the requirements set by linguists and musicologists. It is fully customizable, natively supports the Qualified DC metadata schema, and is OAI-compatible through its support of OAI-PMH. The proposed approach makes it possible to insert the corpus and the associated knowledge into DSpace while preserving its structure and keeping it easy to query and update by adding or modifying its contents.
Each text of the corpus is inserted into a DSpace item so that it can be uniquely associated with all of the metadata needed for the linguistic analysis. The audio file containing the recordings and the original files with the annotations are loaded into the item as bitstreams, while the metadata are stored in the system database.

The first step was to add new customized qualifiers for the representation of Dublin Core descriptive metadata and a new schema called "asas" for the representation of the annotations. When inserting the corpus into DSpace, it was decided to create a specific item for each audio clip. It was therefore necessary to configure the release wizard offered by DSpace by changing the XML file responsible for the entry forms (input-forms.xml). The descriptive metadata identified by researchers (title, author, type of song, instrument, etc.) and all metadata corresponding to linguistic annotations (phone, morpheme, word, etc.) were associated with each item, together with the original file containing the audio recording and the original annotation file.

Figure 1: Customization of the DSpace Metadata Registry

After the insertion of metadata, the interface was customized by replacing the standard forms provided by DSpace with modules specifically designed to allow the creation of items and the release of DC metadata according to the specific needs of the project.
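As an illustration of the kind of entry added to input-forms.xml, the sketch below builds one `<field>` element for a customized qualifier of the application profile. It is a sketch under assumptions: the child element names (`dc-schema`, `dc-element`, `dc-qualifier`, `repeatable`, `label`, `input-type`, `hint`, `required`) follow the layout of input-forms.xml in DSpace releases of that period and should be checked against the installed version, while the helper `build_field`, the label and the hint text are hypothetical.

```python
import xml.etree.ElementTree as ET


def build_field(schema: str, element: str, qualifier: str, label: str,
                input_type: str = "onebox", repeatable: bool = False,
                hint: str = "") -> ET.Element:
    """Build one <field> element for an entry form (hypothetical helper)."""
    field = ET.Element("field")
    ET.SubElement(field, "dc-schema").text = schema
    ET.SubElement(field, "dc-element").text = element
    ET.SubElement(field, "dc-qualifier").text = qualifier
    ET.SubElement(field, "repeatable").text = "true" if repeatable else "false"
    ET.SubElement(field, "label").text = label
    ET.SubElement(field, "input-type").text = input_type
    ET.SubElement(field, "hint").text = hint
    # Assumption: an empty <required> element marks the field as optional.
    ET.SubElement(field, "required").text = ""
    return field


# Form field for the customized qualifier dc.type.lettura ("Reading Type").
reading_type = build_field("dc", "type", "lettura", "Reading Type",
                           hint="Type of reading recorded in the clip.")
xml_text = ET.tostring(reading_type, encoding="unicode")
print(xml_text)
```

Generating the fields from a list of profile entries, rather than editing the XML by hand, would keep the entry forms aligned with the application profile as qualifiers are added.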
The metadata on the annotations were instead inserted by direct import, because the high number of occurrences for each item made it difficult to enter them manually, as shown by Hillman and Westbrooks [21].

Finally, we customized the search interface of DSpace to adapt it to the new metadata and to the particular needs of the Analytic Sound Archive of Sardinia. In essence, all metadata corresponding to linguistic annotations needed to be indexed in DSpace's search engine so that a given audio clip could be found by searching for an associated record. Furthermore, some descriptive metadata such as location, type of performer and contribution were indexed to allow effective searching that exploits the granularity of the metadata.

5.7.1 Metadata Schemas

The metadata are stored and managed by DSpace through a special tool, the Metadata Registry, where the Qualified Dublin Core schema is configured by default. It can nevertheless be changed, and new customized schemas can be added. The system offers two ways to configure the registry: one is the graphical interface named Manakin, and the other can