IRL: Irish Record Linkage, 1864-1913
Creating and Consuming Metadata from Transcribed Historical Vital Records for Ingestion in a Long-term Digital Preservation Platform
Dolores Grant (a), Christophe Debruyne (b), Rebecca Grant (a), and Sandra Collins (a)
(a) Digital Repository of Ireland, Royal Irish Academy, Dublin, Ireland
(b) ADAPT @ Trinity College Dublin, Dublin, Ireland
October 27, 2015 @ META4eS
Developing a platform applying semantic technologies to historical birth, death and marriage certificates. Answering questions such as: “How accurate are historic maternal mortality rates (MMR) and infant mortality rates (IMR) for Dublin?” The team consists of researchers (historians), digital archivists, and knowledge engineers.
[Figure: team composition. Labels: Knowledge and Linked Data Engineers; Historians; Digital Archivists]
General Register Office (GRO)
• Vital registration data: birth certificates, death certificates and marriage records.
• Digitised TIFF images of hardcopy indexes and registers.
• 2 TB of data
• Database describing the digitised records, allowing searches on some fields.
©General Records Office of Ireland 2014
In prior work (see [1]), we created a Linked Data platform that allowed Digital Archivists to transcribe register pages, which were then transformed into RDF. That RDF was then used to populate other triplestores to analyze that data. Part of the project, however, was also to investigate the digital long-term preservation of the digitized register pages and the corresponding RDF.
[Figure: Creation of IRL Knowledge Base. Labels: Relational Database; GRO Triplestore; Transformation; Vital Records Ontology; Separation of Concerns; Historical Events Ontology; IRL Triplestore; Data Analytics; Digital Archivist; Historian; LOD Cloud]
Related work
• Related work on the preservation of harvested metadata exists, e.g., in the context of GLAMs.
• Little work was to be found in the context of historical (vital) records. It was limited to integration problems and to addressing the problem of record linkage in databases.
• We also wanted to focus on research-project-agnostic transcription of historical vital records (separation of concerns).
Method: Creating RDF Documents
• Register pages are identified by a stamp number (e.g. “4646439”). We collect the triples around a page and related records with the following query to create an RDF document.
• PREFIX rec: <http://purl.org/net/irish-record-linkage/records#>
  DESCRIBE * { ?page rec:stampNumber "4646439" ; rec:withRecord ?record . }
• We also add a foaf:primaryTopic statement to the document.
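The query above is written for one stamp number. As a minimal sketch of how it could be generated for every register page, the following Python builds the same DESCRIBE query from a stamp number and formats the foaf:primaryTopic statement as an N-Triples line. The function names, the digits-only check, and the example.org IRIs in the usage note are assumptions made for illustration; they are not taken from the IRL platform.

```python
# Namespaces: REC is the prefix used in the slide's query; FOAF is the standard FOAF namespace.
REC = "http://purl.org/net/irish-record-linkage/records#"
FOAF = "http://xmlns.com/foaf/0.1/"


def describe_query(stamp_number: str) -> str:
    """Return the DESCRIBE query for one register page (hypothetical helper)."""
    # Assumption: stamp numbers are purely numeric, as in the slide's example.
    # Rejecting anything else also keeps quotes out of the query string.
    if not stamp_number.isdigit():
        raise ValueError(f"unexpected stamp number: {stamp_number!r}")
    return (
        f"PREFIX rec: <{REC}>\n"
        "DESCRIBE * {\n"
        f'  ?page rec:stampNumber "{stamp_number}" ;\n'
        "        rec:withRecord ?record .\n"
        "}"
    )


def primary_topic_triple(document_iri: str, page_iri: str) -> str:
    """Return an N-Triples line stating that the document is about the page."""
    return f"<{document_iri}> <{FOAF}primaryTopic> <{page_iri}> ."
```

For example, `describe_query("4646439")` yields the query shown on the slide, and `primary_topic_triple("http://example.org/doc/4646439", "http://example.org/page/4646439")` yields the extra statement for placeholder IRIs.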
Method: Creating Qualified Dublin Core Metadata
• Following the guidelines formulated in [2], we used XSPARQL [3] to transform RDF documents into Qualified Dublin Core (QDC) metadata documents. We thus have an RDF file and a QDC file for each register page.
Register Page
• District/Union/County [SPATIAL COVERAGE]
• Superintendent registrar's district
• Date certified as true copy by superintendent registrar [ISSUED]
• Date certified by registrar [CREATED]
• Forename/surname registrar on page
• Forename/surname superintendent registrar [CREATOR]
• Page number/Volume/Quarter
• Stamp number [IDENTIFIER / used in TITLE]
• Year registered [TEMPORAL COVERAGE]
Record
• Date of registration
• Title/forename/surname registrar
• Amendments
• Number in register
Certificate
• Forename/surname (of subject) [PART OF DESCRIPTION]
• Address (of subject)
• Sex (of subject) [PART OF DESCRIPTION]
• Forename/surname informant
• Qualification of informant
• Relationship of informant
• Residence of informant
Death Record
• Forename/surname of registrar
• Date of death [PART OF DESCRIPTION]
• Cause of death and duration of illness
• Condition
• Age last birthday
• Place of residence
• Rank, profession or occupation
[Diagram multiplicities: 1 and 0..10]
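The bracketed labels above indicate which Dublin Core element each transcribed field feeds. The project performed this transformation with XSPARQL; the sketch below swaps in Python's standard `xml.etree.ElementTree` purely to illustrate the mapping. The dictionary keys, the `qualifieddc` root element, the title wording, and the sample values are assumptions and do not come from the project's actual XSPARQL transformation or from the DRI guidelines.

```python
import xml.etree.ElementTree as ET

# Standard Dublin Core namespaces.
DC = "http://purl.org/dc/elements/1.1/"
DCTERMS = "http://purl.org/dc/terms/"
ET.register_namespace("dc", DC)
ET.register_namespace("dcterms", DCTERMS)

# Hypothetical field names (left) mapped to the elements named on the slide (right).
FIELD_TO_QDC = {
    "district": (DCTERMS, "spatial"),                   # SPATIAL COVERAGE
    "date_certified_true_copy": (DCTERMS, "issued"),    # ISSUED
    "date_certified_registrar": (DCTERMS, "created"),   # CREATED
    "superintendent_registrar": (DC, "creator"),        # CREATOR
    "stamp_number": (DC, "identifier"),                 # IDENTIFIER
    "year_registered": (DCTERMS, "temporal"),           # TEMPORAL COVERAGE
}


def build_qdc(page: dict) -> ET.Element:
    """Build a QDC-style XML element for one register page (illustrative only)."""
    root = ET.Element("qualifieddc")  # assumed root element name
    # The slide says the stamp number is used in the title; the wording is assumed.
    title = ET.SubElement(root, f"{{{DC}}}title")
    title.text = f"Register page {page['stamp_number']}"
    for field, (namespace, name) in FIELD_TO_QDC.items():
        value = page.get(field)
        if value:  # skip fields that were not transcribed
            ET.SubElement(root, f"{{{namespace}}}{name}").text = str(value)
    return root
```

Calling `ET.tostring(build_qdc({"stamp_number": "4646439", "year_registered": "1900"}), encoding="unicode")` serializes the result; the year here is a made-up sample value.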
[Figure: Part of the IRL Platform. Labels: Relational Database; GRO Triplestore; Transformation; Vital Records Ontology; Digital Archivist; RDF File 1 … n; transform; Qualified Dublin Core XML 1 … n; Register Page 1 … n; ingestion; Digital long-term preservation platform]
Method: Bulk Ingestion into a Digital Long Term Repository
• We adopted the Digital Repository of Ireland: http://repository.dri.ie/
• It provides item-by-item ingestion, or bulk ingestion via command-line tools.
• Files (digitized register pages, RDF and QDC) are named in a certain way to relate the QDC with the digitized asset and the RDF transcription.
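The slides do not spell out the naming convention. As a hedged sketch, suppose the three files for one register page share a stem (for instance the stamp number) and differ only in extension. Under that assumption, a pre-ingestion check could group files by stem and flag pages with a missing file. The extensions `.tif`, `.rdf` and `.xml`, and both function names, are hypothetical.

```python
from collections import defaultdict
from pathlib import PurePath

# Assumed extension for each kind of file; the real convention is not given in the slides.
ROLE_BY_SUFFIX = {".tif": "asset", ".rdf": "rdf", ".xml": "qdc"}


def group_by_stem(filenames):
    """Group file names by shared stem, e.g. '4646439.tif' and '4646439.rdf'."""
    bundles = defaultdict(dict)
    for name in filenames:
        path = PurePath(name)
        role = ROLE_BY_SUFFIX.get(path.suffix.lower())
        if role is None:
            continue  # ignore files that are not part of a bundle
        bundles[path.stem][role] = name
    return dict(bundles)


def incomplete_bundles(bundles):
    """Return the stems that lack an asset, an RDF file, or a QDC file."""
    required = set(ROLE_BY_SUFFIX.values())
    return sorted(stem for stem, files in bundles.items() if set(files) != required)
```

Running `incomplete_bundles(group_by_stem(names))` before a bulk upload lists the pages that would otherwise be ingested without their metadata or transcription.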
Conclusions and Future Work
• We created an automated process for creating assets, RDF transcriptions and associated metadata and uploading them to a long-term preservation platform.
• Evaluation is limited due to the data sharing agreements, both in terms of discoverability on the repository (via faceted search) and in terms of suitability of the metadata (via expert feedback).
• A comparison of Qualified Dublin Core with Encoded Archival Description (EAD) is to be conducted as well.
References
1. Christophe Debruyne, Oya Deniz Beyan, Rebecca Grant, Sandra Collins, Stefan Decker: On a Linked Data Platform for Irish Historical Vital Records. TPDL 2015: 99-110
2. Bustillo, M., Collins, S., Gallagher, D., Grant, R., Harrower, N., Kenny, S., Ní Cholla, R., O’Carroll, A., Redmond, S., Webb, S.: Qualified Dublin Core and the Digital Repository of Ireland (Grant, R. ed.). Tech. rep., Maynooth: Maynooth University; Dublin: Trinity College Dublin; Dublin: Royal Irish Academy; Galway: National University of Ireland, Galway (2015)
3. Dell’Aglio, D., Polleres, A., Lopes, N., Bischof, S.: Querying the Web of Data with XSPARQL 1.1. In: Verborgh, R., Mannens, E. (eds.) Proceedings of the ISWC Developers Workshop 2014, co-located with the 13th International Semantic Web Conference (ISWC 2014), Riva del Garda, Italy, October 19, 2014. CEUR Workshop Proceedings, vol. 1268, pp. 113–118. CEUR-WS.org (2014)
Questions?
More information
• Twitter: @IRL_Project
• Project website: http://irishrecordlinkage.wordpress.com/