building blocks for the future: making controlled vocabularies available for the semantic web

53
Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web Dr. Barbara B. Tillett Chief, Policy & Standards Division Library of Congress For the Texas Library Association Conference April 13, 2011

Upload: elsie

Post on 30-Jan-2016

23 views

Category:

Documents


0 download

DESCRIPTION

Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web. Dr. Barbara B. Tillett Chief, Policy & Standards Division Library of Congress For the Texas Library Association Conference April 13, 2011. Linked Data. VIAF. LCSH. National Library of Sweden. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Building Blocks for the Future: Making Controlled

Vocabularies Available for theSemantic Web

Dr. Barbara B. TillettChief, Policy & Standards Division Library of CongressFor the Texas Library Association ConferenceApril 13, 2011

Page 2: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

DBpedia

National Library of Sweden

Linked Data LCSH

VIAF

Page 3: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Internet “Cloud”

Databases, Repositories

Web frontend

Services

3

Page 4: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Internet “Cloud”

Web frontend

ServicesVIAF

Databases, Repositories

LCSH

4

Page 5: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

5

VIAF Objectives

Facilitate exposure of authority data Reduce cataloging costs Simplify authority control (creation

and maintenance) internationally Provide authority data in form,

language, and script users want

Page 6: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

6

Tchaikovsky, Peter Ilich, 1840-1893

Tschaikowski, Peter I.

Čajkovskij, Petr Il'ič

Chaikovski, P. I

Чайковский, Петр Ильич, 1840-1893

Page 7: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

7

VIAF: The Virtual International Authority File

Original VIAF partners Library of Congress (LC) Deutsche Nationalbibliothek (DNB) Bibliothèque nationale de France (BnF) OCLC - host

Virtually combining the name authority files of all institutions into a single name authority service.

http://viaf.org/

Page 8: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

8

Virtual International Authority File

Matches names across 21 authority files of 18 institutions 13 million name records 10 million personas 4.5 million clusters

Based on KSY Cooperative Identities Hub, CEAL 2010-03

Page 9: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

9

  National Library of Australia •   National Library of the Czech Republic •   Bibliotheca Alexandrina (Egypt) •   Library of Congress/NACO • Getty Research Institute •   Deutsche Nationalbibliothek •   Bibliothèque nationale de France  

• National Library of Israel •   Istituto Centrale per il Catalogo Unico (Italy) •   Biblioteca National de Portugal •   Biblioteca Nacional de España •   National Library of Sweden •   Swiss National Library •   Vatican Library •   NUKAT Center (Poland) •   Library and Archives Canada •   NII (Japan) •   National Széchényi Library (Hungary)

Page 10: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

10

Current StatusAvailable as linked data with

URIs (Universal Resource Identifiers)

Unicode throughoutMARC 21, UNIMARC, and RDF

supportedUsage tripled this last year

Thousands of visits daily

Page 11: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Enhancing the Authorities

Bibliographic

Record

Derived Authorit

y

AuthorityRecord

Enhanced

Authority

11

Page 12: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Mining the Bibliographic Record LDR 00638ncm a22002057a 450 1 5773347 5 19960820101947.4 8 960815s1965 oruuua n eng 10 $a 96753638 040 $a DLC $c DLC019 $a 17706440020 $c $2.95028 22 $a 48418 $b Matrix Publ. Co. 045 2 $b d198006 $b d198007048 $b va01 $b ve01 $a ka01050 00 $a M1258 $b .L100 1 $a Leigh, Mitch, $d 1928-245 14 $a The man of La Mancha / $c by Mitch Leigh & Joe Darion; arr. By Roland Barrett & Alan Keown.260 $a Springfield, OR : $b Matrix Publ. Co., $c c1965.300 $a 1 score (16 p.) ; $c 18 x 27 cm.500 $a Brief record.650 0 $a Musicals $x Excerpts.600 10 $a Leigh, Mitch $x Musical settings.700 1 $a Darion, Joe.

Authors

LC Control Number

LC ClassificationTitl

e

Material Type

Publisher

Place of Publication

Language

Date ofPublication

Usage

Page 13: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Derived Authority Record

00505cz a2200157n 450 0 1 xlc 1 1 3 OCoLC 2 5 19880921165012.4 3 8 880831n|acannaab|n aaa c 4 040 $a OCoLC $b eng $c OCoLC $f viaf 5 100 1 $a Leigh, Mitch. 6 903 $a 88030979 7 910 14 $a the man of la mancha 8 921 $a matrix publ co 9 922 $a oru10 930 $a mitch leigh11 940 $a eng12 942 $a 23413 943 $a 196x14 944 $a cm15 950 1 $a darian, joe $d 1928-

All text is normalized

Subjects are grouped into

broad subject areas

Material type is coded

Publication date is by decadeCoauthor

Page 14: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Enhanced Authority Record00505cz a2200157n 450 0 1 oca01144962 1 5 19880921165012.4 2 8 840702n| acannaab| |n aaa ||| 3 10 $a n 88090379 4 40 $a DLC $c DLC $d DLC 5 100 1 $a Leigh, Mitch, $d 1928- 6 670 $a the man of la mancha, c1966: $b t.p. (Mitch Leigh) 7 903 $a 84758340 $9 1 8 903 $a 93710923 $9 1 9 910 11 $a impossible dream $9 110 910 11 $a century library of music and sound by mitch leigh $9 111 921 $a matrix publ co $9 112 921 $a kapp $9 213 922 $a oru $9 214 930 $a mitch leigh $9 115 940 $a eng $9 216 942 $a 234 $9 217 943 $a 196x $9 118 943 $a 197x $9 119 944 $a cm $9 220 950 11 $a darian, joe $d 1928- $9 121 950 11 $a wasserman, dale $9 1

Page 15: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

15

Information in Bibliographic Records He writes music

His primary subject area is music He was published in the 1960s and

1970s by Matrix Publ. Co. in Oregon and Kapp in New York

Worked with Joe Darion and Dale Wasserman

Mitch Leigh is the only name he has used on his publications

Etc.

Page 16: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

16

http://www.viaf.org

Page 17: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

17

viaf.org

Page 18: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Cervantes Saavedra, Miguel de 1547Cervantes de Salazar, Francisco, ca. 1514Cervantes, 1823-1898Cervantes Juan, 1395-1458Cervantes, Ignacio, 1847-1905Cervantes, Juan de, 1382-1453Cervantès, François, 1959-Cervani, Giulio, 1919-Cervantes, María AntonietaCervantes de Haro, fl. 1908-193-

As viewed Nov. 1, 2010

cer

Page 19: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Cervantes

Page 20: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Cervantes

Page 21: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Cervantes

Preferred Forms

Page 22: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Cervantes

Page 23: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Cervantes

Page 24: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Cervantes

Page 25: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Cervantes

Page 26: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Cervantes

Page 27: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Cervantes

Page 28: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

MA

RC

21

Cervantes

Page 29: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

RDF

Cervantes

Page 30: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

30

VIAF and Catalogers Use as a reference tool:

To resolve conflicts, questionable dates, forms of name, etc.

Cite as source in 670 $a, for example:BNF in VIAF, date searchedNat. Lib. of Australia in VIAF,

date searchedLAC in VIAF, date searched

Page 31: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

31

Next steps for VIAF Better searching More “Linked data”

Related persons as in WorldCat Identities, Wikipedia, etc.

Participants beyond librariesRights management agencies,

PublishersMuseums, Archives

More name typesCorporate and Family namesUniform titlesGeographic names… not topical terms

Page 32: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

32

SKOS

Simple Knowledge Organization System“Provides a model for expressing the

basic structure and content of concept schemes such as thesauri, classification schemes, subject heading lists, taxonomies, folksonomies, and other similar types of controlled vocabulary”—SKOS Primer

Page 33: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

33

SKOS

Based on the Resource Description Framework (RDF)Resources can be exchanged

between software applications and published on the Web

Interconnects data on the Web, helping create the Semantic Web

Page 34: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

34

id.loc.gov/authorities

“Authorities & Vocabularies” from the Library of Congress

Intent: To provide human and programmatic access to commonly found standards and vocabularies developed by LC

Page 35: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

35

“Authorities & Vocabularies”LCSH was the first offering

Subject headingsGenre/form headingsChildren’s subject headingsSubdivision recordsValidation records

Provides links from LCSH headings to RAMEAU headingsExploring Répertoire de vedettes-

matière (RVM) and others

Page 36: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

36

“Authorities & Vocabularies”Also includes:

Thesaurus for Graphic Materials (TGM)

MARC geographic area codesMARC language codesMARC relator codesPreservation Events … etc.

Page 37: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

37

“Authorities & Vocabularies”

BenefitsServers can download entire controlled vocabularies and the values within them, in multiple formats

Available for free on the Web

Page 38: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

38

Page 39: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

39

Page 40: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

40

Page 41: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

41

URI for specific LCSH records/ concepts:id.loc.gov/authorities/[LCCN]

For example:id.loc.gov/authorities/sh8508803

“Authorities & Vocabularies”

Page 42: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

42

Page 43: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

43

Page 44: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

44

Contact informationContent of site: Libby Dechman, [email protected] questions: Larry Dixson, [email protected]

“Authorities & Vocabularies”

Page 45: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

45

Controlled Vocabularies / Registry

Free on the Web at the Open Metadata Registry

http://metadataregistry.org/schema/list.html

Page 46: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

http://metadataregistry.org/rdabrowse.htmhttp://metadataregistry.org/rdabrowse.htm

Page 47: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Carrier type

Page 48: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

URI

Page 49: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

RDA Carrier Types

URI

Page 50: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

RDA Linked DataRDA Linked Data

Don Quixote

Madrid, 1979

English

Spanish

French

German

Cervantes

Library of CongressCopy 1Green leather binding

Exemplary novels

Wasserman

The Man of La Mancha

Tex

t

Movies…

Derivative

works

Subject

created

created created

Page 51: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

51

RDA Linked Terms for Languages

Don Quijote

Madrid, 1979

Inglés

Español

Francés

Alemán

Cervantes

Library of CongressCopia 1Encuadernación en piel color verde

Novelas Ejemplares

Wasserman

The Man of La Mancha

Text

oPelículas …

Obras

derivadas

Mater

ias

Page 52: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

Internet “Cloud”

Web frontend

ServicesVIAF

Databases, Repositories

LCSH

Page 53: Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web

iPhone apps to connect to libraries via WorldCat (OCLC)

Pic2shop apphttp://www.youtube.com/watch?

v=MHiuaDXipWQ RedLaser app

http://www.youtube.com/watch?v=fDv1cAYR5wc&feature=related

49