connecting museums with linked data
DESCRIPTION
When Culture Encounters Internet Conference, December 14th-15th, 2010, TaipeiTRANSCRIPT
Hideaki Takeda / National Institute of Informatics
Connecting Museums with Linked Data以鏈結資料連結博物館
Hideaki Takeda 武田英明[email protected]
National Institute of Informatics
国立情報学研究所
When Culture Encounters Internet Conference, December 14th-15th, 2010, Taipei
With LODAC project teamI. Ohmukai, F. Kato, T. Kamura, T. Takahashi, H. Ueda
Hideaki Takeda / National Institute of Informatics
Outline
Information Cycle Linked Data and Museum Data LODAC Museum
Hideaki Takeda / National Institute of Informatics
Information Cycle
Share
Collect
Use
Publish
Create&
Information can be created only based on existing information No information can be created out of nothing Collect – Use & Create
Value of information is how much it is used No value for information without use Use & Create – Publish
Accumulation of information is the wealth of society Distribution of information is the health of society Publish – Share -- Collect
Hideaki Takeda / National Institute of Informatics
Information Cycle
Share
Collect
Use
Publish
Create&
Before Gutenberg Media
Hand-writing booksOral communication
Information Cycle isSlowSmall amountFew People
After Gutenberg, the age of Mass media arrived …
Hideaki Takeda / National Institute of Informatics
Two social layers on information cycle with Mass Media
Share
Collect
Use
Publish
Create
Writer, Artist, ScholarMass media
Government
&
Hideaki Takeda / National Institute of Informatics
Two social layers on information cycle with Mass media
Share
Collect
Use
Publish
Create
Writer, Artist, ScholarMass media
Government
&
OrdinaryPeople
Collect
Use
Create&
Hideaki Takeda / National Institute of Informatics
Two social layers on information cycle with Mass Media
Share
Collect
Use
Publish
Create
Writer, Artist, ScholarMass media
Government
OrdinaryPeople &
Hideaki Takeda / National Institute of Informatics
WebShare
Collect
Use
Publish
Internet
Web Server
Web BrowserCreate
& HTML Editor
Search Engine
Information Cycle with Web
Open Door to Information Cycle for Ordinary People
Hideaki Takeda / National Institute of Informatics
WebInformation Cycle
Share
Collect
Use
Publish
Create&
Web accelerate Information Cycle in Speed Quantity People
Hideaki Takeda / National Institute of Informatics
WebShare
Collect
Use
Publish
Create&
Internet
Web Server
Web Browser HTML Editor
Search Engine
Information Cycle with Web
Hideaki Takeda / National Institute of Informatics
Metadata is the platform of Information Cycle
&
Metadata
Share
Collect
Use
Publish
&Create
Hideaki Takeda / National Institute of Informatics
Linked Data will be the platform of Information Cycle on the content layer
&
Metadata
Share
Collect
Use
Publish
&Create
Linked Data
Hideaki Takeda / National Institute of Informatics
LOD Cloud(Linking Open Data)
Hideaki Takeda / National Institute of Informatics
Linked Data – Four Rules
Linked Data is “Web of Data” (Traditional) Web is “Web of Documents”
What is Linked Data? RDF triples Can refer others Can be referred by others,
Four Rules for Linked Data Use URIs as names for things Use HTTP URIs so that people can look up those names. When someone looks up a URI, provide useful information, using
the standards (RDF*, SPARQL) Include links to other URIs. so that they can discover more things
Linked Data, TBL, http://www.w3.org/DesignIssues/LinkedData.html
Hideaki Takeda / National Institute of Informatics
Importance of data in public sector as Linked Data
In principle, it should be shared It is the basic knowledge of our society Data in public sector
Library Museum Archive Government
Hideaki Takeda / National Institute of Informatics
Challenges for Linked Data in Japan
Lack of culture of sharing Immature community for linked data Lack of central data Set Difficulty of multi-lingual data
Anyway let’s start!
Hideaki Takeda / National Institute of Informatics
LODAC Project
Open Social Semantic Web Platform for Academic Resources Providing platforms for Linked Data Practicing data accumulation and publishing
Interested Areas Museum information Geographical information, especially geographical names Local information …
Hideaki Takeda / National Institute of Informatics
Museum data as LOD
The state-of-the-art of museum information in Japan Distributed
Self maintained Isolated
OpaqueSelf designedMessy
Aggregating and associating museum information LODAC-Museum (tentative)
Hideaki Takeda / National Institute of Informatics
Over 1.4 billion collectionsOver 1,000 organizations
Hideaki Takeda / National Institute of Informatics
http://lod.ac/ (open on December 11)
Hideaki Takeda / National Institute of Informatics
LODAC Museum – Main work
Gathering of data Thesaurus, museum collections, etc
Standardization of data Representing data from different sources in a unique form
Integration of data Identifying data Associating the same data
Publishing and share of data
Hideaki Takeda / National Institute of Informatics
Data sources
Thesaurus and authority sources 日本美術シソーラス DB 絵画編
(Thesaurus of Japanese Art) 国指定文化財データベース
(DB for National Designated Cultural Property) 文化遺産オンライン
(Cultural Heritage Online) Museum Collection (14 museums)
国立美術館所蔵作品総合目録検索システム ( 国立国際美術館,京都国立近代美術館,東京国立近代美術館 ) (4 Nat’l Museums)
国立西洋美術館 (Nat’l M. Western Art) 京都国立博物館 (Kyoto Nat’l Museum) 奈良国立博物館 (Nara Nat’l Museum) 福島県立美術館 (Fukushima Pref. M. of Art)
Other sources DBPedia Japan GIS data
栃木県立美術館 秋田県立近代美術館 岩手県立美術館 徳島県立近代美術館 山梨県立美術館 東京都現代美術館 香川県立東山魁夷せとうち
美術館
Hideaki Takeda / National Institute of Informatics
Metadata design
Basic Structure Work – Creator – Museum
Interoperability is more considered than correctness in the domain DC> DCTerm> FOAF> iCal >SKOS>NDLSH> RDA> CIDOC
CRM Keep it flat as long as possiblePREFIX URI
crm http://purl.org/NET/cidoc-crm/core#
dc http://purl.org/dc/terms/
dc11 http://purl.org/dc/elements/1.1/
foaf http://xmlns.com/foaf/0.1/
skos http://www.w3.org/2004/02/skos/core#
rdfs http://www.w3.org/2000/01/rdf-schema#
ical http://www.w3.org/2002/12/cal/ical#
rda2 http://RDVocab.info/ElementsGr2
lodac http://lod.ac/ns/lodac#
lodac:Work Property( 一部項目省略 )資料分類 lodac:genre文化財 lodac:culturalAssets制作者 dc:creator / dc11:creator国籍 crm:P7_took_place_at作品名 dc:title / skos:prefLabel作品名読み dc:title @ja-hrkt / skos:altLabel作品名英語 dc:title @en / skos:altLabel銘文 crm:P62I_is_depicted_by印章 crm:P65_shows_visual_item員数 crm:P57_has_number_of_partsコレクション dc:isPartOf制作年 dc:created推定始年 lodac:estimatedStartYear材質 dc:medium / crm:P45_consists_of
Metadata elementsWork: 46Person: 23Org. 13Bib. 12
Hideaki Takeda / National Institute of Informatics
Integration Policy How to integrate data from different sources
sharing of responsibilityEach source is responsible for its data
Identifying IDs for data and managing data with the IDs LODAC is only responsible for integration
Assigning original IDs and associating other IDs to them
Data from Source B
24
Integrated data
dc:references dc:references
dc:references dc:references
dc:references dc:references
dc:creatordc:creator
crm:P55_has_current_location crm:P55_has_current_location
crm:P55_has_current_location dc:creator
Data from Source A
Work
Museum
Creator
Hideaki Takeda / National Institute of Informatics
Integration of Person Data
Matching of Creators Base: List of Artists from Thesaurus of Japanese Art Target: Creators of collection in museums + Dbpedia Method: String match of names Results: Links from artist nodes to work nodes are added
LODAC data
Link to Work
DBpedia
Basic Information for Creators
Links
Hideaki Takeda / National Institute of Informatics
Hideaki Takeda / National Institute of Informatics
Hideaki Takeda / National Institute of Informatics
Hideaki Takeda / National Institute of Informatics
Hideaki Takeda / National Institute of Informatics
Hideaki Takeda / National Institute of Informatics
徳島県立美術館 Tokushima Pref. Museum
Hideaki Takeda / National Institute of Informatics
東京近代美術館 National Museum of Modern Art, Tokyo
Hideaki Takeda / National Institute of Informatics
国指定文化財データベース DB for National Designated Cultural Property
Hideaki Takeda / National Institute of Informatics
Tokushima Pref. Museum Thesaurus for Japanese ArtDB for National Designated
Cultural Property
National Museum of Modern Art, Tokyo
Fukui Pref. Museum
Hideaki Takeda / National Institute of Informatics
Data size and Integration Results
Source Type No.
国立美術館 ( 西美を除く 3 館 ) Work 25180
国立西洋美術館 Work 4373
京都国立博物館 Work 5819
奈良国立博物館 Work 431
福島県立美術館 Work 20
栃木県立美術館 Work 32
秋田県立近代美術館 Work 22
岩手県立美術館 Work 1558
徳島県立近代美術館 Work 18482
山梨県立美術館 Work 262
東京都現代美術館 Work 5416
香川県立東山魁夷せとうち美術館 Work 266
Thesaurus for J. art Work 3800
Thesaurus for J. art Person 1332
Thesaurus for J. art Group 289
Thesaurus for J. art Museum 648
Cultural Heritage Online Museum 915
Designated Cultural Property DB Work 10115
合計 103096
Type for Integration
Sources No. Results
Museum Thesaurus for J. art 648 77
Cultural Heritage Online 915
Designated Cultural Property
Thesaurus for J. art (work) 3800 74
Designated Cultural Property DB
10115
work Thesaurus for J. art (work) 1332 15020
Museum collections (work) 61861
Person Thesaurus for J. art (artist) 1332 615
Museum collections (work) 61861
Museum collections
Hideaki Takeda / National Institute of Informatics
What can LOD give Museum Data?
Connectivity!!
Open Connectivity makes new values for museum data Connect to data in other areas Connect to UGC (User Generated Contents)
Hideaki Takeda / National Institute of Informatics
Local Information with Museum data
Museum LOD + Local LOD / Sightseeing LOD / Geo LOD
e.g., Tour visiting museums with a focus
Joint event with local festivals
Tour for food related historical events
…
37
Hideaki Takeda / National Institute of Informatics
User Generated Contents for Museum Information
1. Statue of Sarasvati 2. Ryohoji Temple
3. Theme Song for Ryohoji 4. Event
Contributions by non-experts
e.g.,
Personal comments for Buddha statues
Records of visiting museums
Media-mix events
弁財天像
Hideaki Takeda / National Institute of Informatics
Publish museum data as LOD
Let’s make museum data open and shareable
Change “cultural heritage” to “cultural resources”
(art/culture) * information = Promotion of the Nation
Beyond collaboration of Museum Library Archives(MLA)
MLA3(Museum Library Archives, Arts and Academia)
More users, more various types of usage
Hideaki Takeda / National Institute of Informatics
ARTS & CultureArt
Computer Science
Music
Movie
Media
Make arts and culture more dynamic and more energetic
Pop Culture