digital resource management in national institute of japanese literature shoichiro hara (national...

28
Digital Resource Digital Resource Management Management in in National Institute of National Institute of Japanese Literature Japanese Literature Shoichiro Hara Shoichiro Hara (National Institute of Japanese Literature) (National Institute of Japanese Literature) 1-16-10, Yutaka-cho, Shinagawa-ku, Tokyo 142 1-16-10, Yutaka-cho, Shinagawa-ku, Tokyo 142 -8585, Japan -8585, Japan [email protected] [email protected]

Upload: erick-mccoy

Post on 12-Jan-2016

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

Digital Resource ManagementDigital Resource Managementin in

National Institute of Japanese National Institute of Japanese LiteratureLiterature

Shoichiro HaraShoichiro Hara(National Institute of Japanese Literature) (National Institute of Japanese Literature)

1-16-10, Yutaka-cho, Shinagawa-ku, Tokyo 142-8585, Japan1-16-10, Yutaka-cho, Shinagawa-ku, Tokyo 142-8585, [email protected]@nijl.ac.jp

Page 2: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

National Institute of Japanese Literature- NIJL: 国文学研究資料館 -

Founded in 1972 As an Inter-University Research Institute By the Ministry of Education, Culture, Sports, Science and Technology

(MEXT: 文部科学省 )

Mission Survey Japanese Classical Literal Materials Collect Originals and Microfilms Public Access to Research Information

We Have Done Collected Materials Organized their Information Published Catalogues Developed Variety Kinds of Databases

Page 3: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

NIJL Databases Catalogue Databases

Holding Catalogues (Books and Microfilms) Research Papers OPAC (Online Public Access Catalogue) Union Catalogue of Classical Books( 古典籍総目録 ) etc.

Sharing Database of Historical Materials Image Database

Holding Original Materials (Approx. 1,000,000 frames) Meiji Publications Nara Picture Book ( 奈良絵本 ) etc.

Full Text Databases The Anthology of Japanese Classical Literature( 日本古典文学大系 ) 21Waka-Anthologies ( 二十一代集 ) etc.

Movie Pictures and more ・・・

Page 4: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

Resource ManagementResource Management(Past Systems)(Past Systems)

1. Investigation, Collection, Microfilming, Cataloging2. Database Systems

Main-frame System Networks (N-1)

3. Database Services Catalogues OPAC Full-text Data Image Data

4. Other Services Publications References, Reproductions Education, Lectures, Exhibitions

Page 5: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

Problems of Past SystemsNIJL Systems were particular to its own purposes Heterogeneities of the Information Systems

System Architecture and Historical Background• Different data structure• Different data description

Complicate and High-cost Data Management Obsolescence of Hardware and Software

Regular/Periodic System Renewal• CPU / peripheral devices• Applications

System Reconstruction• Reconstruction of applications /user Interfaces• Data migration

Coping with Hypermedia No Standard Applications Development for Particular System

Page 6: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,
Page 7: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

Resource ManagementResource Management(Current Systems)(Current Systems)

1. Data Portability Introducing XML

2. Coping with Hyper Media Unix Base Systems Catalogue - Image

3. Databases Catalogues OPAC Full-Texts Images Movies

Page 8: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

Importance of Data Portability Maintenance

Data Independent from Hardware, Software Readable Data

Data Processing Data Conversion to Web, Publishing, Database etc. Data Backup and Transfer Data Hub Format / Data Interchange

Coping with Hypermedia Web Pages Linking with Images, Movies etc. External Standard Character

Page 9: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

Necessities for Portability Self-describing

An ability to define a set of data structure and provide a way to check that data conforms to a set of rules

Readable Data Data should be plain text files in ASCII, Latin 1

(ISO 8859-1) or Unicode (UTF-8 or -16)

Page 10: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

Portability of XML Self-describing

DTD: Document Type Definition XML Schema

• Can define element sets and provide a way to check that a document conform to a set of rules

Readable Text XML documents are plain text files in ASCII,

Latin 1 (ISO 8859-1) or Unicode (UTF-8 or -16)

Page 11: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

Schema of XML as “an Intermediate Data”

DataBase 1

DataBase 2

Application 1

Application 2

Interface

DataBase 3

XML XSLT

HTML

PDF

XHTML

XML

SpreadSheet

Page 12: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

NIJL Present Multimedia Databases e-Booke-Book

Full Text Database Reconstructed Books WEB Books

Image Database Movie PicturesMovie Pictures Image Databases of Holding Original Image Databases of Holding Original

Materials Materials (Approx. 900,000 frames)(Approx. 900,000 frames)

Page 13: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,
Page 14: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

Resource ManagementResource Management(Future Systems)(Future Systems)

Resource Sharing SystemResource Sharing System Another ApproachesAnother Approaches

Web Based SystemWeb Based System GISGIS

Page 15: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

Resource Sharing Project What are the Problems ?

Most Databases are Heterogeneous… Similar but Different Databases

Historical Background, Different Purposes Incompatibility

Different Operations, Non-Interoperability Inter-institutional Information Retrieval

Different Information Systems Different Information Management Bases

Page 16: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

Solutions might be ・・・

Introducing Standards for Data Description (Portability) Mutual Data Structure (Different Structures) Data Retrieval (Compatibility)

Standardization not by Compulsion Authority

Page 17: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

3-layer Architecture (Standardization)

Our efforts have been standardization of 1. First Layer: Database Layer

Description Portability SGML/XML

2. Second Layer: Data Structure Layer Mutual Data Structure Metadata (Dublin Core, EDI, EAD, TEI etc)

3. Third Step: Data Retrieval Layer Protocol (Z39.50 etc)

Page 18: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

Schema of 3-layers Architecture

Existent Databases

Existent Methods

Data Description Standard

Data Structure Layer

Data Retrieval Layer

Database Layer

Page 19: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

The Merits of Layer Architecture Module Oriented

Easy to change sub-modules Ex. from Z39.50 to Web Service (Retrieval Layer) from DC to METS (Structure Layer) Dictionaries (Database Layer)

Protocol Oriented Independent from hardware/software/venders

Page 20: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

How to Link Heterogeneous System?Federation System by Dublin Core + Z39.50

UC Berkeley ECAI Clearing House DC Meta Data Model

Inter University NMHF Z39.50 Gateway

Images Doc.s OPACInstitues Standard Protocol

Domain Specific SGML

Universities Osaka City Univ. or XML Data Bases

Standard Data Description

Meta DatabaseTarget Institutions

Standard Data Model

Uploading NIJL Data Clearing House

Retrieving

Page 21: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

Web-Z39.50 Gateway + Metadata(Resource Sharing System)

OriginalDatabase

MetadataDatabase

Z39.50 Server

Z39.50 Server

Z39.50 Server

WebClient

Z39.5

0 Pro

toco

l

Z39.50 ProtocolWeb-Z39.50

Gateway Server

HTTP Z39.50 Protocol

On

e D

ata

Vie

w

Page 22: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

Resource Sharing Project Inter-institutional Project

Linking Databases of Several Institutes Seamlessly The Graduate University for Advanced Studies

National Institute of Japanese Literature National Museum of Ethnology International Research Center for Japanese Studies National Museum of Japanese History

Universities The Historiographical Institute, The University of Tokyo Institute of South East Studies, Kyoto University Osaka City University Keio University

ECAI Clearing House IAS University of California Berkeley, ACL aboratory dney the University of Sydney

Page 23: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

Web Service - Next Z39.50 -

WEB Oriented More Portability

Remote Procedure Call System Architecture Independent

Light ProtocolLight Protocol Only for Data Retrieval

Introducing SOAP (Simple Object Access Protocol)

How to Treat ASN.1 ?

Page 24: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

DB Serve SOAP Client

SOAP Search.NET Client

.NET Framework

SOAP

SOAP Server

Java2 SDK

Apache Tomcat

Apache-AXIS

SOAP Search Web Service DB Access Routine JNI

Database DB I/F

Library Server(FLORA 730) Windows NT Server 4.0

Windows Terminal Windows NT/2000/XP

Server: BASE2 Solaris 8

Experimental SOAP System

Page 25: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

Information Retrieval byTime and Space

Geo-temporal Information Facts about specific time and places and their

associations with other times and places on the Earth's surface

Not all materials have enough bibliographic information Archaeology (Historical Sites, Ruins, Remains) Maps, Pictures Physical events

We use time and pace information in many aspects 5 W 1 H

Page 26: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

Tool Example (ECAI TimeMap)

Time

Longitude

Latitude

          Meta Data    ECAI Metadata

Data Set   GIS Data   TimeMap Metadata

          Attribution Data

Project

Page 27: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

Time and Place Data and Related Information

Time and Place Data from Texts Japanese Calendar ⇒ Gregorian Calendar Old Place Name Lat. And Lon. ⇒

Related Information Ex. Faults Map

Superimposed

Page 28: Digital Resource Management in National Institute of Japanese Literature Shoichiro Hara (National Institute of Japanese Literature) 1-16-10, Yutaka-cho,

Related URLRelated URL

National Institute of Japanese LiteratureNational Institute of Japanese Literature

http://www.nijl.ac.jp/

ECAI (Electronic Cultural Atlas Initiative)ECAI (Electronic Cultural Atlas Initiative)

http://ecai.org

PNC (Pacific Neighborhood Consorcium)PNC (Pacific Neighborhood Consorcium)

http://pnc-ecai.oiu.ac.jphttp://pnc-ecai.oiu.ac.jpPRDLA (The Pacific Rim Digital Library Alliance)PRDLA (The Pacific Rim Digital Library Alliance)

http://prdla.org/

Contact E-mail: Contact E-mail: [email protected]@nijl.ac.jp