1 from the www and minimal digital libraries, to powerful digital libraries: why and how edward a....
TRANSCRIPT
1
From the WWW andMinimal Digital Libraries, toPowerful Digital Libraries:
Why and How
Edward A. Fox [email protected]
ICADL 2005Bangkok, Thailand – December 13, 2005
Acknowledgements (selected)
• 5S Helpers: Weiguo Fan, Marcos Gonçalves, Doug Gorton, Rohit Kelapure, Neill Kipp, Uma Murthy, Ananth Raghavan, Rao Shen, Hussein Suleman, Srinivas, Vemuri, Layne Watson, …
• Sponsors: ACM, Adobe, AOL, CAPES, CNI, CONACyT, DFG, IBM, Microsoft, NASA, NDLTD, NLM, NSF (IIS-9986089, 0086227, 0080748, 0325579, 0535057, 0535060; ITR-0325579; DUE-0121679, 0136690, 0121741, 0333601), OCLC, SOLINET, SUN, SURA, UNESCO, US Dept. Ed. (FIPSE), VTLS
3
Outline
• WWW and Digital Libraries (DLs)• Minimal DLs• Powerful DLs
– Learning Object Repository Requirements– NSDL, OCKHAM, User Interfaces, Services
• Why– DL education, Practical systems– General requirements, Domain specific requirements– Personal DLs, Global DLs
• How– Components, Metamodels, Models– Graphical aids, Generators– Integration, Quality
4
WWW and DLs
• Both emerged in early 1990s.
• Convergence began around 1994 .
• Example: Google spun off from Stanford DL.
• Crawling WWW is one way to build DLs.
• WWW support many portals to DLs.
• Parts of WWW that have catalogs (e.g., Yahoo categories) are close to DLs.
• Web Services help move WWW toward DLs, as the Semantic Web emerges.
5
Degree of Structure
Chaotic Organized Structured
Web DLs DBs
6
NSDL Information ArchitectureEssentially as developed by the Technical Infrastructure Workgroup
referenceditems &
collections
referenceditems &
collections
Special Databases
NSDLServicesNSDL
ServicesOther NSDLServices
CI Services
annotation
CI Services
discussion
CI Services
personalization
CI Services
authentication
CI Services
browsing
Core Services:information retrieval
Core Collection-Building Services
harvesting
Core Collection-Building Services
protocols
Core Services:metadata gathering
Portals &ClientsPortals &
ClientsPortals &Clients
Usage Enhancement
Collection Building
User Interfaces
NSDLCollections
NSDLCollections
NSDLCollections
CoreNSDL“Bus”
7
8
9
10
Outline
• WWW and Digital Libraries (DLs)• Minimal DLs• Powerful DLs
– Learning Object Repository Requirements– NSDL, OCKHAM, User Interfaces, Services
• Why– DL education, Practical systems– General requirements, Domain specific requirements– Personal DLs, Global DLs
• How– Components, Metamodels, Models– Graphical aids, Generators– Integration, Quality
11
Minimal Digital Libraries
• Key concepts, core ideas
• Minimalist perspective
• Underlying concepts: 5S (ETANA example)
• Higher DL constructs
• Bases:– Literature– Informal explanations– Formal definitions
12
Informal 5S & DL Definitions
DLs are complex systems that
• help satisfy info needs of users (societies)
• provide info services (scenarios)
• organize info in usable ways (structures)
• present info in usable ways (spaces)
• communicate info with users (streams)
13
5Ss
Ss Examples Objectives
Streams Text; video; audio; image Describes properties of the DL content such as encoding and language for textual material or particular forms of multimedia data
Structures Collection; catalog; hypertext; document; metadata
Specifies organizational aspects of the DL content
Spaces Measure; measurable, topological, vector, probabilistic
Defines logical and presentational views of several DL components
Scenarios Searching, browsing, recommending
Details the behavior of DL services
Societies Service managers, learners, teachers, etc.
Defines managers, responsible for running DL services; actors, that use those services; and relationships among them
14
Example of 5Ss: ETANA-DL
• Archaeological DL (Electronic Tools for Ancient Near Eastern Archaeology Digital Library)
• Integrated DL– Heterogeneous data handling
• Applies and extends the OAI-PMH– Open Archives Initiative Protocol for Metadata Handling
• Design considerations– Componentized– Extensible– Portable– Work based on 5S framework
15
16
ETANA Societies
1. Historic and pre-historic societies (being studied)2. Archaeologists (in academic institutes, fieldwork
settings, or local and national governmental bodies)
3. Project directors4. Technical staff (consisting of photographers,
technical illustrators, and their assistants)5. Field staff (responsible for the actual work of
excavation)6. Camp staff (e.g., camp managers, registrars, tool
stewards)7. General public (e.g., educators, learners, citizens)
17
ETANA Societies – cont’d
• Social issues1. Who owns the finds?
2. Where should they be preserved?
3. What nationality and ethnicity do they represent?
4. Who has publication rights?
5. What interactions took place between those at the site studied, and others? What theories are proposed by whom about this?
18
ETANA Scenarios1. Life in the site in former times2. Digital recording: the planning stage and the excavation stage 3. Planning stage: remote sensing, fieldwalking, field surveys, building
surveys, consulting historical and other documentary sources, and managing the sites and monuments
4. Excavation1. Detailed information is recorded, including for each layer of soil, and for
features such as pole holes, pits, and ditches. 2. Data about each artifact is recorded together with information about its
exact find spot. 3. Numerous environmental and other samples are taken for laboratory
analysis, and the location and purpose of each is carefully recorded. 4. Large numbers of photographs are taken, both general views of the
progress of excavation and detailed shots showing the contexts of finds. 5. Organization and storage of material6. Analysis and hypotheses generation and testing7. Publications, museum displays8. Information services for the general public
19
ETANA Spaces
1. Geographic distribution of found artifacts2. Temporal dimension (as inferred by
archaeologists) 3. Metric or vector spaces
1. used to support retrieval operations, and to calculate distance (and similarity)
2. used to browse / constrain searches spatially
4. 3D models of the past, used to reconstruct and visualize archaeological ruins
5. 2D interfaces for human-computer interaction
20
ETANA Structures
1. Site Organization1. Region, site, partition, sub-partition, locus,
…
2. Temporal orderings (ages, periods)
3. Taxonomies1. for bones, seeds, building materials, …
4. Stratigraphic relationships1. above, beneath, coexistent
21
ETANA Streams
1. successive photos and drawings of excavation sites, loci, unearthed artifacts
2. audio and video recordings of excavation activities and discussions
3. textual reports
4. 3D models used to reconstruct and visualize archaeological ruins.
22
5S and DL formal definitions and compositions (April 2004 TOIS)
5S
structures (d.10)streams (d.9) spaces (d.18) scenarios (d.21) societies (d. 24)
structural metadataspecification(d.25)
descriptive metadataspecification(d.26)
repository(d. 33)
collection (d. 31)
(d.34)indexingservice
structured stream (d.29)
digitalobject (d.30)
metadata catalog (d.32)
browsingservice
(d.37)
searchingservice (d.35)
digital library(minimal) (d. 38)
services (d.22)
sequence (d. 3)
graph (d. 6)function (d. 2)
measurable(d.12), measure(d.13), probability (d.14), vector (d.15), topological (d.16) spaces
event (d.10)state (d. 18)
hypertext(d.36)
sequence (d. 3)
transmission(d.23)
relation (d. 1) language (d.5)
grammar (d. 7)
tuple (d. 4)*
23
Digital Object
RepositoryCollection Minimal DL
Metadata Catalog
Descriptive Metadata
Specification
A Minimal DL in the 5S Framework
Structural Metadata
Specification
Streams Structures Spaces Scenarios Societies
indexing
browsing searching
services
hypertext
Structured Stream
24
Streams
text
audio
image
video digitalobject
Repository
CollectionCatalog
describes
stores
is_version_of/ cites/links_to
Index
Service
Scenario
event
extends
reuses
ServiceManager
Actor
operationexecutes
participates_in
recipient
runs
Scenarios
Societies
inherits_from/includes
association
uses
Topological
ProbabilisticMetric
Measurable
Measure
describes
employsproduces
employsproduces
employs
produces
Structures
Spaces
Vector
contains
metadata specifications
is_a is_a
precedes
happens_before
is_a
redefinesinvokes
contains
contains
25
Outline
• WWW and Digital Libraries (DLs)• Minimal DLs• Powerful DLs
– Learning Object Repository Requirements– NSDL, OCKHAM, User Interfaces, Services
• Why– DL education, Practical systems– General requirements, Domain specific requirements– Personal DLs, Global DLs
• How– Components, Metamodels, Models– Graphical aids, Generators– Integration, Quality
26
Powerful Digital Libraries
• Features, functions (LOR report)
• Services
• User interfaces
• Services to support interactions– tasks, activities
27
Learning Object Repository S/W
• WCET EduTools LOR Comparative Research Report of Nov. 2004, at http://– www.edtechpost.ca/mt/archive/000597.html– For 4 project partners, giving requirements:– University of Georgia System– Utah Education Network– Virginia Community College System– Virginia Tech
• Review criteria• 10 categories• 44 features• Reviews of six products
28
LOR S/W – Feature Categories
1. Discovery tools
2. Aggregation tools
3. Community & evaluation
4. Meta-tagging
5. Content management
6. Digital rights management
7. Presentation and consortia issues
8. Integration and interoperability
9. Technical considerations
10.Pricing/licensing/other
29
LOR S/W – Feature Category 1
• Discovery tools
–Searching
• Indexable?
–Browsing• Schema support
• Vocabulary and thesaurus support
–Syndication and notification
30
LOR S/W – Feature Category 2
• Aggregation tools–Personal collections
• Bookmarked objects• Order? Organization? Shareable?
–Content aggregator and packaging tools• Objects made of elements• Create/package/use aggregate objects
31
LOR S/W – Feature Category 3
• Community & evaluation
–Evaluation system• annotations
–Context usage illustrators• reuse
–Wish lists• requests
32
LOR S/W – Feature Category 4
• Meta-tagging–Metadata markup tool (form, batch)–Schema support (IEEE LOM, DC)–Indexing workflow support (route
records through people and processes)
–Import and export tools–Unique identifier support
33
LOR S/W – Feature Category 5
• Content management
–Authoring and publishing workflow support
–Version control & archiving functions
–Authoring tools (direct, or through packages like Word)
34
LOR S/W – Feature Category 6
• Digital rights management
–DRM (enforce rights, create rights policies, associate specific content with specific rights policies)
–Payment and fulfillment (permit users to pay for access, or to get non-digital copies)
35
LOR S/W – Feature Category 7
• Presentation and consortia issues
–Accessibility (W3C guidelines: WCAG)
–Multiple output formats (HTML, WAP)
–Customized look and feel
–Internationalization
–Multiple collections
–Media transformation and display (on upload or on demand, thumbnails too)
36
LOR S/W – Feature Category 8
• Integration and interoperability
–Federation and harvesting
–Course management integration (CMS <-> repository links, searches)
–API and Web Service support (separate repository backend, or one integrated with authoring/aggregation)
37
Harvesting: Black Box Perspective
OA 1
OA 2
OA 4
OA 3
OA 5OA 6
OA 7
38
LOR S/W – Feature Category 9
• Technical considerations– Authentication– Authorization &
Personalization– Usage reporting– Unix/Linux/Apple
server support– Windows server
support– Application server
requirements
– Database requirements – Scalability– Tech model (P2P?)– Support (help?)– Staffing requirements– Client browser
requirements (version?)
39
LOR S/W – Feature Category 10
• Pricing/licensing/other
–Company profile
–Number of installations
–Costs / licensing model
40
41
42
NSDL: Collection Services
• Discovery of content• Classification and cataloguing• Acquisition and/or linking; referencing• Disciplinary-based themes define a natural body of content,
but other possibilities are also encouraged • Access to massive real-time or archived datasets• Software tool suites for analysis, modeling, simulation, or
visualization• Reviewed commentary on learning materials and pedagogy
43
NSDL: User/Other Services• Help services, frequently asked questions, etc.
• Synchronous/asynchronous collaborative learning environments using shared resources
• Mechanisms for building personal annotated digital information spaces
• Reliability testing for applets or other digital learning objects
• Audio, image, and video search capability
• Metadata system translation
• Community feedback mechanisms
44
OCKHAM Proposed Services
• NSDL project where University Libraries constitute a P2P network, supporting:
• Alerting• Browsing• Cataloging• Conversion• OAI – Z39.50• Pathfinding• Registry
45
OCLC SRU Interface
46
47
ETD Union Search Mirror Site in China (CALIS)(http://ndltd.calis.edu.cn – popular site!)
48
Browsing Collaborating Customizing Filtering Providing access Recommending Requesting Searching Visualizing
Annotating Classifying Clustering Evaluating Extracting Indexing
Measuring Publicizing
Rating Reviewing (peer)
Surveying Translating
(language)
Conserving Converting
Copying/Replicating Emulating Renewing
Translating (format)
Acquiring Cataloging
Crawling (focused) Describing Digitizing
Federating Harvesting Purchasing Submitting
Preservational Creational
Add Value
Repository-Building
Information Satisfaction
Services
Infrastructure Services
49
Ontology: Applications
50
Ontology: Applications
• Expand definition of minimal DL by characterizing– typical DL services – in the context of “employs” and “produces”
relationships
• Use characterization to:– Reason about how DL services can be built
from other DL components– As well as be composed with other services
through extension or reuse
51
Composition of key fundamental / infrastructure services
Ic
Acquiring
universalcollection
C
DMCIndexing
DescribingCataloguing
Linking
Hypertext
Submitting
AuthoringDigitizing
doi
mskjp
p
e
e
describes
p
p
p
e
e
p
e
p
52
SearchingBrowsing
queryanchor
Society
actor
Collection, {digital object}
Recommending Filtering Binding Visualizing Expanding query
user model query/category {digital object}
{digital object} {digital object}
binder
InformationSatisfaction Services
space query’
fundamental
Rating Training
Infrastructure
Services (Add_Value)
composite
Requesting
handle
p pp
e e e{(digital object, actor, rate) }
p
e
e
p p p p p
e e
classifier
e ee e
e
p
e
Indexing
Index
p
e
transformer
e
53
Outline
• WWW and Digital Libraries (DLs)• Minimal DLs• Powerful DLs
– Learning Object Repository Requirements– NSDL, OCKHAM, User Interfaces, Services
• Why– DL education, Practical systems– General requirements, Domain specific requirements– Personal DLs, Global DLs
• How– Components, Metamodels, Models– Graphical aids, Generators– Integration, Quality
54
Why?
• Support DL education
• Practical systems
• General requirements: US-Korea diagram
• Domain specific requirements: DLEs
• Example 1: Personal DLs
• Example 2: Global DLs
55
DL Curriculum FrameworkSemester 1:
DL collections:development/creation
Semester 2:DL services and
sustainability
CO
UR
SE
ST
RU
CT
UR
E
DigitizationStorage
Interchange
Digital objectsCompositesPackages
MetadataCataloging
Author submission
NamingRepositories
Archives
Spaces(conceptual,geographic,2/3D, VR)
Architectures(agents, buses,
wrappers/mediators)Interoperability
Services(searching,
linking, browsing, etc.)
Intellectual property rights mgmt.
PrivacyProtection (watermarking)
Archiving and preservation
Integrity
Architectures(agents, buses,
wrappers/mediators)Interoperability
CO
RE
DL
TO
PIC
S
DocumentsE-publishing
Markup
Info. NeedsRelevanceEvaluation
Effectiveness
ThesauriOntologies
ClassificationCategorization
Bibliographic information
BibliometricsCitations
RoutingFiltering
Community filtering
Search & search strategyInfo seeking behavior
User modelingFeedback
Info summarizationVisualization
Multimedia streams/structures
Capture/representationCompression/coding
Content-based analysis
Multimedia indexing
Multimediapresentation,
rendering
RE
LA
TE
DT
OP
ICS
56
Foundations for Information Systems: Digital Libraries and the 5S Framework
• Ch. 1. Introduction (Motivation, Synopsis)
• Part 1 – The “Ss”
• Part 2 – Higher DL Constructs
• Part 3 – Advanced Topics
• Appendix
57
Book Parts and Chapters - 1
• Ch. 1. Introduction (Motivation, Synopsis)
• Part 1 – The “Ss”– Ch. 2: Streams
– Ch. 3: Structures
– Ch. 4: Spaces
– Ch. 5: Scenarios
– Ch. 6: Societies
58
Book Parts and Chapters - 2
• Part 2 – Higher DL Constructs– Ch. 7: Collections
– Ch. 8: Catalogs
– Ch. 9: Repositories and Archives
– Ch. 10: Services
– Ch. 11: Systems
– Ch. 12: Case Studies
59
Book Parts and Chapters - 3
• Part 3 – Advanced Topics– Ch. 13: Quality– Ch. 14: Integration– Ch. 15: How to build a digital library– Ch. 16: Research Challenges, Future Perspectives
• Appendix– A: Mathematical preliminaries– B: Formal Definitions: Ss – C: Formal Definitions: DL terms, Minimal DL– D: Formal Definitions: Archeological DL– E: Glossary of terms, mappings
60
Practical Systems
• Commercial: IBM, VTLS, …
• Open Source– Greenstone– CWIS (for NSDL)– Institutional repositories
• DSpace• Fedora
61
Institutional Repositories
• “A university-based institutional repository is a set of services that a university offers to the members of its community for the management and dissemination of digital materials created by the institution and its community members. It is most essentially an organizational commitment to the stewardship of these digital materials, including long-term preservation where appropriate, as well as organization and access or distribution.”
• Lynch, C.A. In ARL Bimonthly Report 226, pp. 1-7, Feb. 2003, www.arl.org/newsltr/226/ir.html
62
Application
Domain
Related Institutions
Examples Technical Challenges Benefit / Impact
PublishingPublishers, Eprint
archivesOAI Quality control, openness Aggregation, organization
Education
Schools, colleges, universities
NSDL, NCSTRL Knowledge management,
reuseabilityAccess to data
Art, Culture
Museum AMICO, PRDLA Digitization, describing,
catalogingGlobal understanding
ScienceGovernment,
Academia, Commerce
NVO, PDG, SwissProt, UK
eScience,European Union Commission
Data modelsreproducibility, faster reuse, faster
advance
(e) Governme
nt
Government Agencies (all levels)
Census Intellectual property rights,
privacy, multi-nationalAccountability, homeland security
(e) Commerce
, (e) Industry
Legal institutionsCourt cases,
patents Developing standards
Standardization, economic development
History, Heritage
Foundations American Memory Content, context,
interpretation
Long term view, perspective, documentation, recording, facilitating, interpretation,
understanding
Cross-cutting
Library, Archive
Web, personal collections
Multi-language, preservation, scalability, interoperability, dynamic
behavior, workflow, sustainability, ontologies,
distributed data, infrastructure
Reduced cost, increased access, pereservation, democratization, leveling, peace, competitiveness
Reagan Moore
Ed Fox
June
2002
for
NSF
63
Digital Libraries in Education
• Analytical Survey, ed. Leonid Kalinichenko• © 2003, www.iite-unesco.org, [email protected]• Transforming the Way to Learn• DLs of Educational Resources & Services• Integrated/Virtual Learning Environment• Educational Metadata• Current DLEs: US (NSDL, DLESE, CITIDEL,
NDLTD), Europe (Scholnet, Cyclades), UK (Distributed National Electronic Resource)
64
Digital Libraries in Education - 2
• Advanced Frameworks & Methodologies– Instructional course development with learning
module repositories, Learning Object reuse– Community organization around DLEs– Other content for science and research– Cyberinfrastructure, data grids– Curriculum-based interfaces (see Krowne et al.)– Concept-based organization of learning materials
and courses (CMs, ontologies)
65
DLEs: Future Vision (p. 6)
• Global learning environment of the future:
• Student-centered
• Interactive and dynamic
• Enabling group work on real world problems
• Enabling students to determine their own learning routes (styles, personalization)
• Supporting lifelong learning
66
DLEs: Objectives (p. 11)
• Long-range: lifelong/distance/anytime-anywhere• Intermediate goals
– Support for students, teachers, parents– Enhanced student performance– More students excited about science– More Internet-based science educ. resources
• with increased quality and comprehensiveness,• easy to discover and retrieve,• preserved and universally available
67
DLEs: Guiding Principles (p. 12)
• Driven by educational and science needs• Facilitating educational innovation• Stable, reliable, permanent• Accessible to all• Leverage prior research: DL, courseware, …• Adaptable to new technologies• Supporting decentralized services• Resource integration thru tools/organization
68
Personal DLs
• Old Dominion University’s Kepler
• Microsoft and Google support to move search and other services to the PC to include local data
• Microsoft MyLifeBits project with SenseCam
69
70
ETANA-DL Global ArchitectureDigBase and DigKit
Lahav
Nimrin
Umayri
Hisban
Megiddo
Jalul
New Sites
DATABASE
WRAPPERS
ETANA-DLUNION
CATALOG
SearchUSER
INTERFACE
Browse
Recommend
Note
Personalize
Review
Visualizations
ArchaeologySpecific
Work in progress
…
71
Megiddo Opening Screen
72
Locus Screen: Pictures
View all
73
Area Screen
74Repository1
DL1
Repository2
Union Catalog
Union Repository
Catalog1 Catalog2
Searching
Union DL DL2
archaeologists
Society
General Public
Society
ArchaeologistsGeneral Public
Union Society
ServiceBrowsingService
Union Service
Harvesting, Mapping,Searching, Browsing,
Clustering, Visualization
Global DL: Architecture of a Union DL
75
Outline
• WWW and Digital Libraries (DLs)• Minimal DLs• Powerful DLs
– Learning Object Repository Requirements– NSDL, OCKHAM, User Interfaces, Services
• Why– DL education, Practical systems– General requirements, Domain specific requirements– Personal DLs, Global DLs
• How– Components, Metamodels, Models– Graphical aids, Generators– Integration, Quality
76
How?
• Components
• Metamodels
• Models
• Graphical model building aids
• DL generators
• Integration
• Quality
77
1010100101010010101010010101010101010101
Program
1010100101010010101010010101010101010101
Document
1010100101010010101010010101010101010101
Document
1010100101010010101010010101010101010101
Document
1010100101010010101010010101010101010101
Program
1010100101010010101010010101010101010101
Program
1010100101010010101010010101010101010101
Image
1010100101010010101010010101010101010101
Image
1010100101010010101010010101010101010101
Image
1010100101010010101010010101010101010101
Video
1010100101010010101010010101010101010101
Video
1010100101010010101010010101010101010101
Video
componentized digital library
?
?
?
?
???
?
?
?
?
??
? ?
?
?
?
?
?
?
?
78
DiscoveryCurrent
AwarenessPreservation
Service Providers
Data Providers
Meta
data
harv
estin
g
The World According to OAI:Open Archives Initiative –
Protocol for Metadata Harvesting
79
80
81
Metamodels
• Completed– Minimal– Archaeological
• Planned– Practical– System oriented
• Doug Gorton’s thesis, so people can build models for their systems, and have them generated to work with a particular DL system
82
Digital Object
RepositoryCollection Minimal DL
Metadata Catalog
Descriptive Metadata
Specification
A Minimal DL in the 5S Framework
Structural Metadata
Specification
Streams Structures Spaces Scenarios Societies
indexing
browsing searching
services
hypertext
Structured Stream
83
5SL – The Minimal DL Metamodel
Index
Actor
Search Manager
Index Manager
Document
Collection Catalog
Metadata
Service
Manager
Interface Manager
Community
Event
Scenario
Service
Browsing Manager
User
Interface
Scenarios (Meta-) Model
Spatial
(Meta-) Model
Meta-Models
Meta-ModelsPrimitives
Stream
(Meta-)ModelStructural (Meta-) Model
Text AudioVideo Image
Societal (Meta-) Model
Retrieval
Model
uses
runs
receiver
Repository Manager
84
Streams Structures Spaces Scenarios Societies
indexing
browsing searching
services
hypertext
Structured Stream
Descriptive Metadata
specification
SpaTemOrg
StraDia
Arch Descriptive Metadata specification
ArchDO
ArchObj
ArchColl
Arch Metadata catalog
ArchDColl ArchDR Minimal ArchDL
A Minimal ArchDL in the 5S Framework
85
Requirements Analysis Design Implementation Test
5S 5SLOO ClassesWorkflow Components
DLEvaluation
5SGraph 5SLGenFormalTheory/Metamodel
DL XMLLog
86
Overview of 5SGraph
Workspace
(instance model)
Structured
toolbox
(metamodel)
87
Tools/Applications
5S MetaModel
5SGraphDL
Expert
DL Designer
5SL DL
Model
5SLGen
Practitioner
Researcher
TailoredDL
Teacher
componentpool
ODLSearch,ODLBrowse,ODLRate,ODLReview,
…….
Logging ModuleXMLLog
88
5SGen – Version 2: ODL, Services, Scenarios
5SL-SocietiesModel (1)
XPATH/JDOMTransform (2)
XMI:ClassModel (3)
Xmi2Java (4)
JavaClasses
Model (5)
superclass
DeterministicFSM (10)
SMC (11)
JavaFinite
State MachineClass
Controller (12)
5SL-ScenarioModel (6)
XPath/JDOMTransform (7)
StateChartModel (8)
Scenario Synthesis (9)
ODLSearch
Java
Wrapping
import
ComponentPool
ODLBrowse
Java
Wrapping
import
.
.
.
JSPUser
InterfaceView (13)
Generated DL Services
DLDesigner
DLDesigner
binds
5SLGen
5SL-SocietiesModel (1)
XPATH/JDOMTransform (2)
XMI:ClassModel (3)
Xmi2Java (4)
JavaClasses
Model (5)
superclass
DeterministicFSM (10)
SMC (11)
JavaFinite
State MachineClass
Controller (12)
5SL-ScenarioModel (6)
XPath/JDOMTransform (7)
StateChartModel (8)
Scenario Synthesis (9)
ODLSearch
Java
Wrapping
import
ComponentPool
ODLBrowse
Java
Wrapping
import
.
.
.
ODLSearch
Java
Wrapping
import
ComponentPool
ODLBrowse
Java
Wrapping
import
.
.
.
JSPUser
InterfaceView (13)
Generated DL Services
DLDesigner
DLDesigner
binds
5SLGen
89
XML-based DL Log Standard• Log analysis
– is a source of information on:• How patrons really use DL services• How systems behave while supporting user information
seeking activities
• Used to:– Evaluate and enhance services– Guide allocation of resources
• Common practice in the web setting– Supported by web servers, proxy caches
• DL Logging can be more detailed
90
The XML Log Format
Log
SessionId MachineInfo StatementTransaction Timestamp
SessionInfo RegisterInfo StatementEvent Timestamp
Action
Search Browse StoreSysInfoUpdate
SearchBy QueryString CatalogCollection PresentationInfo
StatusInfo
Timeout
91
DL Integration
• What is “DL Integration”– Hide distribution– Hide heterogeneity– Enable autonomy of individual component
• Why Integration– island-DLs– inability to seamlessly and transparently
access knowledge across DLs
Utilize various autonomous DLs in concert
92
Formal Definition of DL Integration
• DLi=(Ri, DMi, Servi, Soci), 1 i n
– Ri is a network accessible repository
– DMi is a set of metadata catalogs for all collections
– Servi is a set of services
– Soci is a society
• UnionRep• UnionCat• UnionServices• UnionSociety
93
Formal Definition of DL Integration (Cont.)
• DL integration problem definition:
Given n individual libraries, integrate the n DLs to create a UnionDL.
94
ETANA-DL Approach• Applying and extending Digital Library (DL)
techniques to solve key problems: making primary data available, data preservation, and interoperability
• Modeling archaeological information systems using 5S to better understand the domain and design the system and the supporting services
• Rapidly prototyping DLs that handle heterogeneous archaeological data using componentized frameworks:– eliciting requirements– refining metamodel and union schema– modeling sites– mapping– harvesting– providing useful services
95
Example of Union Service: CitiViz
96
Union Catalog Integration
VN MetadataFormat
Global MetadataFormat
VNCatalog
HDCatalog
Union Catalog
MappingTool
Wrapper
MappingTool
Wrapper
HD MetadataFormat
Virtual Nimrin(VN)
Halif DigMaster(HD)
Union ArchDL
97
local schema global schema
98
Describing Quality inDigital Libraries
• What’s a “good” digital Library?– Central Concept: Quality!– Hypotheses of this work:
• Formal theory can help to define “what’s a good digital library” by:
• New formalizations of quality indicators for DLs within our 5S framework
• Contextualizing these measures within the Information Life Cycle
99
Quality DimensionsDL Concept Dimensions of Quality Digital object Accessibility
Pertinence Preservability Relevance Similarity Significance Timeliness
Metadata specification Accuracy Completeness Conformance
Collection Completeness Impact Factor
Catalog Completeness Consistency
Repository Completeness Consistency
Services Composability Efficiency Effectiveness Extensibility Reusability Reliability
100
AuthoringModifying
OrganizingIndexing
Storing
Archiving
NetworkingAccessing
Filtering
Creation
DistributionUtilization
Significance
Similarity
Pertinence
AccuracyCompletenessConformance
Seeking
SearchingBrowsingRecommending
Relevance
Timeliness
Accessibility
Accessibility
Inactive
Active
Discard
RetentionMining
Semi-Active
Preservability
Timeliness
Preservability
Describing
Quality and the Information Life Cycle
101
Summary
• WWW and Digital Libraries (DLs)• Minimal DLs• Powerful DLs
– Learning Object Repository Requirements– NSDL, OCKHAM, User Interfaces, Services
• Why– DL education, Practical systems– General requirements, Domain specific requirements– Personal DLs, Global DLs
• How– Components, Metamodels, Models– Graphical aids, Generators– Integration, Quality
102
Selected Links - http://fox.cs.vt.edu• CITIDEL (computing education resources)
– www.citidel.org• NCSTRL (computing technical reports)
– www.ncstrl.org• NDLTD (electronic theses and dissertations
worldwide)– www.ndltd.org and etdguide.org
• NSDL (National Science Digital Library)– www.nsdl.org
• OAI (Open Archives Initiative)– www.openarchives.org
• Virginia Tech Digital Library Research Laboratory (DLRL, www.dlib.vt.edu)– 5S, AmericanSouth.Org, CSTC, DL-in-a-box, ENVISION,
ETANA, MARIAN, NDLTD, NSDL, OAD, ODL, …)
103
Questions?Discussion?
Thank You!