week 11 summing up. tonight’s agenda discuss last week’s work –thoughts about the quality...
TRANSCRIPT
Week 11Summing up
Tonight’s agenda
• Discuss last week’s work– Thoughts about the Quality paper now that you
have read it– Thoughts about the method of engaging in a
discussion
• Looking ahead to the rest of the semester– Next week– Presentions
• Tie together the pieces we have seen this semester
The paper on Quality
• Strengths?
• Weaknesses?
• If you were a reviewer, would you want to publish it?
• If you are designing a digital library, would you refer to it?
• Any suggestions to that author?
Last week’s work
• A few people submitted their first comments much too late for others to respond to.
• There were some good interactions, with comments about what was said in a submission.
• Most people were very agreeable.
Looking ahead
• Next week– Option one
• Go to the talk by Joseph Lucia• Write 2 - 5 pages discussing both this talk and
his talk in this class
– Option two• I will provide something to read and either
report on or discuss• Preference?
Presentations• No one has requested that we have class on
December 19 (week 16)• Project presentations begin November 28,
continue for three weeks.• Presentation
– Power point or other presentation of goals, decisions, approach
– Demonstration of the working features of your project– Make sure each team member speaks– Submit a report which has the same information as the
presentation, but in report format
Presentations - timing
• Send me a URL of your project by class time on November 28
• Final report due to me no later than 5pm, Friday, December 15. Earlier will be appreciated, but do a good job.– No disadvantage to those who present early
• E-mail me if you have a preferred date to present– You must be present for all– I will honor preferences as much as possible, make
random choices where necessary
The teams
1. DL for University selection– Ganesh/Phaneendra
2. DL for Vatican video archive– Gayatri/Smita/Xiaoyan– Moorthy Jayanthi
3. Domino DL– Naomi Chin
4. New functionalities in DSPACE– Mode/Bukka/Kerray
5. Modify subject search in Dspace1. Patipati/Ravi
6. Manakin Project– Gopikrishna/Kolahalam
7. HTPC front end– Heller
8. Irish Music DL– Nordengren/Harris/Malone
9. Art DL– McCaffrey
10.Music DL UI using Manakin– Tate
11.Music Dspace DL– Huffner
12.Car DL– Kashevarov
Timing
• 13 projects on 12 topics• 3 class meetings• 4 - 5 projects per class• 30 minutes per presentation
– That allows time for transistion from one presentation to another
– Be well organized• Load your presentation on my computer at beginning of
the session or bring your own computer if you prefer
Summing up
Let’s review– What is a Digital Library?
• How does a DL differ from any web page that provides information? A wiki, for instance? A furl collection?
– The Vannevar Bush vision• Who was Vannevar Bush?
– What is his position related to DLs?
– What is Gordon Bell’s MyLifeBits?
a managed collection of information, with associated services, where the information is stored in digital formats and accessible over a network. - Wm Arms 1999
The 5S model• What are the S-words?
– Streams• The flow of information in various formats
– Structures• Organizational aspects of the DL
– Spaces• Organizational aspects of the DL
– Scenarios• Services and behaviors
– Societies• Communities and relationships among them
• Which have been most important to you in your DL work?
5S summaryModel Primitives Formalisms Objectives
Stream Text; video, audio, software program
Sequences, types Describes properties of the DL content, encoding and textual material or particular forms of multimedia data.
Structure Collection, catalog; hypertext; document; metadata; organizational tools
Graphs; nodes; links; labels; hierarchies
Specifies organizational aspects of the DL content
Space User Interface; index; retrieval model
Sets; operations; vector space; measure space; probability space
Defines logical and presentational views of several DL components
Scenarios Service, event; condition; action
Sequence diagrams; collaboration diagrams
Details the behavior of DL services
Societies Community; managers; actors; classes; relationships; attributes; operators
Object-oriented modeling constructs; design patterns
Defines managers responsible for running DL services; actors that use those services, and relationships among them
Source: http://www.dlib.vt.edu/projects/5S-Model/
An example application of 5S - Etana: A DL for an archeological site
Text Video Audio
*Site *Sub-partition *Container *Artifact*LocusRegion
Taxonomies
Temporal Artifact-specific
Space model
Structuremodel
Metadata
Drawing Photo 3DStreammodel
*Partition
Society model
Archaeologist
General public
Geographic space
Service Manager
Information Satisfaction
Value added
Repository buildingScenario
model Services
Domain specific
User interface Metric space
Spatial
Source: E. A. Fox http://feathers.dlib.vt.edu/
Content
• A DL without content is not much use• Content requirements:
– Store (organize, describe)– Find– Deliver
• Describing content– Metadata
• Standards, including Dublin Core
– Descriptive language: XML
<?xml version="1.0" encoding="ISO-8859-1"?><!DOCTYPE recipe SYSTEM “recipe.dtd”><recipe><recipe-title> Meringue cookies</recipe-title><ingredient-list> <ingredient> <ingredient-amount>3 </ingredient-amount> <ingredient-name> egg whites</ingredient-name> </ingredient> <ingredient> <ingredient-amount> 1 cup</ingredient-amount> <ingredient-name> sugar</ingredient-name> </ingredient> <ingredient> <ingredient-amount>1 teaspoon </ingredient-amount> <ingredient-name> vanilla</ingredient-name> </ingredient> <ingredient> <ingredient-amount>2 cups </ingredient-amount> <ingredient-name>mini chocolate chips </ingredient-name> </ingredient></ingredient-list><directions>Beat the egg whites until stiff. Stir in sugar, then vanilla. Gently fold in chocolate chips. Place in warm oven at 200 degrees for an hour. Alternatively, place in an oven at 350 degrees. Turn oven off and leave overnight. </directions> </recipe>
An example XML file
Metadata for DL content
• Free text or controlled vocabularies– Ease of processing– Restricted descriptive capabilities– When is each appropriate?
Dublin Core elementssee: http://dublincore.org/documents/dces/
• Title• Creator • Subject - C• Description• Publisher• Contributor• Date • Type - C
• Format - C• Identifier• Source• Language• Relation• Coverage - C
• Rights Rights Management information
Space, time, jurisdiction.
C = controlled vocabulary recommended.
Reference to related
resource
Standards RFC 3066,
ISO639
Unambiguous ID
Ex: collection, dataset, event, image
YYYY-MM-DD, ex.
Entity primarily responsible for making content of the resource
Entity making the resource available
Contributor to content of the resource
What is needed to display or operate the resource.
Content - example
• The Google Books projects and offshoots
• Questions of Intellectual Property rights and fair use
Stanford and the Google Books project
Summary of Google Books
• Indexing, not publishing project
• Increase access to library collection– Keyword searching
• Global marketing for publishers
• Services to readers
• Preserves library contents
• Opportunities for new research
Access Control & Rights Management
• Issues are both legal and technical
• Legal: What are the requirements for controlling access?
• Technical: How do we enforce the legal requirements?
Date of work Protected from Term
Created 1-1-78 or after
When work is fixed in tangible medium of expression
Life + 70 years1(or if work of corporate authorship, the shorter of 95 years from publication, or 120 years from creation
Published before 1923
In public domain None
Published 1923 - 63
When published with notice3 28 years + could be renewed for 47 years, now extended by 20 years for a total renewal of 67 years. If not so renewed, now in public domain
Published from 1964 - 77
When published with notice 28 years for first term; now automatic extension of 67 years for second term
Created before 1-1-78 but not published
1-1-78, the effective date of the 1976 Act which eliminated common law copyright
Life + 70 years or 12-31-2002, whichever is greater
Created before
1-1-78 but published between then and 12-31-2002
1-1-78, the effective date of the 1976 Act which eliminated common law copyright
Life + 70 years or 12-31-2047 whichever is greater
Chart created by Lolly Gasaway. Updates at http://www.unc.edu/~unclng/public-d.htm
Technical issues
• Link the resource to the copyright statements• Maintain that link when the resource is copied
or used• Approaches:
– Steganography– Encryption– Digital Wrappers– Digital Watermarks
Encryption• Protecting controlled content
– Passive Listening• More relevant in the DL case
– Active interference
Original message
EncodingMethod Ciphertext
DecodingMethod
Received message
Eavesdropping Masquerading
Intruder
(Plain text)(Plain text)
Public key encryption
• Eliminates the need to deliver a key• Two keys: one for encoding, one for decoding• Known algorithm
– security based on security of the decoding key• Essential element:
– knowing the encoding key will not reveal the decoding key
• Bottom line:– key used for decoding is dependent upon the key
used for encoding, but the relationship cannot be determined in any feasible computation or observation of transmitted data
Digital Signatures
• Similar techniques to encryption, but do not need to worry about the content being read.
• Issue is authentication, not hiding– Protect an individual against unauthorized
access to resources or misrepresentation of the individual’s intentions
– Protect the receiver against repudiation of a commitment by the originator
Public key encryption with implied signature
• Add the requirement that E(D(M)) = M
• Sender A has encoding key EA, decoding key DA
• Intended receiver has encoding (public) key EB.
• Sender A produces EB(DA(M))
• Receiver B calculates EA(DB(EB(DA(M))))– Result is M, but also establishes that only A could
have encoded M
Metadata harvesting
• Z39.50– Predates the Web– Protocol to communicate with collection
holders in order to provide additional services
• Open Archives Initiative– More recent– Service provider gathers information from
collection holders
OAI and Z39.50Z39.50 OAI
Content (Objects) Distributed Distributed
World View Bibliographic Bibliographic
Object Presentation Data provider Data provider
Searching is Distributed Centralized
Search done by Data provider Service provider
Metadata searched is Up to date Stale
Semantic Mapping When searching Metadata delivery
Source: oai.grainger.uiuc.edu/FinalReport/JCDL_2003_OAI_Intro.ppt
OAI componentsService Providers
and
Data Providers
Requests and Responses
http://www.oaforum.org/tutorial/english/page3.htm#section3
Interoperability
• The goal: communication, without human intervention, between information sources– Books that “talk to each other”
• Live links for references
• Knowledge of how to find relevant resources when needed
• Ability to query other information locations
User Interface and Usability
• Evaulation for any purpose has two major components– Formative
• During development, spot check how things are progressing• Identify problems that may prevent goals from being achieved• Make adjustments to avoid the problems and get the project back on
track
– Summative• After development, see how well it all came out• Lessons learned may be applicable to future projects, but are too late
to affect the current one.• Needed for reporting back to project sponsors on success of the
work.
Inspection categories• User classes
– Know your user– Example from the cited study:
• Scientific researchers in computer science• Administrators
– Do not use the regular interface, so not evaluated
• User tasks– Search for technical reports on a set of criteria– Browse the collection– Register– Submit– Harvest
Hartson 04
Categories of Problems
• General to most applications, GUIs– Wording– Consistency– Graphic layout and
organization– User’s model of the
system
• Digital Library functionality– Browsing– Filtering– Searching– Document
submission functions
Hartson 04
Special issues of Video DLs
• Cataloging– Semi-automatic tool– Manual tool– Threshold adjustable before
automatic segmentation
• Textual Query– Natural language (or
keyword)– Category or keyword list
browsing– Audio information for
indexing, browsing
• Intelligent frame selection
• Video browsing– Text description– Transcript– Single keyframe– Storyboard– Option re granularity of
keyframe set– Interactive hierarchical keyframe
browser– Keyframe slide show– Video summary playing– Playback– Transcript + playback synch– Keyframe + playback synch– Text search + playback and/or
keyframe synch
Lee 02
Summary of UI and Usability
• Much of digital library user interface design and usability analysis is the same as that of other web services– Keep the user central in the design phase– Be careful about word use– Organize the graphics and layout carefully– Think about the user experience
• Some special considerations about DL usability have to do with DL services– Search, filter, browse– Connections with other collections to which this is a portal
Remembering our goals• DL theory
– Describe Web based information sources– Contrast a digital library with other types of online resource collections.
• Enumerate the characteristics of a digital library that distinguishes it from a database or a web site.
– Characterize Digital Libraries using the 5S model• Streams, Structures, Spaces, Scenarios, Societies -- the 5 S's that
encompass the characteristics of digital libraries. What do they all mean and how can they be used to build and maintain a working digital library.
– Recognize and Protect Intellectual Property• Balancing the rights of the author with the desire to share information
requires understanding of intellectual property rights. What can we put into our digital library? To whom can we make it available?
Goals -2
• Organize information in a digital library– The role of metadata and other ways to describe digital library content
• Digital Library services – What besides content makes a library? How are these services
expressed in a digital environment?
• Develop and organize a collection– A library is a collection of resources. There are services to help find and
use the resources, but the resources themselves are the reason the library exists. What are the characteristics of a digital library collection? How are the components organized to make a coherent whole?
• Connect distributed collections – A digital library may connect users to items that are physically stored in
another location. What services are necessary to make this possible?
Goals -3 - Practice
• Install and configure a digital library– You will install and configure a digital
library so that it can hold information and present it to users, along with appropriate digital library services.
• Digital Library project – Create something new in digital libraries.
Accomplishments
• I think we have made great progress
• Your projects are the major element in your learning
• I’m looking forward to seeing what you have accomplished from this material.