fox@vt fox.cs.vt dept. of computer science, virginia tech
DESCRIPTION
1 st Canadian ETD & Open Repositories Workshop May 10-11, 2010 Carleton University, Ottawa “Opening and Expanding Digital Library Services” by Edward A. Fox. [email protected] http://fox.cs.vt.edu Dept. of Computer Science, Virginia Tech Blacksburg, VA 24061 USA. Acknowledgements. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/1.jpg)
1
1st Canadian ETD &Open Repositories WorkshopMay 10-11, 2010 Carleton University, Ottawa
“Opening and ExpandingDigital Library Services”
by Edward A. Fox
• [email protected] http://fox.cs.vt.edu• Dept. of Computer Science, Virginia Tech• Blacksburg, VA 24061 USA
![Page 2: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/2.jpg)
Acknowledgements• Mentors (Licklider, Kessler, Salton)• Virginia Tech, CS, Digital Library Research Laboratory• NSF and other sponsors• Students, colleagues, co-investigators• Monika Akbar, Yinlin Chen, Spencer Lee, Venkat
Srinivasan, Seungwon Yang, … • Boots Cassel, Gary Marchionini, Jeffrey Pomerantz,
Barbara Wildemuth, Andrea Kavanaugh, Naren Ramakrishnan, Steve Sheetz, Don Shoemaker, …
2
![Page 3: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/3.jpg)
Part 1 – Selected DL Projects• Digital Library Curricular Resources
– NSF IIS-0535057 & 0535060• CTRnet (Crisis, Tragedy & Recovery Net)
– NSF IIS-0916733• Ensemble (Computer Science Education)
– NSF DUE-0840719• Digital Preserve
– NSF IIS-0910183 & 0910465– http://slurl.com/secondlife/Digital%20Preserve/
140/126/29
3
![Page 4: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/4.jpg)
DL Curric. Project - 1• NSF awards to VT and UNC-CH• CS and LIS
• Project server: http://curric.dlib.vt.edu/
• Wikiversity: http://en.wikiversity.org/wiki/Curriculum_on_Digital_Libraries
4
![Page 5: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/5.jpg)
DL Curric. Project - 2• Module 1-b: History of digital libraries
and library automation• Module 2-c: File Formats,
Transformation, and Migration• Module 3-b: Digitization• Module 4-b: Metadata• Module 5-a: Architecture overviews
5
![Page 6: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/6.jpg)
DL Curric. Project - 2• Module 5-b: Application software• Module 5-d: Protocols• Module 6-a: Information needs/relevance• Module 6-b: Online information seeking
behaviors and search strategies• Module 6-d: Interaction design and
usability assessment
6
![Page 7: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/7.jpg)
DL Curric. Project - 3• Module 7-b: Reference Services• Module 7-g: Personalization• Module 8-b: Web Archiving• Module 9-c: Digital library evaluation,
user studies
7
![Page 8: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/8.jpg)
8
CTR stakeholders
![Page 9: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/9.jpg)
9
• Build a networked digital library relating to CTR
• Support information exploration
• Aided by an ontology
• Integrate community, content, and services relating to CTR, making it accessible, and preserving it for long-term reuse
• www.citeulike.org group ctrnet
• Citations• Papers, …
![Page 10: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/10.jpg)
Haiti Photographs, Content Based Image Retrieval Evaluation
![Page 11: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/11.jpg)
Goals for Ontology for CTR
11
Social networkapplications
CTR literature
Focus groups
Websites, Internet Archive
Browsing
SearchingQuery expansion
Visualizing
Tagging
Summarizing
CTR OntologyCTR Ontology• Individual• Organizational• Community• Political• …
Multicultural/ linguistic input
Recommending
sources
uses
![Page 12: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/12.jpg)
Preliminary Data Analysis
Revise seeds if poor preliminary
data
![Page 13: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/13.jpg)
Data Filtering and Storytelling
![Page 14: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/14.jpg)
Ensemble Portal
Fedora
Social network services
AlgoVizSWENET
Syllabus
Computing Communities
WebCAT TECH
Walden’s Path/VKB
CATSpace
CITIDEL
Drupal
Blog
Forum
Browse
Submit
Search
RSS
Storage
FOCES
CS1
CSTC
CSTA
Walden’s Path
VKB SI
Computing Resources
Tools
![Page 15: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/15.jpg)
Ensemble in Second LifeThe Ensemble Pavilion offers:• teleports to other computing sites in Second Life like the Digital Preserve • hyperlinks to related computing websites• RSS readers with feeds from computing and computing education blogs• membership in the Ensemble Computing group in Second Life, Facebook, and Twitter
http://slurl.com/secondlife/Educators%20Coop%204/66/236/28
www.computingportal.org
![Page 16: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/16.jpg)
16
Selected Digital Preserve Personnel
EdFox RiekoEdward Fox
zamfir PauleSpencer Lee
Krad ProtoSeungwon Yang
Gary OctagonGary Marchionini
mantruc MartianJavier Velasco-Martin
Uma AldrinUma Murthy
![Page 17: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/17.jpg)
17
•18 posters on display•Poster view tips•Video screen
Poster Building
DP areas
•Beverages•Screens•Discussion areas
Cafe
![Page 18: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/18.jpg)
Part 2 – Basic DL Concepts
• Digital Library Scope• OAI
– Harvesting– Repositories
• Space-related Perspectives of Computing– Distributed– Cloud …
• 5S
18
![Page 19: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/19.jpg)
DL Scope
• Institutional repositories• Open archives• Electronic/virtual libraries• Content management systems• Courseware management systems• Personal information management systems• Cloud/ubiquitous/… computing
19
![Page 20: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/20.jpg)
20
SynchronousScholarly Communication
Same time, Same or different place
![Page 21: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/21.jpg)
21
Asynchronous, Digital Library Mediated Scholarly Communication
Different time and/or place
![Page 22: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/22.jpg)
22
![Page 23: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/23.jpg)
23
Information Life Cycle
AuthoringModifying
OrganizingIndexing
StoringRetrieving
DistributingNetworking
Retention/ Mining
AccessingFiltering
UsingCreating
![Page 24: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/24.jpg)
24
AuthoringModifying
OrganizingIndexing
Storing Archiving
NetworkingAccessingFiltering
Creation
DistributionUtilization
Significance
Similarity
Pertinence
AccuracyCompletenessConformance
Seeking
SearchingBrowsingRecommending
Relevance
Timeliness
AccessibilityAccessibility
Inactive
Active
Discard
RetentionMining
Semi-Active
Preservability
Timeliness
Preservability
Describing
Quality and the Information Life Cycle
![Page 25: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/25.jpg)
25
DLs Shorten the Chain to
Author
Reader
Digital
LibraryEditor
Reviewer
Teacher
Learner
Librarian
![Page 26: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/26.jpg)
26
Degree of Structure
Chaotic Organized Structured
Web DLs DBs
![Page 27: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/27.jpg)
Example of Structural Levelof Text Information
Example of Granularity of Information Structure
![Page 28: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/28.jpg)
ETD Logical Hierarchy
![Page 29: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/29.jpg)
29
OAI = Technical Umbrella forPractical Interoperability…
ReferenceLibraries
Publishers E-PrintArchives
…that can be exploited by different communities
Museums
![Page 30: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/30.jpg)
30
OAI – Repository PerspectiveRequired: Protocol
DODO DO DO
MDO
MDO MDOMDOMDO
MDOMDOMDO
Glossary:DC=Dublin CoreMDO=Metadata ObjectDO=Digital Object
![Page 31: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/31.jpg)
31
Discovery CurrentAwareness Preservation
Service Providers
Data Providers
Metada ta
ha rve sting
The World According to OAI
![Page 32: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/32.jpg)
Space-related Computing
InformationInformation
Social Social ComputingComputing
Mobile Mobile ComputingComputing
Ubiquitous Ubiquitous ComputingComputing
Cloud Cloud ComputingComputingGreen Green
ComputingComputing
![Page 33: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/33.jpg)
33
5S LayersSocieties
Scenarios
Spaces
Structures
Streams
![Page 34: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/34.jpg)
34
5Ss
Ss Examples Objectives
Streams Text; video; audio; image Describes properties of the DL content such as encoding and language for textual material or particular forms of multimedia data
Structures Collection; catalog; hypertext; document; metadata
Specifies organizational aspects of the DL content
Spaces Measure; measurable, topological, vector, probabilistic
Defines logical and presentational views of several DL components
Scenarios Searching, browsing, recommending
Details the behavior of DL services
Societies Service managers, learners, teachers, etc.
Defines managers, responsible for running DL services; actors, that use those services; and relationships among them
![Page 35: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/35.jpg)
5S Contextualized
• Societies/communities/users served• Scenarios/services supported• Management of physical/conceptual/
feature spaces• Use of structures/organizational devices• Streams of content and communication
35
![Page 36: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/36.jpg)
36
5S and DL formal definitions and compositions (April 2004 TOIS)
5S
structures (d.10)streams (d.9) spaces (d.18) scenarios (d.21) societies (d. 24)
structural metadataspecification(d.25)
descriptive metadataspecification(d.26)
repository(d. 33)
collection (d. 31)
(d.34)indexingservice
structured stream (d.29)
digitalobject (d.30)
metadata catalog (d.32)
browsingservice
(d.37)
searchingservice (d.35)
digital library(minimal) (d. 38)
services (d.22)
sequence (d. 3)
graph (d. 6)function (d. 2)
measurable(d.12), measure(d.13), probability (d.14), vector (d.15), topological (d.16) spaces
event (d.10)state (d. 18)
hypertext(d.36)
sequence (d. 3)
transmission(d.23)
relation (d. 1) language (d.5)
grammar (d. 7)
tuple (d. 4)*
![Page 37: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/37.jpg)
37
Streams
text
audio
image
video digitalobject
Repository
Collection Catalogdescribes
stores
is_version_of/ cites/links_to
Index
Service
Scenario
event
extendsreuses
ServiceManager
Actor
operationexecutes
participates_in
recipient
runs
Scenarios
Societies
inherits_from/includes
association
uses
Topological
ProbabilisticMetric
Measurable
Measure
describes
employsproduces
employsproduces
employsproduces
Structures
Spaces
Vector
contains
metadata specifications
is_a is_a
precedeshappens_before
is_a
redefinesinvokes
contains
contains
Content / People
![Page 38: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/38.jpg)
38
Extending 5S
• Higher DL Constructs– Collections– Catalogs– Repositories and Archives– Systems– Case Studies
• Specialized views and services
![Page 39: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/39.jpg)
Streams Structures Spaces Scenarios Societies
structured stream
structural metadata specification
descriptive metadata specification
digital object
metadata catalog
collection repository
hypertext
Minimal DL
image stream
feature vector
composite image descriptor
image descriptor
image content description image object
image digital object
image descriptor metadata catalog
structured feature vector
image collection
base document
superimposed document
mark superimposed structure
subdocument
presentation channel
complex object
complex object structureCBIR servicevisualization
view in context
browsingindexing searching
services
user
community
personalization
user model
user role
collaboration
![Page 40: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/40.jpg)
40
Requirements Analysis Design Implementation Test
5S 5SLOO ClassesWorkflow Components
DLEvaluation
5SGraph 5SLGenFormalTheory/Metamodel
DL XMLLog
![Page 41: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/41.jpg)
41
Tools/Applications
5S MetaModel
5SGraphDL
Expert
DL Designer
5SL DL
Model
5SLGen
Practitioner
Researcher
TailoredDL
Teacher
componentpool
ODLSearch,ODLBrowse,ODLRate,ODLReview,
…….
Logging ModuleXMLLog
![Page 42: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/42.jpg)
Society Centered• Society, community, group, user• Web 2.0, Social networking• Computer-supported cooperative work• User modeling
– Authors, committee/peers, readers• Economics / culture
– Free: but who actually pays, how, implications– Low cost: prepaid, but what of preservation– Repository hierarchy: group, institution, nation
42
![Page 43: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/43.jpg)
Student Gets CommitteeSignatures and Submits ETD
Signed
Grad School
![Page 44: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/44.jpg)
Library Catalogs ETD, Access isOpened to the New Research
WWW
NDLTD
![Page 45: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/45.jpg)
Content Centered
• Genre– Gray literature– Report, courseware– Posters, demos, tutorials, panels, debates
• Format• Presentation• Preservation
45
![Page 46: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/46.jpg)
Part 3 – Services Centered
• Taxonomy• Interoperability, integration, packaging
– HTML5• Collaboration, annotation, recommending• Indexing, CBIR• Categorizing, browsing• Roles of librarians
46
![Page 47: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/47.jpg)
47
Browsing Collaborating Customizing Filtering Providing access Recommending Requesting Searching Visualizing
Annotating Classifying Clustering Evaluating Extracting Indexing
Measuring Publicizing
Rating Reviewing (peer)
Surveying Translating
(language)
Conserving Converting
Copying/Replicating Emulating Renewing
Translating (format)
Acquiring Cataloging
Crawling (focused) Describing Digitizing
Federating Harvesting Purchasing Submitting
Preservational Creational Add Value
Repository-Building Information Satisfaction
Services
Infrastructure Services
![Page 48: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/48.jpg)
DL.Org Functionality WGDagobert Soergel – Sci. Lead:
Functions where Interoperability is important
48
Behind the scene For usersFeature extractionClassification / clusteringSharing authority filesLog file analysisSharing user profilesHarvesting , aggregatingShared storage and backup
Federated searchIncorporating content from other places on the flyDisplay and visualization
TimelinesMapsPlaying videos
Same look-and-feel browse
![Page 49: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/49.jpg)
Sub-functions of search
49
Quick Search Advanced SearchEnter a query and click searchEnter keywords or phrases for
selected field
Limit results toSearch subscribed titelsClear
Enter a query and click searchEnter keywords or phrases for
selected fieldsSelect keyword from a listSelect Boolean operator
(explicit)Define phrase match (explicit)ClearSearch within resultsLimit results to (preselection)Sort by (preselection)Select display optionsDisplay X results per pageDisplay search history
![Page 50: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/50.jpg)
Sub-functions of annotate
50
Select object to be annotated(need to indicate selection method)
Mark region in the object(many different methods depending on the object)
Select type of annotation (highlight, mark with special meaning, text, image,
sound)
If text, image, sound
Specify relationship to object to be annotated
Select or create the annotating object (possibly specifying a region
Annotating within one system
Annotating across systems
![Page 51: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/51.jpg)
51
Annotations
OAI Data
Harvester
EDUCATORS
ADMINISTRATORS LEARNERS
Multilingual Searching
Revising Annotating Filtering Browsing Administering
Filtering Profiles User Profiles
Union Metadata
OAI Data
Provider
Remote and Peer Digital Libraries (eg. NSDL -CIS)
PORTALS
SERVICES
REPOSITORIES
Digital library architecture for localand interoperable CITIDEL services
![Page 52: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/52.jpg)
52
Example of Union Service: CitiViz
![Page 53: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/53.jpg)
53
ETANA.org
![Page 54: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/54.jpg)
54Repository1
DL1
Repository2
Union Catalog
Union Repository
Catalog1 Catalog2
Searching
Union DL DL2
archaeologists
Society
General Public
Society
ArchaeologistsGeneral Public
Union Society
ServiceBrowsingService
Union Service
Harvesting, Mapping,Searching, Browsing,
Clustering, Visualization
Architecture of a Union DL (ETANA.org)
![Page 55: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/55.jpg)
55
Union Catalog Integration
VN MetadataFormat
Global MetadataFormat
VNCatalog
HDCatalog
Union Catalog
MappingTool
Wrapper
MappingTool
Wrapper
HD MetadataFormat
Virtual Nimrin(VN)
Halif DigMaster(HD)
Union ArchDL
![Page 56: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/56.jpg)
HTML5 Structuring Flowchart
PDFETD
Multimedia file link extractor
ETD structureanalyzer
Multimedia file source extractor
PDF2Text/HTML converter
HTML5ETD
HTML5Converter
HTML5tag setTXT/
HTML
HTML
Tagged MM Source
TXT/ HTML
Tagged TXT
Tagged TXT Text/
Grammar
![Page 57: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/57.jpg)
CategoryTree
Document Sets
Google Naïve Bayes Classifiers
Training Sets
Web Interface
ETD Collection
Categorized ETDs
Category label for each node used as query
Top 50 webpages (for each node in the tree)
Cleanup (stemming, stopword removal, etc.)
Level-wise categorization
ETD metadata used for categorization
BrowsingTraining
ETDs categorized into a node of the category tree (after classification)
ETD Classification:ETD Classification: Algorithm PipelineAlgorithm Pipeline
![Page 58: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/58.jpg)
Digital Librarians
• Community oriented• Collection management• Customized services
• Principles:– Openness– Expansion
• Interoperation, integration, communitization
58
![Page 59: fox@vt fox.cs.vt Dept. of Computer Science, Virginia Tech](https://reader035.vdocuments.site/reader035/viewer/2022070502/56813c7b550346895da615b2/html5/thumbnails/59.jpg)
Summary• Selected DL Projects• Basic DL Concepts• Services Centered
• Openness• Expansion
• Questions and Comments?• http://fox.cs.vt.edu/talks/2010/
59