october 2, 2003t-space0. t-space: dspace implementation at the university of toronto rea devakos and...
Post on 20-Dec-2015
217 views
TRANSCRIPT
October 2, 2003 T-Space 1
T-Space: DSpace Implementation at the University of Toronto
Rea Devakos and Kent WeaverUniversity of Toronto Library
Rea Devakos and Kent WeaverUniversity of Toronto Library
October 2, 2003 T-Space 3
Agenda
• T-Space – the short overview
• Institutional Repositories (IRs)
• T-Space – the longer overview
• DSpace/Federation (past, present and future)
• Conclusion
October 2, 2003 T-Space 4
T-Space Foundations
• Task Force on Library Automation Planning
• Provost’s Task Force on Academic Computing and New Media
• Six-year planning reviews – “Raising our Sights” and current review
October 2, 2003 T-Space 5
T-Space is
an institutional repository service that enables the
• Capture
• Description
• Distribution
• Preservation
of U of T’s digital research & teaching products
Communities
Library
T-SpacePartnership
October 2, 2003 T-Space 6
T-Space Communities
• Institutional administrative unit– Colleges, Departments, Faculties– Policy decisions– Who may contribute content– Who may access content– Workflow for submission of content
• Supply content & metadata• Determine identity / establish
framework• Grant right to preserve & distribute
content
October 2, 2003 T-Space 7
Implementation
• Pilot phase– Installation & configuration– Implementation details
• Communication & training plans• Policy decisions
– Early adopters have and still being recruited
– Now available for testing by early adopters– June – Live operation
• Production — November 2003
October 2, 2003 T-Space 8
Current communities
• G7/G8
• Iter
• KMDI
• MCIS
• OISE (Ontario Institute for Studies in Education)
• UTSC (University of Toronto at Scarborough)
October 2, 2003 T-Space 9
October 2, 2003 T-Space 10
Institutional Repositories
• Part of the shift in scholarly communication– SPARC (Scholarly Publishing and Academic
Resources Coalition) initiative
• Institution-based or defined: visible, prestige
• Access to scholarly material in digital formats via the Web
October 2, 2003 T-Space 11
Institutional Repositories
• Cumulative and perpetual
• Open and interoperable
• Cross searching and listing critical – OAI (Open Archives Initiative)– OAI-PMH (Protocol for Metadata
Harvesting)
October 2, 2003 T-Space 12
Institutional Repositories
• “a digital archive of the intellectual product created by the faculty, research staff, and students of an institution and accessible to end users both within and outside of the institution, with few if any barriers to access.”
Rick Johnson, SPARC Enterprise Director
October 2, 2003 T-Space 13
Institutional Repositories
• “Type” - e.g., learning objects (Merlot, TILE), digital video or multimedia (Open Video Project)
• Subject - e.g. ArXiv, EconPapers
• Publishers – e.g. JSTOR
• Institutional
October 2, 2003 T-Space 14
Some other IR initiatives
• eprintsUK • Theses Alive! (UK)• UK’s FAIR (Focus on Access to
Institutional Resources)• Dutch ARNO & Dare• Ohio State U Knowledge Bank• Caltech digital collections• Fedora Project (Flexible Extensible Digital
Object and Repository Architecture)
October 2, 2003 T-Space 15
Institutional Repository Economy
• More references to LO economy than to an IR economy
– virtual LO economy
– commercial LO economy (royalties, payments, business models, e-learning marketplace)
October 2, 2003 T-Space 16
Basic Functions of T-Space
• Capture
• Description
• Distribution
• Preservation
of U of T’s digital research & teaching products
October 2, 2003 T-Space 17
Fundamentally same mission:
• Capture
• Description
• Distribution
• Preservation
• Collection Development &Acquisitions
• Cataloguing
• Circulation, OPAC& Web Services
October 2, 2003 T-Space 18
Basic Functions of T-Space
• Capture
• Description
• Distribution
• Preservation
▬ Content is provided by faculty & researchers
▬ Via simple web forms
▬ Digital research material in any format
October 2, 2003 T-Space 19
Basic Functions of T-Space
• Capture
• Description
• Distribution
• Preservation
▬ Metadata is provided by faculty & researchers
▬ IncludesDescriptive
informationTechnical
informationRights
management information
October 2, 2003 T-Space 20
Basic Functions of T-Space
• Capture
• Description
• Distribution
• Preservation
▬ Via secure web server
▬ Access control
▬ Persistent identifiers
October 2, 2003 T-Space 21
Basic Functions of T-Space
• Capture
• Description
• Distribution
• Preservation
▬ StorageLarge-scale
StableLong-termManaged
▬ File migration as technology changes
October 2, 2003 T-Space 22
Benefits
• University-wide content management
• Community and author control
• Institutional and group identity
• Uniform distribution
• Long term preservation
October 2, 2003 T-Space 23
Community & Access Control
• Submission process– Levels of permission– Workflow
• Submitter manages entire process• Submission → Review → Metadata → Approval
• Access control– World wide– Restricted to University (IP Control)
– Restricted to Community (Username/Password)
October 2, 2003 T-Space 24
User
Access to T-Space
T-SpaceInterface
LibraryCatalog
OtherT-Space
WebT-SpaceObject
October 2, 2003 T-Space 25
Other access – Google ?
Unlimited queries Deep crawls SiteSearch, with optional WebSearch Google will refresh its entire index
approximately every 30 days No advertising Traffic reports High level of customization Free
October 2, 2003 T-Space 26
• University Program
• http://services.google.com/univ_faq.html
• on our test server
October 2, 2003 T-Space 27
Enhanced search capability
• implemented built-in full text search on test server - indexes pdf, doc, txt, htm, html, and xls
• integrated with the advanced search tool - adds new items to the index on the fly
October 2, 2003 T-Space 28
Documents & Files
• Document Types:
– Articles Technical Reports– Preprints Working Papers– Data Sets Presentations– Learning Objects Images– Conference Papers Sound
etc…
October 2, 2003 T-Space 29
Documents & Files• Document Types / Document Support:
– Supported – Non-proprietary formatsGIF XML
JPEG
MARC Postscript TIFF
– Known – Proprietary formatsExcel Mathematica
Photoshop
PowerPoint Word WordPerfect
– UnknownMIME Type: application/octet-stream
October 2, 2003 T-Space 30
The Details
• Organization of content
• Uniform framework
• Technical underpinnings
• Examples
• Who does what
• How they do it
October 2, 2003 T-Space 31
Organization of Content
T-Space
Contains communities, each with its own identityCommunity
Community
Contains collections that can have individual workflowsCollection
Collection
Contains itemsItem
Item
Contains filesFile
File
Bitstreams –Multiple Formats - Same ContentComplex Objects - Multiple Files
October 2, 2003 T-Space 32
• University identity• Common navigation• Group identity
– Identity & Description
– Navigation& Links
• Content
Uniform framework
University Identity
NavigationCommunityNavigation
& LinksCommunity Identity & Description
Searching
Content
RecentAdditions
October 2, 2003 T-Space 33
Technical Underpinnings
• Hardware– Server
• IBM P670
– Storage• “Please sir, I want more.”
– Backup• Robotic tape system• Local, remote on-site & off-site storage• Files migrated as technology changes
October 2, 2003 T-Space 34
Then - December, 2000
Anticipated growth:• Current 3,506 GB• 00/01 4,813 GB• 01/02 5,553 GB• 02/03 6,335 GB• 03/04 7,166 GB• 04/05 8,055 GB 0
1,000
2,000
3,000
4,000
5,000
6,000
7,000
8,000
9,000
Meg
abyt
es
Current 00/01 01/02 02/03 03/04 04/05
Projected
October 2, 2003 T-Space 35
Now – July, 2003
Application/Service Current 2003 2004 2005 2006 2007
Digital Library 2.5 5.7 7.0 8.4 9.9 11.6Electronic Journals 2.8 10.1 13.6 18.0 23.5 30.4Image Archive 0.9 3.3 4.6 6.0 7.6 9.3Institutional Repository 0.0 1.1 2.3 3.6 5.0 6.5Library System 1.4 1.7 2.0 2.3 2.6 2.9Library/University Web 0.4 0.5 0.6 0.7 0.9 1.0Staff/Admin Files 1.1 2.2 3.0 3.9 4.8 5.8Text Archive 0.2 0.4 0.6 0.8 1.0 1.2UofT Press 0.0 0.3 0.7 1.1 1.5 1.9Web of Knowledge 0.0 1.3 1.5 1.7 2.0 2.3
TOTAL (Terabytes) 9.3 26.6 35.9 46.5 58.7 72.9
October 2, 2003 T-Space 36
Original & Current
RFP 1
RFP 2
0
10
20
30
40
50
60
70
80
99/00 00/01 01/02 02/03 03/04 04/05 05/06 06/07 07/08
Tera
byte
s
October 2, 2003 T-Space 37
Technical Underpinnings
• Software– Based on MIT’s DSpace– Open Source (PostgreSQL, Tomcat, Java) – OAIS (Open Archival Information System)
Compliant
• Metadata– Qualified Dublin Core
• Persistent URLs– CNRI (Corporation for National Research Initiatives)
Handle server
October 2, 2003 T-Space 38
Who does what – Community
• Defines– Identity of Community– Scope of collections– Workflow– Access
• Submits content• Provides metadata• Retains copyright• Grants right to preserve & distribute
October 2, 2003 T-Space 39
T-Space Service
Who does what – Library
• Distribution – Searchable Database– Community Portal
• Preserves content
• Provides– Community support– Infrastructure– Institutional framework
Director, ITS
OperationsSupport
AdvisoryCommittee
Chief Librarian
UserSupport
InformationSpecialists
Communities
October 2, 2003 T-Space 40
Submission Process
Logon
October 2, 2003 T-Space 41
Submission Process
Description
October 2, 2003 T-Space 42
Submission ProcessDescription
October 2, 2003 T-Space 43
Submission ProcessDescription
October 2, 2003 T-Space 44
Submission ProcessUpload
October 2, 2003 T-Space 45
Submission ProcessVerify
October 2, 2003 T-Space 46
Submission ProcessLicense
October 2, 2003 T-Space 47
Submission Process
Complete
October 2, 2003 T-Space 48
Submission Process
Submitted item in T-Space
October 2, 2003 T-Space 49
Digital Rights Management
• Back to license and copyright issues for a moment
• Creative Commons – December, 2002 - suite of licences – Canadian version not ready yet
• “some rights reserved”• http://creativecommons.org
• RoMEO – Rights Metadata for Open Archiving
October 2, 2003 T-Space 50
DSpace
• Two years of development – beta testing – officially released as open source software November 4, 2002
• Been 2,500+ downloads from SourceForge: http://sourceforge.net/projects/dspace
• ”model pointing the way forward”• Jointly developed by
– MIT – Hewlett-Packard Labs
with support of a Mellon grant
October 2, 2003 T-Space 51
DSpace Federation
• How will DSpace evolve?
• Will and how will IRs co-operate?
• DSpace FederationColumbia U Cornell U
Ohio State U Washington U
U of Rochester
Cambridge (UK) Toronto (Canada)
MIT
October 2, 2003 T-Space 52
The Federation
• Multi-institutional federated services
• Policy, service and business issues
• Future development and new capabilities of DSpace
• Policy Workshop at MIT before end of 2003
October 2, 2003 T-Space 53
URLs• Institutional Repositories
http://www.arl.org/newsltr/226/ir.html
• DSpacehttp://www.dspace.org
• MIThttp://libraries.mit.edu/dspace
• UofTwww.tspace.library.utoronto.ca(http://tspace.library.utoronto.ca:8080 – test system)
October 2, 2003 T-Space 54
Contacts
Peter [email protected]
Kent [email protected]
Overall T-Space Service
Establishing a Community;
Implementation & Training;
Metadata Consultation;
Support Questions
October 2, 2003 T-Space 55
T-Space Staff
Gabriela Mircea
416-946-0114
Frank Rotiroti
416-978-7572
Technical implementation, support and documentation
IBM server and AIX support
October 2, 2003 T-Space 56
Thank you.
Are there any Questions?