danish legal deposit on the internet national diet library, tokyo, january 2002 by birgit n....
TRANSCRIPT
![Page 1: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/1.jpg)
Danish Legal Deposit on the Internet
National Diet Library, Tokyo, January 2002
by Birgit N. HenriksenHead of Digitization and Web Department
The Royal Library, [email protected]
![Page 2: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/2.jpg)
Presentation outline
•Experiences with legal deposit of web materials in DK since 1998
•Period with new projects, 2000-2002
•A new strategy in the future?
![Page 3: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/3.jpg)
Denmark5 million people in Northern
Europe
![Page 4: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/4.jpg)
The Danish Legal Deposit Law
•1697: The first legal deposit law in Denmark
•1902: All printed materials to be deposited
•1997: All published works to be deposited
![Page 5: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/5.jpg)
The law from 1997 covers
any work published in Denmark regardless of medium
“work”: a delimited quantity of information which must be considered a final and independent unit
“published”: when … copies of the work have been placed on sale or otherwise distributed to the public
![Page 6: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/6.jpg)
Types of Net Publications
Static publications included (only periodically updated) •monographs•periodicals
Dynamic publications excluded (continuously updated) •Databases•Homepages
![Page 7: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/7.jpg)
Notification
•Whothe person in charge of the technical completion of the digital copy
•Howby filling out a form at the Danish legal deposit website: http://www.pligtaflevering.dk
•When as soon as the net publication is placed on the web. The Royal Library must then download it within three months
![Page 8: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/8.jpg)
DC Registration Form - Monographs
![Page 9: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/9.jpg)
Download - workflow
The staff at the Danish Department :
•determine whether a publication is covered by the law
•if yes, download all files belonging to the work
•check downloaded work•catalogue and classify the work in the OPAC (only periodicals)
•transfer work to archival server
(server mirrored every night to State and University Library, Århus)
![Page 10: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/10.jpg)
Cataloguing – Indexing•Danish Bibliographic Centre makes MARC records of the part included in the National Bibliography
•Searches in OPAC supplemented or replaced with: •access by searching directly in data provided by the publisher
•full text search in the archived material through a ‘web index’ – the same way you use the material when it is online on the net
![Page 11: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/11.jpg)
Access to archived web material
•Theory - Restricted Access•One PC in each legal deposit library placed in a reading room – free for all
•No possibility of making electronic copies from the archive, only paper print-outs
•Practice - No Access
![Page 12: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/12.jpg)
Statistics January 2002
•Subdomains within top level domain .dk:•# subdomains in Denmark: 352,000•# subdomains in archive: ~ 1,000
•Volume:•# net publications : 10,522•# files : 693,309•# Gbytes: 23
•Content: •1/3 monographs, 2/3 periodical issues•2/3 public publishers, 1/3 private publishers
![Page 13: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/13.jpg)
Staff resources
Man Years
Paid hours/ publication
Comments
1998 2,3 12,75 System being developed and set up
1999 1,9 1,2 Downloading, cataloguing and classifying all publications
2000- 1,3 0,6 Downloading, cataloguing and classifying all periodicals
![Page 14: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/14.jpg)
MimeType Statistics – 2001% of collected files
Selective collection, Denmark
Bulk collection,Sweden
TEXT/HTML
59,3 % 55,6 %
Image (GIF, JPEG, PNG)
37,9 % 40,0 %
PDF 1,7 % 1,0 %
Other formats
1,1 % 3,4 %
![Page 15: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/15.jpg)
Problems related to harvesting
•Errors or inconsistencies in the published files
•Java applets and java scripts – no solution at the moment
•Data protected with username/password logins is covered by the law but more difficult to download
•Can't harvest more sophisticated formats
•Can't harvest interactive processes
![Page 16: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/16.jpg)
Summary
•Selective web archiving based on notification covered by law and practiced since 1998
•Only static publications•Doesn’t get everything covered by the law
•Doesn’t get a representative part of the net
•Doesn’t get the most advanced part•Labour intensive •Cataloguing partly replaced by alternatives
•Very restricted access
![Page 17: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/17.jpg)
Web Archiving Conference, CPH June 2001
•Focus:•User Expectations for web archiving
•Input from scholars & scientists: •Archive the dynamic part of the web•Focus on archiving
• the content• the context• the evidence of use
•Archivists:•Use different archiving approaches •Find new methods for archiving interactive material•Budgets for making snapshots and making selective collections are comparable
![Page 18: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/18.jpg)
Why harvesting ?
•Possible to get a representative part of the net
•Private and public publishers•Material about Danes as well as material that
interests the Danes
•Get new trends as soon as they appear
•The easiest way to get (all) updated versions quick and easy
•Accumulative harvesting of news and media
![Page 19: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/19.jpg)
Why not only harvesting?•Programmes and plug-ins difficult to handle
• Harvesting is not always possible (e.g. streamed and web casted material, flash applications, chat …)
•Harvesting may not give a useful result•technical problems
(java, interactive sites like net art, games, auctions …)
•personalised sites•services
(search engines, route planners, home banking, e-commerce…)
![Page 20: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/20.jpg)
Birte Christensen-Dalsgaard, SB:
Archive Experience, not Data
User Interface
Service Layer
Data Layer
Library SystemXML parserChatEtc.
Databases:CataloguePapers and articlesFinansial InformationEtc.
Database publishing,
![Page 21: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/21.jpg)
Three Danish web archiving initiatives
•Legal deposit based on selective approach since 1998, http://www.pligtaflevering.dk
•Nordic Web Archive (Nordic project 2000-2002, access to web archives) http://nwa.nb.no
•netarchive.dk (Danish project, multiple archiving strategies , 2001-2002) http://netarchive.dk
![Page 22: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/22.jpg)
netarchive.dk
•Project testing different archival approaches and the subsequent usability of the archived material for research
•Project partners:•State and University Library, Aarhus •Centre for Internet Research, Aarhus University •The Royal Library, Copenhagen•Economic support from the Danish Electronic Research Library (DEF)
•Period: August 2001 – July 2002
•Case: Danish municipal elections November 2001
![Page 23: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/23.jpg)
netarchive.dk
InteractivityStatic Dynamic
Real time dialog
Published, static
Sig
nal l
ifet
ime
Different archival approaches
Chatter botsChatWeb conference
Report
Web forms
Searching OPAC
Net auctions
Net art
![Page 24: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/24.jpg)
Accumulative
Snapshot
netarchive.dk
InteractivityStatic Dynamic
Real time dialog
Published, static
Sig
nal l
ifet
ime
Process
Different archival approaches
![Page 25: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/25.jpg)
netarchive.dk - Experiences Experiences with event based harvesting
•New materials: web sites, discussion groups, portals and chat
•Hard to find the relevant, new URL’s during the event
Experiences with contracts/agreements•Only 3 of 44 contracts have been signed •Knowledge of agreements do not spread out
sufficiently in a top-down organisation• Agreements must cover
•how to harvest (Technical issues)•how to give access to harvested (Copyright issues)
Experiences with different harvesters •Browsers more robust to errors on sites than
harvesters, and they interpret programme objects like java scripts
![Page 26: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/26.jpg)
netarchive.dk
Process rather than data
•Make a film of the process
•‘container’ with known preservation strategy
•Accept loss of all functionality
•‘Filming’ through a browser•Catch chronological series of displayed WebPages
•Tools to take into consideration:•Business intelligence tools•Tools used in usability laboratories
![Page 27: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/27.jpg)
New Strategy Proposal
•Archiving dynamic material must be legal
•Selective approach replaced/supplemented with bulk collections done by robot harvesting
•Retain possibility for delivery
•‘Filming’ parts of the net •Access less restrictive
![Page 28: Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal](https://reader034.vdocuments.site/reader034/viewer/2022052603/56649e1f5503460f94b0a95a/html5/thumbnails/28.jpg)
END