paint-yourself-in-the-corner infrastructure

86
http://alexanderponting.blogspot.com/2012/01/how-to-paint-yourself-into-corner.html @hvdsomp

Post on 20-Sep-2014

13 views

Category:

Technology


0 download

DESCRIPTION

Presentation given at the EMTACL12 conference in Trondheim, Norway, on October 1 2012. Discusses the evolution towards a highly dynamic scholarly record (assets don't have the sense of fixity they used to have; assets are highly interdependent) and how the archiving infrastructure used for scholarly communication can not adequately deal with this dynamism.

TRANSCRIPT

Page 1: Paint-Yourself-In-The-Corner Infrastructure

http://alexanderponting.blogspot.com/2012/01/how-to-paint-yourself-into-corner.html

@hvdsomp

Page 2: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Consideration 1 - A Dynamic Scholarly Record

•  The scholarly record is extending with a wide range of non-traditional assets emerging from eScience and eHumanities endeavors. •  e.g. datasets, software, ontologies, workflows, online debate,

slides, blogs, videos, collaborative environments, etc.

•  Many of these non-traditional assets: •  Do not have the sense of fixity that traditional assets such as

journal articles or books have. •  Have a wide range of dependencies on other assets.

•  Even traditional assets are becoming increasingly dynamic and dependent on other assets, which may themselves be dynamic.

Page 3: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

PeerJ Dynamic Content

http://peerj.com - http://www.publishersweekly.com/pw/by-topic/digital/content-and-e-books/article/52512-scholarly-publishing-2012-meet-peerj.html

Page 4: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Article Wikipedia Bridge

http://blogs.plos.org/plos/2012/04/bridging-the-journal-wikipedia-gap/

PLoS Computational Biology

Page 5: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Research Objects

Bechhofer, S. et al. (2010) http://precedings.nature.com/documents/4626/version/1

Page 6: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Nowakowski et al. (2011) The Collage Authoring Environment Procedia Computer Science v4 http://dx.doi.org/10.1016/j.procs.2011.04.064

Executable Paper – Collage - Conceptual View

Page 7: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Nowakowski et al. (2011) The Collage Authoring Environment Procedia Computer Science v4 http://dx.doi.org/10.1016/j.procs.2011.04.064

Executable Paper – Collage – Rendering a Paper

Page 8: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Scientific Workflows, Services, Data, Workflow Engines

Carole Goble, JCDL 2012 Keynote https://dl.dropbox.com/u/617206/JCDL2012keynoteGoble.ppt

Page 9: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

What is the Scholarly Record?

•  It becomes challenging to define what the scholarly record is: where does it start and where does it end? •  Transforming from a stack of journals or a bunch of PDF files

into a dynamic network of interconnected assets and actors.

“An article about computational science in a scientific publication is not the scholarship itself, it is merely advertising of the scholarship. The actual scholarship is the complete software development environment, [the complete data] and the complete set of instructions which generated the figures.” David Donoho, “Wavelab and Reproducible Research,” 1995

Page 10: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Fixity is Challenged …

•  The ever-evolving nature of some assets challenges the notion of fixity as “forever frozen” and begs considering the notion of the “state of the scholarly record at a specific moment in time”. •  Evolution from the version of record to a version of the

record.

•  Whatever the boundaries of the scholarly record are, it will be essential to be able to look back at certain assets in order to understand how findings came about.

Page 11: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Consideration 2 – The Web as the Infrastructure

•  For quite some time, the Web has been the conduit for scholarly information. But, the scholarly endeavor is increasingly embedded into, native to, the Web.

•  From PDF to HTML. •  Social component: Contributors taking a central role. •  Machine component: Semantic, Linked Data technologies.

•  The Web is becoming the infrastructure for the Scholarly Record. •  Long Term Sustainability: Reuse of infrastructure (network, software, platforms, standards, etc.) that the entire world depends on. •  Integration of scholarly discourse with other Web-based discourse.

•  The special requirements of Scholarly Communication (certification, archiving, persistence, trust, annotation, metrics, …) must be addressed in an interoperable manner within the Web infrastructure, not in some parallel scholarly universe.

Page 12: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

The Web as the Infrastructure: alt-metrics

http://altmetrics.org/manifesto/

Page 13: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

http://impactstory.it/

Page 14: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

The HTTP URI is the Identifier

•  At the core of the Web are HTTP URIs.

•  The Web-based scholarly record works because of HTTP URIs.

•  Even when persistent identifiers are assigned to assets, contributors, and institutions they need to be instantiated as HTTP URIs in order to do anything useful with them on the Web.

•  cf. http://dx.doi.org/… •  same for ORCID, I2, pmid, etc.

•  Many non-traditional assets are born with an HTTP URI and never obtain a persistent identifier.

•  cf. presentations on SlideShare, software, ontologies, workflows, etc.

Page 15: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Page 16: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Existing Archival Infrastructure Assumes Fixity and Boundary

Page 17: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

The Web Exists in the Perpetual Now

Page 18: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

The Web Exists in the Perpetual Now

The lack of temporal capabilities of the Web has shaped our expectations.

•  We don’t object to prior versions not being available. We tolerate 404s.

•  Reviewer of Memento paper at WWW 2010: •  Is there (sic) any statistics to show that many or a good number

of Web users should like to get obsolete data or resources

•  Web archives are destinations, not integrated in the Web browsing experience.

Nelson, M.L. (2012) http://arxiv.org/abs/1209.2664

Page 19: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Not Accessible From cnn.com

Page 20: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Paper Era: Publication Context

Page 21: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Paper Era: Publication Context

Page 22: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Web Era: Publication Context

Page 23: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Web Era: Publication Context

Page 24: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Several Challenges

•  Archival approach and infrastructure to deal with dynamic, interdependent content

•  Referencing scholarly assets

•  Recreating a version of the scholarly record

Page 25: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

•  Is it possible to reconstruct the Web-based scholarly record as it was at a certain point in time?

•  For example, given a paper can one see the referenced/linked assets as they were at the time of publication of the paper?

•  The ability to reconstruct a version of the scholarly record will become increasingly important as the scholarly endeavor and discourse becomes increasingly dynamic and Web-based.

Recreating a Version of the Scholarly Record

Page 26: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

To Be Expected

Page 27: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Time-dependent decay of URLs published in MEDLINE abstracts

Wren J D, Bioinformatics, 2008;24:1381-1385

Most common types dead links were for computer programs (43%), followed by scholarly content (38%) and databases (19%)

Page 28: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

•  Content Management Systems

•  Web Archives

•  Transactional archives

•  Search engine caches

•  …

Traces of the Past Web Exist

Page 29: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

If Only It Would Be Possible to Follow a URI in Time

Page 30: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

It is with Memento

Digital Preservation Award 2010

http://www.mementoweb.org/

Page 31: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Today Select Date Jun 16 1997 Jun 16 1997

From Internet Archive

Time Travel

Page 32: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

June 16 1997

http://www.ntnu.no/ @ June 16 1997

Page 33: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Original Resources and Mementos

Page 34: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Bridge from Present to Past

Page 35: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Bridge from Past to Present

Page 36: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Memento Framework

Page 37: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Also with 404, etc.

Page 38: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Memento & IIPC

http://netpreserve.org/projects/memento

Page 39: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Memento & Wikipedia, Mediawiki

http://en.wikipedia.org/wiki/Wikipedia:Requests_for_comment/Memento

Page 40: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Memento & DBpedia

http://mementoweb.org/depot/native/dbpedia/

Page 41: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

To Be Expected

NOT IN ARCHIVE

Page 42: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

•  Is it possible to reconstruct the Web-based scholarly record as it was at a certain point in time?

•  For example, given a paper can one see the referenced materials as they were at the time of publication of the paper?

•  Example:

Van de Sompel, H., Payette, S., Erickson, J., Lagoze, C., and Warner, S. (2004) Rethinking scholarly communication: Building the System that Scholars Deserve. D-Lib Magazine, 10(9). doi:10.1045/september2004-vandesompel ; http://dx.doi.org/10.1045/september2004-vandesompel

Recreating a Version of the Scholarly Record

Page 43: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Published September 15 2004

Page 44: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Page 45: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Domain Gone

Page 46: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Archived copy December 5 2003

Page 47: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Page 48: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Current version

Page 49: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Archived copy December 11 2004

Page 50: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Page 51: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Resource gone

Page 52: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Archived copy December 5 2003

Page 53: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Page 54: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Resource gone

Page 55: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Archived copy unavailable

Page 56: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Page 57: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Current version

Page 58: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Archived copy August 26 2003

Page 59: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

•  Pilot study:

•  Papers from arXiv: 400,000 papers => 144,000 unique URIs

•  Thesis from UNT ETD repository: 3,600 papers => 18,000 URIs

•  URIs of established scholarly repositories removed (e.g. http://dx.doi.org), i.e. focusing in on the periphery of the scholarly record.

Citation Rot Studies at Scale with Memento

Sanderson, R., Phillips, M., and Van de Sompel, H. (2011) Analyzing the Persistence of Referenced Web Resources with Memento. Open Repositories 2011; Arxiv preprint. arXiv:1105.3459 ; http://arxiv.org/abs/1105.3459

Page 60: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

UNT

Page 61: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

arXiv

Page 62: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

UNT

Page 63: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

arXiv

Page 64: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Page 65: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

DOI Redirects to R1

Page 66: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Later, DOI Redirects to R2, then R3

Page 67: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

R1, R2, R3 Have Mementos

Page 68: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Looking for Memento of DOI with t in [t2,t3[

Page 69: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

End Up at Wrong Memento

Page 70: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Introduce Temporal Awareness for DOI Resolver

Page 71: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

End Up at Correct Memento

Page 72: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

But … the DOI Resolver Exists in the Perpetual Now

•  The latest information indicates that the DOI redirection history is currently not maintained

•  The situation is aggravated by multiple consecutive redirects at publisher’s end (which are likely not archived because of strict robots.txt rules)

•  While HTTP DOIs help achieve long-term workable links, they exist in the Perpetual Now like the rest of the Web’s URIs

Page 73: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Several Challenges

•  Archival approach and infrastructure to deal with dynamic, interdependent content

•  Referencing scholarly assets

•  Recreating a version of the scholarly record

Page 74: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Referencing Scholarly Assets

•  With Memento, the same HTTP URI can function as the reference to temporally evolving resources

•  But in order to reference the appropriate temporal version, both the HTTP URI and the desired time are needed. •  Essential for referencing resources in annotations

•  A few possibilities: •  Express URI and time as is currently done in citations – human

readable, not machine actionable •  Turn the reference into a tuple: URI and machine-actionable

annotation of the URI – allows expressing fragments of resources too

•  Use DURI scheme

Page 75: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

DURI

http://tools.ietf.org/html/draft-masinter-dated-uri

Page 76: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

duri:1997-06-17:http://www.ntnu.no

http://www.ntnu.no/ @ June 16 1997

Page 77: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

HTML5 Custom Protocol Handler

http://dev.opera.com/articles/view/html5-custom-protocol-and-content-handlers/

Page 78: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

HTML5 Custom Protocol Handler

http://dev.opera.com/articles/view/html5-custom-protocol-and-content-handlers/

Page 79: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

HTML5 Custom Protocol Handler

http://dev.opera.com/articles/view/html5-custom-protocol-and-content-handlers/

Page 80: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Referencing Scholarly Assets

Page 81: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Several Challenges

•  Archival approach and infrastructure to deal with dynamic, interdependent content

•  Referencing scholarly assets

•  Recreating a version of the scholarly record

Page 82: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Archival Approach

•  Archiving via a combination of “curated”, “at point of interaction”, and “in the wild” approaches:

o  CMS, wikis, datawikis with solid versioning mechanisms can play a significant role as archival hubs

o  Archiving the linked context at the time of publication (cf. WebCite), when submitted into institutional repository, etc.

o  Archiving at the moment of interaction with assets: reading, commenting, annotating, liking, tweeting, executing, etc.

o  Web archives come to the rescue for “in the wild” materials.

Page 83: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

SiteStory Transactional Archiving

http://mementoweb.github.com/SiteStory/

Page 84: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

SiteStory Transactional Archiving

http://mementoweb.github.com/SiteStory/

Page 85: Paint-Yourself-In-The-Corner Infrastructure

Herbert Van de Sompel Paint-Yourself-In-The-Corner Infrastructure

EMTACL 2012, Trondheim, Norway, October 1 2012

Conclusions

•  Scholarly assets are increasingly dynamic and interdependent •  The existing scholarly archiving infrastructure is about fixity and

boundary

•  Scholarly communication, and, as a matter of fact, the entire scholarly endeavor is increasingly Web-native

•  The Web exists in the perpetual now

•  This brings along significant challenges …

Page 86: Paint-Yourself-In-The-Corner Infrastructure

http://alexanderponting.blogspot.com/2012/01/how-to-paint-yourself-into-corner.html

@hvdsomp