web archiving collaborations: a presentation for colleagues working in the libraries of the...

33
Web archiving collabora/ons at Columbia University Libraries Anna Perricci Columbia University Libraries Metropolitan Museum of Art (August 19, 2014)

Upload: anna-perricci

Post on 21-Jun-2015

318 views

Category:

Education


0 download

DESCRIPTION

These slides were used to support a presentation on web archiving collaborations for colleagues working in the Libraries of the Metropolitan Museum of Art.

TRANSCRIPT

Page 1: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Web  archiving  collabora/ons  at  Columbia  University  Libraries  

Anna  Perricci  

Columbia  University  Libraries  

Metropolitan  Museum  of  Art  (August  19,  2014)  

Page 2: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Web    Resources  Archiving  Collabora/on  

Many  thanks  to  the  Mellon  FoundaFon  

Building  collaboraFons  among  •  The  web  archiving  community  

•  Other  research  libraries  •  Users  and  potenFal  users  of  web  archives  •  Website  creators  

Page 3: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Incen/ves  grants  to    advance  web  archiving  tools  

Image  source:  hNp://imgur.com/gallery/vG7KE48  

Page 4: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Incen/ve  awards  projects  

Warcbase:  Building  a  Scalable  Web  Archiving  PlaWorm  on  HBase  and  Hadoop.  (Jimmy  Lin,  University  of  Maryland)  

Archiving  TransacFons  Towards  UninterrupFble  Web  Service  (Zhiwu  Xie  and  Edward  A.  Fox,  Virginia  Tech  University)  

Page 5: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Incen/ve  awards  projects  

Visualizing Digital Collections of Web Archives (Michele Weigle, Old Dominion University)

Tools for Managing Seed URLs (Michael Nelson, Old Dominion University)

Page 6: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Incen/ve  awards  projects  

Perma.cc:  MiFgaFng  the  Pervasive  Problem  of  Link  Rot  in  Scholarly  Works  and  Preserving  Online  Content  (Kim  Dulin,  The  Harvard  Library  InnovaFon  Lab)  

Free  Law  Project    

 Providing  free  access  to  primary  legal  materials,  developing  legal  research  tools,  and  supporFng  academic  research  on  legal  corpora  

Page 7: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Building  an  efficient  and  scalable  na/onal  framework  for  collec/ng  web  content    

Image  source:  hNp://imgur.com/gallery/1m5MBKf      

Page 8: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Designated  space  for  collabora/ve  collec/ng  

Page 9: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Collabora/ve  Architecture,  Urbanism  and  Sustainability  Web  Archive  (CAUSEWAY)  

hNps://archive-­‐it.org/collecFons/4638    

Page 10: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Collabora/on  with  music  librarians  

Page 11: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Contemporary  composers—the  perfect  storm?  

Page 12: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Contemporary  Composers  Web  Archive  

Selectors  

•  Borrow  Direct  Music  Librarians  Group:  music  librarians  at  Brown,  Columbia,  Cornell,  Dartmouth,  Harvard,  Johns  Hopkins,  Princeton,  and  Yale  universiFes,  MIT,  and  the  universiFes  of  Chicago  and  Pennsylvania  

Cataloging  exper/se  

•  Russell  MerriN  (cataloger  specializing  in  music  resources)  •  Kate  Harcourt  (Director  of  Original  and  Special  Materials  Cataloging)  

•  Alex  Thurman  (Web  Resources  CollecFon  Coordinator)  

Page 13: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Contemporary  Composers  Web  Archive  hNps://archive-­‐it.org/collecFons/4019    

Page 14: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Quality  Assurance  

Page 15: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Crea/ng  MARC  records  for  web  archives  

•  CreaFng  MARC  records  for  archived  websites  is  standard  pracFce  at  CUL  – MARC  records  make  web  archives  discoverable  in  CLIO  (Columbia  Libraries  InformaFon  Online)  

•  CollecFon  level  and  seed  level  records  

•  Will  use  Archive-­‐It  interface  to  make  Dublin  Core  records  

Page 16: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Patron  view  of  record  in  CLIO  

Page 17: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Cataloger’s  view  of  record  in  CLIO  

Page 18: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

An/cipa/ng  wider  use  of  MARC  records  

•  Records  have  been  released  to  WorldCat  

•  Collaborators  on  cataloging  were  aNenFve  to  which  fields  will  ordinarily  be  stripped  out  when  a  MARC  record  is  imported  to  another  insFtuFon’s  OPAC  

Page 19: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

CCWA  MARC  records  

•  So  far  sample  of  10  records  has  taught  us…  

•  PosiFve  feedback  from  music  librarians  

•  Next  we  will  add  another  44  records  for  the  archived  sites  in  CCWA  soon  

Page 20: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Project  tracking  

Page 21: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Use  cases  

Page 22: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Who  are  the  web  archives  for?    Are  they  being  used?    Could  we  encourage  more  effec/ve  use?  

Page 23: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

hSp://hrwa.cul.columbia.edu  

Page 24: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Using  the  Human  Rights  Web  Archive  &  learning  from  human  rights  scholars’  work  (publica/ons,  cita/ons)  

Page 25: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Cita/ons  scraped  from  ar/cles  published  in  2010  in  select  scholarly  journals  

Page 26: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Isola/ng  URLs  from  list  of  cita/ons  (approximately  10%  of  cita/ons  scraped  have  URLs  in  them)  

Page 27: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Best  Prac/ces  for  site  creators:  working  with  website  creators  

Image  source:  hNp://imgur.com/gallery/NWJ12Pl    

Page 28: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Open  issues:  division  and  maintenance  of  coopera/ve  efforts  

(communica/on,  so]ware  and  more)  

Page 29: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Process  over  next  16  months  

•  Further  planning  (revision  as  needed)  and  user  interviews  •  Maintain  group  communicaFon  

•  Ongoing  growth  (scale  of  collecFng  and  distribuFon  of  effort)  •  Present  shared  costs  and  sustainability  models  (currently  in  

development)  

•  3-­‐5  year  plan  for  Borrow  Direct  collaboraFons  (collecFons  strategy,  finances,  workflows  and  governance)  

•  If  collaboraFon  persists,  idenFfy  themes  for  further  collecFng  

•  Catalog  resources  to  high  standards  •  Quality  Assurance  and  ongoing  evaluaFon  

Page 30: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Web  archiving  ini/a/ves    focusing  on  art  resources  

An  iniFaFve  designed  to  address  the  “urgent  need  to  document  the  dynamic  web-­‐based  versions  of  aucFon  catalogues,  catalogues  raisonnés,  and  scholarly  research  projects,  as  well  as  arFst,  gallery,  and  museum  websites”  (hNp://www.nyarc.org/content/web-­‐archiving)  

ArFsts  Files  Special  Interest  Group  

Page 31: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Ques/ons?  

Image  source:  hNp://imgur.com/gallery/qoCqQoh    

Page 32: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Resources  that  came  up  in  the  Q  &  A  

•  Internet  Archive  "Save  a  Page"  Plug-­‐In  for  Chrome  hNps://github.com/lintool/chrome-­‐archive-­‐this-­‐page    

•  SAA  Web  Archiving  Roundtable  hNp://webarchivingrt.wordpress.com/    

Page 33: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art

Thanks!  

Anna  Perricci  

[email protected]    @AnnaPerricci    

Columbia  University  Libraries