Richard Rogers' slides from Digital Conversations event on 26/09/2013


Upload: digital-research-and-curator-team-british-library

Posted on 28-May-2015


TRANSCRIPT

Page 1: Richard Rogers' slides from Digital Conversations event on 26/09/2013

WEB ARCHIVING: THEORIZED PRACTICES Prof. Richard Rogers University of Amsterdam

Page 2:

WEB ARCHIVING: THEORISED PRACTICES

1. The Crisis in Web archive use
2. Addressing the crisis - Approaches to Web archive use ('Repurposing' web archive machines and building atop them)

Web archive use

Page 3:

WEB ARCHIVING: THEORISED PRACTICES

>1. The Crisis in Web archive use
2. Addressing the crisis - Approaches to Web archive use (From the study of content to features, digital objects and code)

Web archive use

Page 4:
Page 5:
Page 6:
Page 7:

WEB ARCHIVING: THEORISED PRACTICES

1. The Crisis in Web archive use
>2. Addressing the crisis - Approaches to Web archive use ('Repurposing' web archive machines and building atop them)

Web archive use

Page 8:

WRITING (WEB) HISTORY FROM THE WEB ARCHIVES

>1) Single website history - Capture history of website, and play back as screencast documentary (time-lapsed photography)
2) Collection making. Build collections from _already_ archived websites (e.g., right-wing extremist sites)
3) Reconstruct periods of web history. Early blogosphere. Show what is missing from archive. Also give missing sites context.
4) Reconstruct history of the web. Tracker and cookie use by the New York Times. Study code, not content.

Researcher usage of web archives - Amsterdam Digital Methods (digitalmethods.net)

Page 9:

WEBSITE ARCHIVE USE: HOW TO STUDY WHAT THEY OFFER

Follow Wayback Machine's interface - Privileges single-site histories. Capture a site's history. What does it tell?

How else to study it? As an archived object, awaiting use.

Page 10:
Page 11:

WRITING (WEB) HISTORY FROM THE WEB ARCHIVES

1) Single website history - Capture history of website, and play back as screencast documentary (time-lapsed photography)
>2) Collection making. Build collections from _already_ archived websites (e.g., right-wing extremist sites)
3) Reconstruct periods of web history. Early blogosphere. Show what is missing from archive. Also give missing sites context.
4) Reconstruct history of the web. Tracker and cookie use by the New York Times. Study code, not content.

Researcher usage of web archives - Amsterdam Digital Methods (digitalmethods.net)

Page 12:
Page 13:
Page 14:
Page 15:
Page 16:

STUDYING A WEBSITE COLLECTION

"This newspaper made an inventory of right-wing and extreme right-wing sites.... The newspaper found 150 of such sites that came and went over the past 10 years. The Internet Archive's Wayback Machine was used. We found that the number of right-wing and extreme right-wing sites grew after 11 September 2001, the rise of Pim Fortuyn... The growth was explosive... The difference in language use became smaller.... The internet is revealing a hardening of the Netherlands." - NRC Handelsblad (newspaper), 25 August 2007

On 20 September 2007, it fell out of the top 1,000 results.

UNDERSTANDING SOCIETAL CONDITIONS WITH THE WEB

Page 17:

WRITING (WEB) HISTORY FROM THE WEB ARCHIVES

1) Single website history. Capture history of website, and play back as screencast documentary (time-lapsed photography)
2) Collection making for social research. Build collections from _already_ archived websites (e.g., right-wing extremist sites)
>3) Reconstruct periods of web history. Early blogosphere. Show what is missing from archive. Also give missing sites context.
4) Reconstruct history of the web. Tracker and cookie use by the New York Times. Study code, not content.

Researcher usage of web archives - Amsterdam Digital Methods (digitalmethods.net)

Page 18:

The Archived Blogosphere: the EatonWeb Directory in the Internet Archive

Digital Methods and the Internet Archive: DMI Summer ‘09

Analysis_ Michael Stevenson and Marijn de Vries Hoogerwerf

Method_ Grab screenshot for earliest archived version of each EatonWeb blog, order by date. Visualize ‘missing’ blogosphere.
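
The selection step of this method (earliest archived version of each blog, ordered by date) could be sketched roughly as follows. This is only an illustration, not the DMI team's tooling: the capture list and the `.test` blog URLs are hypothetical sample data, the function names are made up here, and taking the screenshots themselves is left out. The one external convention assumed is the Wayback Machine replay address, which places a 14-digit timestamp before the original URL.

```python
from datetime import datetime

# Hypothetical sample data: (blog URL, 14-digit capture timestamp) pairs.
# In practice such pairs would come from an archive's capture index.
CAPTURES = [
    ("http://example-blog-a.test/", "20000301120000"),
    ("http://example-blog-a.test/", "19991105083000"),
    ("http://example-blog-b.test/", "19990212000000"),
    ("http://example-blog-c.test/", "20010704101500"),
]


def earliest_captures(captures):
    """Return one (url, datetime) pair per URL, earliest capture only, sorted by date."""
    earliest = {}
    for url, stamp in captures:
        when = datetime.strptime(stamp, "%Y%m%d%H%M%S")
        if url not in earliest or when < earliest[url]:
            earliest[url] = when
    return sorted(earliest.items(), key=lambda item: item[1])


def wayback_url(url, when):
    """Build a Wayback Machine replay address for a given capture time."""
    return "https://web.archive.org/web/{}/{}".format(when.strftime("%Y%m%d%H%M%S"), url)


if __name__ == "__main__":
    # One line per blog, oldest first: the order in which screenshots would be laid out.
    for url, when in earliest_captures(CAPTURES):
        print(when.date(), wayback_url(url, when))
```

Blogs that never appear in the capture list would be the 'missing' ones; the sketch only orders those that were archived.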

Page 19:

[Figure: network graph of outlinks from EatonWeb blogs archived in 1999. Node labels are the linked domains, among them slashdot.org, wired.com, scripting.com, nytimes.com, computerworld.com, news.cnet.com, blogger.com, kottke.org, metafilter.com and eatonweb.com.]

The outlinks analyzed are directed at both archived and non-archived blogs, making it possible to estimate the latter’s position within the network.

The outlink analysis produces clusters: the subset highlighted here includes blogs more closely associated with tech news sources, including Computer World and CNet News.

The Archived Blogosphere: a Snapshot from 1999

Method_ Gather outlinks from EatonWeb blogs archived in 1999, using the version closest to July 15. Perform cluster analysis and visualize network.

Analysis_ Esther Weltevrede, Carolin Gerlitz, Anat Ben-David and Michael Stevenson

DMI Summer ‘09_ Digital Methods and the Internet Archive
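
The first two steps of this method (choosing the version closest to July 15, 1999 and gathering outlinks) might look something like the sketch below. It is an assumption-laden illustration rather than the analysts' actual pipeline: the sample HTML, the `.test` blog address and all function and class names are hypothetical, and the cluster analysis and visualization steps are not attempted. The resulting host sets would still need to be handed to a network analysis tool.

```python
from datetime import datetime
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

TARGET = datetime(1999, 7, 15)  # the date named in the method description


def closest_capture(stamps, target=TARGET):
    """Pick the 14-digit capture timestamp nearest to the target date."""
    return min(stamps, key=lambda s: abs(datetime.strptime(s, "%Y%m%d%H%M%S") - target))


class LinkCollector(HTMLParser):
    """Collect href values from anchor tags."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def outlink_hosts(page_url, html):
    """Return the set of external hosts a page links to (internal links are skipped)."""
    collector = LinkCollector()
    collector.feed(html)
    own_host = urlparse(page_url).netloc.lower()
    hosts = set()
    for href in collector.hrefs:
        host = urlparse(urljoin(page_url, href)).netloc.lower()
        if host and host != own_host:
            hosts.add(host)
    return hosts


# Hypothetical archived page, for illustration only.
SAMPLE_HTML = """
<a href="/archive.html">older posts</a>
<a href="http://slashdot.org/">Slashdot</a>
<a href="http://www.wired.com/news/">Wired</a>
<a href="mailto:me@example-blog.test">mail</a>
"""

if __name__ == "__main__":
    print(closest_capture(["19990301000000", "19990801000000", "20000101000000"]))
    print(sorted(outlink_hosts("http://example-blog.test/", SAMPLE_HTML)))
```

Because outlinks are recorded by host whether or not the target was itself archived, non-archived blogs still show up as nodes, which is what allows their position in the network to be estimated.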

Page 20:

WRITING (WEB) HISTORY FROM THE WEB ARCHIVES

1) Single website history - Capture history of website, and play back as screencast documentary (time-lapsed photography)
2) Collection making. Build collections from _already_ archived websites (e.g., right-wing extremist sites)
3) Reconstruct periods of web history. Early blogosphere. Show what is missing from archive. Also give missing sites context.
>4) Reconstruct history of the web. Tracker and cookie use by the New York Times. Study code, not content.

Researcher usage of web archives - Amsterdam Digital Methods (digitalmethods.net)
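
To make "study code, not content" concrete, one possible approach is to read an archived page's markup for third-party resources and compare their hosts against a list of known trackers. The sketch below is a guess at what such a check could look like, not the method used in the study: the tracker list, the sample markup, the page address and every function name are hypothetical, the HTML is assumed to be as originally served (not rewritten by a replay interface), and cookies set through HTTP headers are not covered.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

# Hypothetical signature list, for illustration only; a real study would
# need a curated list of known tracker domains.
TRACKER_DOMAINS = {"tracker.example", "ads.example"}


class ResourceCollector(HTMLParser):
    """Collect src values from tags that load external resources."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "img", "iframe"):
            for name, value in attrs:
                if name == "src" and value:
                    self.sources.append(value)


def third_party_hosts(page_url, html):
    """Return hosts of embedded resources that differ from the page's own host."""
    collector = ResourceCollector()
    collector.feed(html)
    own_host = urlparse(page_url).netloc.lower()
    hosts = set()
    for src in collector.sources:
        host = urlparse(urljoin(page_url, src)).netloc.lower()
        if host and host != own_host:
            hosts.add(host)
    return hosts


def matched_trackers(hosts, tracker_domains=TRACKER_DOMAINS):
    """Keep hosts that equal, or are subdomains of, a listed tracker domain."""
    return {h for h in hosts if any(h == d or h.endswith("." + d) for d in tracker_domains)}


# Hypothetical archived front page, for illustration only.
SAMPLE_HTML = """
<script src="/js/site.js"></script>
<script src="http://cdn.tracker.example/t.js"></script>
<img src="http://ads.example/pixel.gif" width="1" height="1">
<img src="http://images.partner.example/logo.gif">
"""

if __name__ == "__main__":
    hosts = third_party_hosts("http://news-site.example/", SAMPLE_HTML)
    print(sorted(matched_trackers(hosts)))
```

Running the same check over captures from successive years would give a per-year count of matched trackers, which is one way of turning a site's archived code into a history.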

Page 21:
Page 22:
Page 23:
Page 24:

WRITING WEB HISTORY FROM THE WEB ARCHIVES

Concluding remark: With Web archives, study content, but also website features, digital objects and code.

Researcher usage of web archives - Amsterdam Digital Methods (digitalmethods.net)