richard rogers' slides from digital conversations event on 26/09/2013
TRANSCRIPT
WEB ARCHIVING: THEORIZED PRACTICES Prof. Richard Rogers University of Amsterdam
WEB ARCHIVING: THEORISEDPRACTICES
1. The Crisis in Web archive use 2. Addressing the crisis - Approaches to Web archive use ('Repurposing' web archive machines and building atop them)
Web archive use
WEB ARCHIVING: THEORISEDPRACTICES
>1. The Crisis in Web archive use 2. Addressing the crisis - Approaches to Web archive use (From the study of content to features, digital objects and code)
Web archive use
WEB ARCHIVING: THEORISEDPRACTICES
1. The Crisis in Web archive use >2. Addressing the crisis - Approaches to Web archive use ('Repurposing' web archive machines and building atop them)
Web archive use
WRITING (WEB) HISTORY FROM THE WEB ARCHIVES
>1) Single website history - Capture history of website, and playback as screencast documentary (time-lapsed photography) 2) Collection making. Build collections from _already_ archived websites (e.g., right-wing extremist sites) 3) Reconstruct periods of web history. Early blogosphere. Showwhat is missing from archive. Also give missing sites context. 4) Reconstruct history of the web. Tracker and cookie use bythe New York Times. Study code, not content.
Researcher usage of web archives - Amsterdam Digital Methods (digitalmethods.net)
WEBSITE ARCHIVE USE: HOWTO STUDY WHAT THEY OFFER
Follow Wayback Machine's interface - Privileges single-site histories. Capture a site's history. What does it tell?
How else to study it? As an archived object, awaiting use.
WRITING (WEB) HISTORY FROM THE WEB ARCHIVES
1) Single website history - Capture history of website, and playback as screencast documentary (time-lapsed photography) >2) Collection making. Build collections from _already_ archived websites (e.g., right-wing extremist sites) 3) Reconstruct periods of web history. Early blogosphere. Showwhat is missing from archive. Also give missing sites context. 4) Reconstruct history of the web. Tracker and cookie use bythe New York Times. Study code, not content.
Researcher usage of web archives - Amsterdam Digital Methods (digitalmethods.net)
STUDYING A WEBSITE COLLECTION
"This newspaper made an inventory of right-wing and extreme right-wing sites.... The newspaper found 150 of such sites that came and went over the past 10 years. The internet archive's wayback machine was used. We found that the number of right-wing and extreme right-wing sites grew after 11 September 2001, the rise of Pim Fortuyn...The growth was explosive... Thedifference in language use became smaller.... The internet is revealing a hardening of the Netherlands." - NRC Handelsblad (newspaper), 25 August 2007
On 20 September 2007, it fell out of the top 1,000 results.
UNDERSTANDING SOCIETAL CONDITIONS WITH THE WEB
WRITING (WEB) HISTORY FROM THE WEB ARCHIVES
1) Single website history. Capture history of website, and playback as screencast documentary (time-lapsed photography) 2) Collection making for social research. Build collections from _already_ archived websites (e.g., right-wing extremist sites) >3) Reconstruct periods of web history. Early blogosphere. Showwhat is missing from archive. Also give missing sites context. 4) Reconstruct history of the web. Tracker and cookie use bythe New York Times. Study code, not content.
Researcher usage of web archives - Amsterdam Digital Methods (digitalmethods.net)
The Archived Blogosphere: the EatonWeb Directory in the Internet Archive
Digital Methods and the Internet Archive: DMI Summer ‘09
Anaylsis_ Michael Stevenson and Marijn de Vries Hoogerwerf
Method_ Grab screenshot for earliest archived version of each
EatonWeb blog, order by date. Visualize ’missing’ blogosphere.
slashdot.org
wired.com
obscurestore.com robotwisdom.com
zdnet.com
flutterby.com cybereditions.com
amazon.com
facto.org
scripting.com
nytimes.com
washingtonpost.com
members.tripod.com
mercurycenter.com
mediagossip.com
windowseat.org
news.bbc.co.uk
cnn.com
salon.com julienne.com
newslinx.com
user.usonet.ne.jp
rc3.org
stuffeddog.com
osnews.com
dailynews.yahoo.com
catless.ncl.ac.uk
msnbc.com sfgate.com
camworld.com
pounce.vis.nu
memepool.com
dailygrail.com users.interport.net
peterme.com
my.netscape.com
microsoft.com
athens.net
tamu.edu
abcnews.go.com
geocities.com
refererlog.com
salonmagazine.com
digitalsquirrel.com
news.cnet.com
newscientist.com
news.com
theregister.co.uk
ntk.net latimes.com
chaparraltree.com
members.xoom.com
wonko.com tomalak.org
bump.net
usatoday.com
techdirt.com
us.imdb.com search.washingtonpost.com
overlawyered.com
boston.com rebeccablood.net researchbuzz.com
thenia.com
techweb.com
artsjournal.com
apple.com
thedixons.net
privacydigest.com
www-personal.umich.edu web.pitas.com
macronin.com
gilest.org
youknowwhat.com
hack-the-planet.felter.org
l0pht.com
alchemy.openjava.org
infoworld.com geeknews.net
my.userland.com
forbes.com
nandotimes.com
ultimatechaos.com
newsunlimited.co.uk
chicagotribune.com
thestandard.com
cgi.ebay.com
blogger.com
slate.com
ltseek.ltc.vanderbilt.edu
herring.com
solosier.com
kottke.org
shmooze.com
google.com
cdnow.com
roosh.com
theonion.com
worldnewyork.com
mar.anomy.net
bekkoame.ne.jp
eatonweb.com observer.com
dtheatre.com
newhomemaker.com
villagevoice.com
whump.com
byte.com
computerworld.com
calendarlive.com
evhead.com
dack.com
dandot.com
freshmeat.net
drudgereport.com
egroups.com
epinions.com
biz.yahoo.com
cgi.pathfinder.com
bradlands.com
mtv.com
seattletimes.com
sjmercury.com suntimes.com
thecounter.com
pcworld.com
nypostonline.com
thestranger.com
nospoon.org
tv-99-ad.com
news.excite.com
underbelly.org
userfriendly.org
mediainfo.com
magnetbox.com linkwatcher.com
megnut.com
yahoo.com
pigdog.org
alt0169.com pathfinder.com
indirection.skunkworks.cx unpopular.com
azstarnet.com
miscmedia.com forgo.net
metafilter.com
bud.com
zeldman.com
listology.com
musicinsight.com
coffee-a-gogo.com
pyra.com
metagrrrl.com
technocrat.net
brainsausage.com
The outlinks analyzed are directed at both archived and non-archived blogs, making it possible to estimate the latter’s position within the network.
The outlink analysis produces clusters: the subset highlighted here includes blogs more closely associated with tech news sources, including Computer World and CNet News.
The Archived Blogosphere: a Snapshot from 1999Method_ Gather outlinks from EatonWeb blogs archived in 1999, using the version closest to July 15. Perform cluster analysis and visualize network.Analysis_ Esther Weltevrede, Carolin Gerlitz, Anat Ben-David and Michael StevensonDMI Summer ‘09_ Digital Methods and the Internet Archive
WRITING (WEB) HISTORY FROM THE WEB ARCHIVES
1) Single website history - Capture history of website, and playback as screencast documentary (time-lapsed photography) 2) Collection making. Build collections from _already_ archived websites (e.g., right-wing extremist sites) 3) Reconstruct periods of web history. Early blogosphere. Showwhat is missing from archive. Also give missing sites context. >4) Reconstruct history of the web. Tracker and cookie use bythe New York Times. Study code, not content.
Researcher usage of web archives - Amsterdam Digital Methods (digitalmethods.net)
WRITING WEB HISTORY FROMTHE WEB ARCHIVES
Concluding remark: With Web archives, study content, but also website features, digital objects and code.
Researcher usage of web archives - Amsterdam Digital Methods (digitalmethods.net)