experiments towards reverse linking on the web

26
Experiments Toward Reverse Linking on the Web Yeliz Yesilada, Darren Lunn and Simon Harper Information Management Group University of Manchester

Upload: darren-lunn

Post on 08-Jul-2015

2.630 views

Category:

Technology


0 download

DESCRIPTION

Multi-headed reverse linking (incoming links) is a fundamental concept of Open Hypermedia Systems. However, this bi-directionality has been lost in the move to the World Wide Web (Web). Here, we suggest a Web based solution for rediscovering these reverse links, and develop a series of experiments to demonstrate our approach. Simply our algorithm involves parsing a Web server’s log file, identifying each Web page viewed and saving an ordered list of referrers within a 'name–matched' XML file. This file is then used as a link point within a standard XHTML Web–page using a freely available Javascript library. While we have not performed any comprehensive user evaluation initial qualitative results suggest users are positive regarding our additions and that widespread adoption would increase user satisfaction due to constancy of the browsing experience.

TRANSCRIPT

Page 1: Experiments Towards Reverse Linking on the Web

Experiments Toward Reverse Linking on the Web

Yeliz Yesilada, Darren Lunn and Simon Harper

Information Management Group

University of Manchester

Page 2: Experiments Towards Reverse Linking on the Web

Links and Browsing

• Links Allow Movement in Information Space

• Etymology of Browsing– To nibble at leaves, tender shoots, or other soft

vegetation

• A User Is In Control of What to Read or Examine

Page 3: Experiments Towards Reverse Linking on the Web

Current Web Model

• Closed Hypermedia System

• Links Embedded Within the Document By The Author

• Outbound Uni-Directional Links

• Limits the Users Browsing Experience

A B C

Page 4: Experiments Towards Reverse Linking on the Web

Bi-Directional Linking

• Used in Open Hypermedia Systems

• Users Can Travel in Both Directions

• Links Stored in a Separate Link Base

• Links Generated Dynamically

?

A B?

Page 5: Experiments Towards Reverse Linking on the Web

Existing Bi-Directional Web Linking

• Back Button– Uses the Browser Cache– User Only Knows About Pages Previously Visited

• Surfing The Web Backwards (Chakrabati ‘99)– Netscape Browser Extension– Web Server Extension

• Trackback– An Acknowledgement Between Sites that a Link

Exists– Both Sites Need to Be Trackback Enabled

Page 6: Experiments Towards Reverse Linking on the Web

Our Approach

• Use Web Logs To Establish Who Links To Our Website

• Reduced Spam Threat as Users Must Click on a Link

• Links Available to Any JavaScript Supporting Browser

Page 7: Experiments Towards Reverse Linking on the Web

Architecture

Web Page +

Browser

Client-Side

WebServer

Server-Side

1. User Clicks A Link To Request a Web Page

1

Page 8: Experiments Towards Reverse Linking on the Web

Architecture

Web Page +

Browser

Client-Side

WebServer

Log File

Server-Side

1. Server Records Request

2

Page 9: Experiments Towards Reverse Linking on the Web

Architecture

Web Page +

Browser

Client-Side

WebServer

Log File Log Processor Pages.xml

Server-Side

1. Log Processor Parses Log To Create Linkbase

3

Page 10: Experiments Towards Reverse Linking on the Web

Architecture

Web Page +

Browser

Client-Side

WebServer

Log File Log Processor Pages.xml

Pages.html

Server-Side

1. Link Base is Added To Page

4

Page 11: Experiments Towards Reverse Linking on the Web

Architecture

Web Page +

Browser

Client-Side

WebServer

Log File Log Processor Pages.xml

Pages.html

Server-Side

1. Web page Plus Reverse Links Sent To User

5

Page 12: Experiments Towards Reverse Linking on the Web

User Follows Link (1)

Page 13: Experiments Towards Reverse Linking on the Web

Server Creates Web Log (2)

• Web Server Logs HTTP Requests– Page Requested– Destination Client of the Requested Page

• Also Logs Additional Information– The Page Where the User Clicked the Link to

Request Page– Client Platform

• W3C Extended Log File Format

Page 14: Experiments Towards Reverse Linking on the Web

Example Web Log

01: 130.88.199.206 02: - 03: - 04: [08/Aug/2007:18:30:39 +0000] 05: "GET /ht07/index.php HTTP/1.1" 06: 200 07: 3811 08: "http://markbernstein.org/ 09: "Mozilla/5.0 (Windows NT 5.1; en-GB;) Gecko/20061204 Firefox/2.0.0.1"

Page 15: Experiments Towards Reverse Linking on the Web

Linkbase Creation (3)

• Parse the Log File for Referrer / Get Request Pairs

• Create Simple XML File

• Each Webpage has a Corresponding XML Linkbase– index.php index.xml

• Individual XML Linkbases Allow– Reduced Processing on the Server– Reduced Delay on the Client

Page 16: Experiments Towards Reverse Linking on the Web

Example Linkbase (index.xml)

<linkbase> <link> <title>Home page of Mark Bernstein</title> <url>http://markbernstein.org/</url> </link> <link> <title>HCI Conference and Workshops</title> <url>http://degraaff.org/hci/conference.html</url> </link> <link> <title>D-Lib Workshops and Conferences: 2007</title> <url>http://dlib.org/groups.html</url> </link> . . . </linkbase>

Page 17: Experiments Towards Reverse Linking on the Web

Links Added To The Page (4)

• Add JavaScript To Each Webpage

• Widely Supported By Most Browser Software

• When Page is Loaded, Look For Corresponding Linkbase

• Extracts Links From Linkbase

• Add Links to Page

Page 18: Experiments Towards Reverse Linking on the Web

Displaying Links - Menu (5)

• As Part of the Menu

• Immediately Available For Use

• Menu Size Increases Significantly

Page 19: Experiments Towards Reverse Linking on the Web

Displaying Links - Menu (5)

Page 20: Experiments Towards Reverse Linking on the Web

Displaying Links - Breadcrumb (5)

• Breadcrumbs Act As Navigation Aids

• They Inform Users Where They Are Within a Website

• Reverse Links Recommend Common Paths To Get To The Current Page

• Add A “Recommender” Extension To The Breadcrumb Trail

Page 21: Experiments Towards Reverse Linking on the Web

Displaying Links - Breadcrumb (5)

Page 22: Experiments Towards Reverse Linking on the Web

Evaluation

• Technical Evaluation– In the Lab– Live on the Hypertext Website

• No User Evaluation– Previous Work has Show Reverse Linking Can

Enhance Web Browsing [Chakrabati ‘99]

Page 23: Experiments Towards Reverse Linking on the Web

Issues To Address

• How Often Should The Log File be Parsed?– Too Frequent - May slow down the server speed– Too Infrequent - Links may be out of date– Monthly - Anecdotally this seemed to work OK

• How Do We Manage The Link Box Size?– We only added links that occurred more than once– Could use time to keep only the most recently

followed links

Page 24: Experiments Towards Reverse Linking on the Web

Issues To Address

• Can Fine Grained Linking Be Achieved?– We link to the page– Is it possible to link to fragments eg Blogs?

• How Do We Ensure Link Quality?– Some referrers were password protected– Some pages had been relocated eg Blogs– Some pages might be spam

Page 25: Experiments Towards Reverse Linking on the Web

Conclusions

• Reverse Linking Is Possible Using Server Logs

• Our Technique is Platform Independent

• Enhance Users Browsing Experience

• This Is A First Step - More Investigation Is Required

Page 26: Experiments Towards Reverse Linking on the Web

Questions

http://hcw.cs.manchester.ac.uk/