mapping the guardian's tags to the web of data

28
Mapping the Guardian's tags to the web of data Peter Martin & Martin Belam Guardian News & Media November 2010

Upload: martin-belam

Post on 01-Nov-2014

3.251 views

Category:

Technology


0 download

DESCRIPTION

Presentation given by Peter Martin & Martin Belam at Online Information 2010 in London, showing how the Guardian applies tag metadata to content, derives value from that metadata, and has begun to map Guardian tags to the linked open data ecosphere.

TRANSCRIPT

Page 1: Mapping the Guardian's tags to the web of data

Mapping the Guardian's tags to the web of data

Peter Martin & Martin Belam

Guardian News & MediaNovember 2010

Page 2: Mapping the Guardian's tags to the web of data

Our content model relies on tags...

...which are not anywhere near as boring as you think

Keywords

Contributor

Series

Publication

Tone

Article

Video

Audio

Gallery

Cartoon

Tags Content

Keywords

Page 3: Mapping the Guardian's tags to the web of data

Every piece of content carries aselection of hand-picked tags

Page 4: Mapping the Guardian's tags to the web of data

They are added during content production

...and the system suggests them as you type

Page 5: Mapping the Guardian's tags to the web of data

There is also a tag browser in the CMS

Page 6: Mapping the Guardian's tags to the web of data

And a search so that you can 'Batch Tag'

Page 7: Mapping the Guardian's tags to the web of data

There is an admin interface to manage tags...

Page 8: Mapping the Guardian's tags to the web of data

...and generate reports on what has been created

Page 9: Mapping the Guardian's tags to the web of data

On the site they give us related links & tag pages

(OK, that is admittedly a little bit boring)

Page 10: Mapping the Guardian's tags to the web of data

They allow us to cross-promote content

A film review for "The Damned United" is inamongst the football stories

Page 11: Mapping the Guardian's tags to the web of data

And we can create 'combiner' pages with them...

Page 12: Mapping the Guardian's tags to the web of data

...many of which are more useful than bullfighting+vuvuzelas

This page is assembled automatically by combiningthe 'review' tone with the 'books' section

Page 13: Mapping the Guardian's tags to the web of data

Tags are used to place editorial components

Stories tagged with 'Apple' in the Technology section display recent tweets on the topic by Guardian contributors

Page 14: Mapping the Guardian's tags to the web of data

And to customise commercial components

Adverts that appear in the Guardian Jobs slotare tuned by the tags applied to article content

Page 15: Mapping the Guardian's tags to the web of data

Topical navigation on the iPhone

The Guardian iPhone app uses tags to providelateral navigation into topics

Page 16: Mapping the Guardian's tags to the web of data

Topical navigation on the iPhone

The Guardian iPhone app uses tags to providelateral navigation into topics

Page 17: Mapping the Guardian's tags to the web of data

Trending on the iPhone

The iPhone app also examines the tags withthe most activity, to produce the 'trending' topic index

Page 18: Mapping the Guardian's tags to the web of data

Tags help with search results

We use links to tag pages as results for synonymsand near-synonyms commonly used by readers

Page 19: Mapping the Guardian's tags to the web of data

Tags can go in folders

...and we can turn those folders into A-Z listsand navigation on the website

Page 20: Mapping the Guardian's tags to the web of data

And our tags are on Twitter

To our knowledge, they are the only bit of our informationarchitecture to have an official presence on Twitter

Page 21: Mapping the Guardian's tags to the web of data

Now our tags are entering the world of linked data

Page 22: Mapping the Guardian's tags to the web of data

Our book reviews carry ISBNs

Page 23: Mapping the Guardian's tags to the web of data

And our content API can be queried by ISBN

http://explorer.content.guardianapis.com/

Page 24: Mapping the Guardian's tags to the web of data

Our artist tag pages have MusicBrainz IDs associated with them

Page 25: Mapping the Guardian's tags to the web of data

And the API can be queried by MusicBrainz ID

Page 26: Mapping the Guardian's tags to the web of data

Why XML and JSON?

And not something a little more rich and semantic?

Page 27: Mapping the Guardian's tags to the web of data

Where do we go next?

Where can we get the most linked data for the least effort?What will be used in the real world?

Page 28: Mapping the Guardian's tags to the web of data

Mapping the Guardian's tags to the web of data

Peter Martin & Martin Belam

Guardian News & MediaNovember 2010

@currybet@guardian_tags