![Page 1: Harpers.org: a Semantic Web(ish) site for Harper’s Magazine Paul Ford Associate Web Editor, Harpers.org ford@harpers.org](https://reader035.vdocuments.site/reader035/viewer/2022062409/56649f515503460f94c74df9/html5/thumbnails/1.jpg)
Harpers.org: a Semantic Web(ish) site for Harper’s Magazine
Paul Ford, Associate Web Editor, Harpers.org (ford@harpers.org)
Harper’s is…
- A magazine of literature, politics, culture, and the arts published continuously from 1850
- A small non-profit
Available content
- The Weekly Review, an emailed summary of world events, from 2000
- The Harper’s Index, a statistical portrait of the world, from 1998
- Public domain, scanned-in archives from 1850-1982
- Readings
- Occasional features
And that’s it.
- Maybe full text of issues will be offered someday, but not soon. So…
- How do we get more value out of limited content?
Solution
- Hack up what we have into bits by content type, then…
- Reassemble it according to link targets…
- Which are arranged in a taxonomy…
- Creating a very small “Semantic Web” for Harpers.org
A quick demo…
- >>>
How it works
- Simple set of ontological relationships (partOf, supervisorOf)
- Taxonomy of content
- & narrative content
- that is split into smaller pieces
- & links into the taxonomy
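
As a rough illustration of how a `partOf` relationship can drive a taxonomy, here is a minimal Python sketch. The topic names and the dictionary layout are hypothetical and are not taken from the Harpers.org data; the site itself was built with XSLT, not Python.

```python
# Hypothetical taxonomy: each topic maps to the topic it is partOf.
PART_OF = {
    "Baghdad": "Iraq",
    "Iraq": "MiddleEast",
    "MiddleEast": "World",
}


def ancestors(topic):
    """Follow partOf links upward and return the chain of broader topics."""
    chain = []
    seen = {topic}
    while topic in PART_OF:
        topic = PART_OF[topic]
        if topic in seen:  # guard against an accidental cycle in the data
            break
        seen.add(topic)
        chain.append(topic)
    return chain


print(ancestors("Baghdad"))
```

A link to the narrowest topic can then surface the same piece of content on the page of every broader topic in the chain.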
Markup
- Text: “Country Y announced that it had cut off relations with country Z. On Wednesday, something happened to persons X and Y.”
Markup
<event>Country Y announced that it had cut off relations with country Z.</event>
<event>On Wednesday, something happened to persons X and Y.</event>
Markup
<event on="2004-03-12" id="24848">
Country Y announced that it had cut off relations with country Z.
</event>
Markup
<event on="2004-03-12" id="24848">
<link to="#CountryY">Country Y</link> announced that it had cut off relations with <link to="#CountryZ">country Z</link>.
</event>
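
To show what this markup makes available to a program, here is a small Python sketch that reads one event and pulls out its date, id, link targets, and plain text. It uses the standard library's `xml.etree.ElementTree`; the function name `parse_event` and the returned dictionary shape are assumptions made for illustration, not part of the original system.

```python
import xml.etree.ElementTree as ET

SAMPLE = """<event on="2004-03-12" id="24848">
<link to="#CountryY">Country Y</link> announced that it had cut off relations with <link to="#CountryZ">country Z</link>.
</event>"""


def parse_event(xml_text):
    """Return the event's attributes, link targets, and whitespace-normalized text."""
    event = ET.fromstring(xml_text)
    # Link targets are written as "#Name"; drop the leading "#".
    targets = [link.get("to").lstrip("#") for link in event.iter("link")]
    # itertext() walks all text inside the element, including inside <link>.
    text = " ".join("".join(event.itertext()).split())
    return {
        "id": event.get("id"),
        "on": event.get("on"),
        "targets": targets,
        "text": text,
    }


print(parse_event(SAMPLE))
```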
Conditionals
- Some text required conditional markup
- Text: “Country Y announced that it had cut off relations with country Z, and on Wednesday, something happened to persons X and Y.”
Conditionals: ugly, but simple
<event>Country Y announced that it had cut off relations with country Z<cond is="id">, and</cond><cond not="id">.</cond></event>
<event><cond is="id">on</cond><cond not="id">On</cond> Wednesday, something happened to persons X and Y.</event>
Conditionals: ugly, but simple
- Narrative version
  - Country Y announced that it had cut off relations with country Z, and on Wednesday, something happened to persons X and Y.
- Timeline-friendly version
  - Country Y announced that it had cut off relations with country Z.
  - On Wednesday, something happened to persons X and Y.
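
The two renderings can be produced from one marked-up source. The Python sketch below is one way to do it, under two assumptions that the slides do not spell out: that `<cond is="id">` marks the branch used in the running narrative and `<cond not="id">` marks the branch used when an event stands alone, and that the events sit inside a wrapper element (the `<passage>` element here is hypothetical).

```python
import xml.etree.ElementTree as ET

SOURCE = (
    "<passage>"
    "<event>Country Y announced that it had cut off relations with country Z"
    '<cond is="id">, and</cond><cond not="id">.</cond></event> '
    '<event><cond is="id">on</cond><cond not="id">On</cond> Wednesday, '
    "something happened to persons X and Y.</event>"
    "</passage>"
)


def flatten(node, narrative):
    """Return the text of node, keeping only the <cond> branch that applies."""
    if node.tag == "cond":
        is_narrative_branch = "is" in node.attrib  # assumed meaning of is/not
        if is_narrative_branch != narrative:
            return ""
    out = node.text or ""
    for child in node:
        # A child's tail text belongs to the parent, so it is always kept.
        out += flatten(child, narrative) + (child.tail or "")
    return out


root = ET.fromstring(SOURCE)
print(flatten(root, narrative=True))
for event in root.iter("event"):
    print("-", flatten(event, narrative=False))
```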
All of it gets slurped up
- And turned into a set of triples
- Then processed in-memory
- With HTML pages spit out as a result
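
The pipeline above can be sketched in a few lines of Python. This is not the site's actual XSLT code: the predicate names (`on`, `text`, `mentions`), the `<events>` wrapper, and the second sample event (id 24849) are all made up for illustration.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict
from html import escape

SOURCE = """<events>
<event on="2004-03-12" id="24848"><link to="#CountryY">Country Y</link> announced that it had cut off relations with <link to="#CountryZ">country Z</link>.</event>
<event on="2004-03-10" id="24849">Something happened in <link to="#CountryZ">country Z</link>.</event>
</events>"""


def to_triples(xml_text):
    """Slurp the events into (subject, predicate, object) triples."""
    triples = []
    for event in ET.fromstring(xml_text).iter("event"):
        subject = "event:" + event.get("id")
        text = " ".join("".join(event.itertext()).split())
        triples.append((subject, "on", event.get("on")))
        triples.append((subject, "text", text))
        for link in event.iter("link"):
            triples.append((subject, "mentions", link.get("to").lstrip("#")))
    return triples


def pages_by_topic(triples):
    """Process the triples in memory and emit one HTML fragment per topic."""
    facts = defaultdict(dict)
    mentions = defaultdict(list)
    for subject, predicate, obj in triples:
        if predicate == "mentions":
            mentions[obj].append(subject)
        else:
            facts[subject][predicate] = obj
    pages = {}
    for topic, subjects in mentions.items():
        # Sort each topic's events by date so the page reads as a timeline.
        items = sorted((facts[s]["on"], facts[s]["text"]) for s in subjects)
        rows = "".join(f"<li>{escape(d)}: {escape(t)}</li>" for d, t in items)
        pages[topic] = f"<h1>{escape(topic)}</h1><ul>{rows}</ul>"
    return pages


for topic, page in pages_by_topic(to_triples(SOURCE)).items():
    print(topic, page)
```

Each event is written once but appears on the page of every topic it links to, which is where the "remixed" pages come from.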
Hard, then easy
- Hard to get started (lots of events, facts, and links)
- Easy to keep going, if you don’t mind the markup and use a good text editor
Tools used
- emacs, vi, BBEdit
- XSLT 2.0 (Saxon)
- CVS
Why not RDF?
- Not right for redundant content and conditionals
- Easy enough to transform arbitrary structured XML into RDF with XSLT, as needed
- (Or into RSS 1.0, RSS 2.0, Atom, etc.)
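
The slides do this transformation with XSLT. As a stand-in, the following Python sketch shows the same idea by turning one event into N-Triples lines. The base URI `http://example.org/` and the choice of Dublin Core predicates are assumptions for illustration only; nothing here reflects the vocabulary Harpers.org would actually have used.

```python
import xml.etree.ElementTree as ET

BASE = "http://example.org/"  # hypothetical base URI
DC = "http://purl.org/dc/elements/1.1/"  # Dublin Core element namespace

SAMPLE = """<event on="2004-03-12" id="24848">
<link to="#CountryY">Country Y</link> announced that it had cut off relations with <link to="#CountryZ">country Z</link>.
</event>"""


def literal(value):
    """Quote a string as an N-Triples literal, escaping the basic specials."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")
    return f'"{escaped}"'


def event_to_ntriples(xml_text):
    """Map one <event> to a list of N-Triples statements."""
    event = ET.fromstring(xml_text)
    subject = f"<{BASE}event/{event.get('id')}>"
    text = " ".join("".join(event.itertext()).split())
    lines = [
        f"{subject} <{DC}date> {literal(event.get('on'))} .",
        f"{subject} <{DC}description> {literal(text)} .",
    ]
    for link in event.iter("link"):
        target = link.get("to").lstrip("#")
        lines.append(f"{subject} <{DC}subject> <{BASE}topic/{target}> .")
    return lines


print("\n".join(event_to_ntriples(SAMPLE)))
```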
For free…
- From 300 individual pages…
- To 1100 pages of “remixed” content – all unique and relevant
- And Google-friendly
And also for free…
- Semantically relevant in-site advertising, if we want it
- Topic-sorted, reusable content
- Permanent, readable URIs
Do people get it?
- Some do, and others just navigate the site as usual
- Harper’s was fine with the learning curve
- “Odd but useful” – Gawker
Results
- Uptick in traffic and subscription revenues
- Low cost of maintenance
- Ever-increasing database of facts and events – adding one Weekly Review adds value to 50 different pages
- Happy client
Why the SemWeb(ish) framework?
- Leaves plenty of room to grow
  - Web-only content
  - Full text of issues
  - Subscriber services
  - Etc.
- Take advantage of new SemWeb tools
  - Incorporate RDF sources into the taxonomy
  - Anticipate Semantic Web browsers
Next?
Make it pretty
- Redesign
- Hide some of the navigation
- Turn links on and off
Make it scale
- Currently maxes out at about 20-30 megs of content, due to limits of in-memory DOM representation (10-12x XML document size)
- Use a publicly available storage layer (Kowari, Jena, etc)
- Go triple-crazy
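
The memory ceiling follows from the figures on this slide. A back-of-the-envelope Python sketch, using only the 10-12x multiplier and the 20-30 MB content size quoted above (the function name is made up):

```python
def dom_memory_range_mb(xml_mb, low_factor=10, high_factor=12):
    """Estimate in-memory DOM size from XML size, using the 10-12x multiplier."""
    return xml_mb * low_factor, xml_mb * high_factor


for size in (20, 30):
    low, high = dom_memory_range_mb(size)
    print(f"{size} MB of XML -> roughly {low}-{high} MB in memory")
```

So the current content limit corresponds to a DOM of roughly 200 to 360 MB, which is why an external storage layer is the next step.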
Make it easy to query and navigate
- “Show me everything related to George Bush and Iraq.” or
- “Show me everything related to politicians and the Middle East.”
- New navigation
  - ?
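
Both example queries reduce to the same operation once the taxonomy is in play: expand each topic to everything that is `partOf` it, then keep the events that touch every expanded group. The Python sketch below is one possible shape for that; the event ids, topic names, and data structures are hypothetical, and nothing like this existed on the site at the time of the talk.

```python
# Hypothetical data: partOf links and the topics each event mentions.
PART_OF = {"GeorgeBush": "Politicians", "Iraq": "MiddleEast"}
MENTIONS = {
    "e1": {"GeorgeBush", "Iraq"},
    "e2": {"GeorgeBush"},
    "e3": {"Iraq"},
}


def expand(topic):
    """Return the topic plus everything that is transitively partOf it."""
    members = {topic}
    changed = True
    while changed:
        changed = False
        for child, parent in PART_OF.items():
            if parent in members and child not in members:
                members.add(child)
                changed = True
    return members


def related_to_all(*topics):
    """Events that mention at least one member of every requested topic."""
    groups = [expand(topic) for topic in topics]
    return sorted(
        event
        for event, mentioned in MENTIONS.items()
        if all(mentioned & group for group in groups)
    )


print(related_to_all("GeorgeBush", "Iraq"))
print(related_to_all("Politicians", "MiddleEast"))
```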