presentation 16 may morning casestudy 2 xavier jacques jourion
TRANSCRIPT
© 2013 RTBF -‐ DGTE -‐
GEMSThe future is now
Semantics for [audiovisual] dummies
Xavier Jacques-Jourion
FIAT-IFTA Media Management Seminar
Beeld & Geluid, Hilversum, May 16th, 2013
© 2013 RTBF -‐ DGTE -‐
Agenda
• Introduction
• Semantics 101
• Linked Data
• Demonstration
• Conclusion
3
© 2013 RTBF -‐ DGTE -‐
• Public broadcaster
• French-speaking
• 3 TV stations6 Radio stationsInternet portals
• Around 200.000 hours of archives (radio & TV)
• Digitisation in progress (SONUMA)
5
Semantics 101
An introduction to the semantic web
© 2013 RTBF -‐ DGTE -‐ 7
© 2013 RTBF -‐ DGTE -‐
From data to knowledge
8
131573076011/09/2001 - 08:46 EST
First plane hits the World Trade Center North Tower in New York
© 2013 RTBF -‐ DGTE -‐
From data to knowledge
9
Raw data
Information / Content
Knowledge
© 2013 RTBF -‐ DGTE -‐
Data triplets
• Data inside the system is qualified
• Model: subject - predicate - object
• Examples:§ Steve is Peter’s son.
§ Peter is John’s brother.
10
has the colourthe sky blue
Subject ObjectPredicate
© 2013 RTBF -‐ DGTE -‐
From searching to knowing
11
As of September 2011
MusicBrainz
(zitgist)
P20
Turismo de
Zaragoza
yovisto
Yahoo! Geo
Planet
YAGO
World Fact-book
El ViajeroTourism
WordNet (W3C)
WordNet (VUA)
VIVO UF
VIVO Indiana
VIVO Cornell
VIAF
URIBurner
Sussex Reading
Lists
Plymouth Reading
Lists
UniRef
UniProt
UMBEL
UK Post-codes
legislationdata.gov.uk
Uberblic
UB Mann-heim
TWC LOGD
Twarql
transportdata.gov.
uk
Traffic Scotland
theses.fr
Thesau-rus W
totl.net
Tele-graphis
TCMGeneDIT
TaxonConcept
Open Library (Talis)
tags2con delicious
t4gminfo
Swedish Open
Cultural Heritage
Surge Radio
Sudoc
STW
RAMEAU SH
statisticsdata.gov.
uk
St. Andrews Resource
Lists
ECS South-ampton EPrints
SSW Thesaur
us
SmartLink
Slideshare2RDF
semanticweb.org
SemanticTweet
Semantic XBRL
SWDog Food
Source Code Ecosystem Linked Data
US SEC (rdfabout)
Sears
Scotland Geo-
graphy
ScotlandPupils &Exams
Scholaro-meter
WordNet (RKB
Explorer)
Wiki
UN/LOCODE
Ulm
ECS (RKB
Explorer)
Roma
RISKS
RESEX
RAE2001
Pisa
OS
OAI
NSF
New-castle
LAASKISTI
JISC
IRIT
IEEE
IBM
Eurécom
ERA
ePrints dotAC
DEPLOY
DBLP (RKB
Explorer)
Crime Reports
UK
Course-ware
CORDIS (RKB
Explorer)CiteSeer
Budapest
ACM
riese
Revyu
researchdata.gov.
ukRen. Energy Genera-
tors
referencedata.gov.
uk
Recht-spraak.
nl
RDFohloh
Last.FM (rdfize)
RDF Book
Mashup
Rådata nå!
PSH
Product Types
Ontology
ProductDB
PBAC
Poké-pédia
patentsdata.go
v.uk
OxPoints
Ord-nance Survey
Openly Local
Open Library
OpenCyc
Open Corpo-rates
OpenCalais
OpenEI
Open Election
Data Project
OpenData
Thesau-rus
Ontos News Portal
OGOLOD
JanusAMP
Ocean Drilling Codices
New York
Times
NVD
ntnusc
NTU Resource
Lists
Norwe-gian
MeSH
NDL subjects
ndlna
myExperi-ment
Italian Museums
medu-cator
MARC Codes List
Man-chester Reading
Lists
Lotico
Weather Stations
London Gazette
LOIUS
Linked Open Colors
lobidResources
lobidOrgani-sations
LEM
LinkedMDB
LinkedLCCN
LinkedGeoData
LinkedCT
LinkedUser
FeedbackLOV
Linked Open
Numbers
LODE
Eurostat (OntologyCentral)
Linked EDGAR
(OntologyCentral)
Linked Crunch-
base
lingvoj
Lichfield Spen-ding
LIBRIS
Lexvo
LCSH
DBLP (L3S)
Linked Sensor Data (Kno.e.sis)
Klapp-stuhl-club
Good-win
Family
National Radio-activity
JP
Jamendo (DBtune)
Italian public
schools
ISTAT Immi-gration
iServe
IdRef Sudoc
NSZL Catalog
Hellenic PD
Hellenic FBD
PiedmontAccomo-dations
GovTrack
GovWILD
GoogleArt
wrapper
gnoss
GESIS
GeoWordNet
GeoSpecies
GeoNames
GeoLinkedData
GEMET
GTAA
STITCH
SIDER
Project Guten-berg
MediCare
Euro-stat
(FUB)
EURES
DrugBank
Disea-some
DBLP (FU
Berlin)
DailyMed
CORDIS(FUB)
Freebase
flickr wrappr
Fishes of Texas
Finnish Munici-palities
ChEMBL
FanHubz
EventMedia
EUTC Produc-
tions
Eurostat
Europeana
EUNIS
EU Insti-
tutions
ESD stan-dards
EARTh
Enipedia
Popula-tion (En-AKTing)
NHS(En-
AKTing) Mortality(En-
AKTing)
Energy (En-
AKTing)
Crime(En-
AKTing)
CO2 Emission
(En-AKTing)
EEA
SISVU
education.data.g
ov.uk
ECS South-ampton
ECCO-TCP
GND
Didactalia
DDC Deutsche Bio-
graphie
datadcs
MusicBrainz
(DBTune)
Magna-tune
John Peel
(DBTune)
Classical (DB
Tune)
AudioScrobbler (DBTune)
Last.FM artists
(DBTune)
DBTropes
Portu-guese
DBpedia
dbpedia lite
Greek DBpedia
DBpedia
data-open-ac-uk
SMCJournals
Pokedex
Airports
NASA (Data Incu-bator)
MusicBrainz(Data
Incubator)
Moseley Folk
Metoffice Weather Forecasts
Discogs (Data
Incubator)
Climbing
data.gov.uk intervals
Data Gov.ie
databnf.fr
Cornetto
reegle
Chronic-ling
America
Chem2Bio2RDF
Calames
businessdata.gov.
uk
Bricklink
Brazilian Poli-
ticians
BNB
UniSTS
UniPathway
UniParc
Taxonomy
UniProt(Bio2RDF)
SGD
Reactome
PubMedPub
Chem
PRO-SITE
ProDom
Pfam
PDB
OMIMMGI
KEGG Reaction
KEGG Pathway
KEGG Glycan
KEGG Enzyme
KEGG Drug
KEGG Com-pound
InterPro
HomoloGene
HGNC
Gene Ontology
GeneID
Affy-metrix
bible ontology
BibBase
FTS
BBC Wildlife Finder
BBC Program
mes BBC Music
Alpine Ski
Austria
LOCAH
Amster-dam
Museum
AGROVOC
AEMET
US Census (rdfabout)
Media
Geographic
Publications
Government
Cross-domain
Life sciences
User-generated content
© 2013 RTBF -‐ DGTE -‐
Linked Open Data (LOD)
12
But what does it do?
The power of linked data
© 2013 RTBF -‐ DGTE -‐
Do not read this.
Linked Data is about using the Web to connect related data that wasn't previously linked, or using the Web to lower the barriers to linking data currently linked using other methods. More specifically, Wikipedia defines Linked Data as "a term used to describe a recommended best practice for exposing, sharing, and connecting pieces of data, information, and knowledge on the Semantic Web using URIs and RDF."
14
© 2013 RTBF -‐ DGTE -‐ 15
© 2013 RTBF -‐ DGTE -‐ 16
© 2013 RTBF -‐ DGTE -‐ 17
The GEMS project
© 2013 RTBF -‐ DGTE -‐
GEMS
• Goal: build a proof of concept for a semantic-based multimedia browser interface, using raw extracts from our media databases.
• De-mystify the field of semantics.
• Developed with two external partners:
19
© 2013 RTBF -‐ DGTE -‐
Project intentions
20
• Use semantics to assemble the knowledge previously spread across multiple databases.
• Connect to public data sources using LOD.
• Propose a new research tool for journalists and production assistants.
• Cross-media searches.
• Speech-to-text engine.
• Ideally: change the way research is done by giving access to the knowledge harvested from the different media collection(s).
© 2013 RTBF -‐ DGTE -‐
Principle
21
Nétia Tramontane Radio Dalet Tramontane
TV
GEMS
© 2013 RTBF -‐ DGTE -‐
Content
22
• Medias linked to the end of the Belgian government crisis in July 2011§ 4 “JT 19h30”, week starting July 4th, 2011
§ 3 “Invités de Matin Première” (Radio show)
§ 1 “Mise au point” (August 28, 2011)
§ Metadata linked to the above medias
Demo
Conclusion
Questions?
Thank you!
© 2013 RTBF -‐ DGTE -‐