Talk of Europe – Linking European Parliament Proceedings
DESCRIPTION
Talk given at DHBenelux 2014 by Astrid van Aggelen. It presents the Talk of Europe project, which makes the plenary debates of the European Parliament, including all translations, available as linked open data. The talk focuses on the data sources used and how this information is modelled, as well as the possibilities the resulting RDF dataset offers for humanities research.
TRANSCRIPT
Talk of Europe
Linking European Parliament Proceedings
Astrid van Aggelen - VU University Amsterdam
Max Kemman (@MaxJ_K) - Erasmus University Rotterdam
About the project
● Astrid van Aggelen
● Laura Hollink
● Max Kemman
● Martijn Kleppe
● Henri Beunders
● Marnix van Berchum
● Johan Oomen
● Jaap Blom
● Steven Krauwer
● Jan Odijk
● 2014-2015
● Funded by CLARIN-NL & CLARIN ERIC
Primary goals
Plenary sessions 1996 - (present)
● Represent in Resource Description Framework (RDF)
● Publish as linked open data
● Promote applications
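To illustrate what representing a speech in RDF could look like, here is a minimal sketch that writes subject-predicate-object triples as N-Triples lines using only the Python standard library. The namespace `http://example.org/toe/` and the property names `speaker` and `text` are assumptions made for this illustration; they are not the project's actual vocabulary.

```python
# Hypothetical namespace for illustration; not the project's real URIs.
BASE = "http://example.org/toe/"


def uri(local_name):
    """Wrap a local name in angle brackets as a full URI."""
    return "<" + BASE + local_name + ">"


def literal(value):
    """Serialise a plain string literal, escaping backslash, quote and newline."""
    escaped = (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
    )
    return '"' + escaped + '"'


def speech_triples(speech_id, speaker_id, text):
    """Return N-Triples lines describing one speech (assumed properties)."""
    subject = uri("speech/" + speech_id)
    speaker = uri("speaker/" + speaker_id)
    return [
        f"{subject} {uri('vocab/speaker')} {speaker} .",
        f"{subject} {uri('vocab/text')} {literal(text)} .",
    ]


if __name__ == "__main__":
    for line in speech_triples("1", "mep1", "Thank you, Mr President."):
        print(line)
```

Each line is one statement about the speech, which is what allows the data to be linked to other datasets that use URIs for the same entities.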
The European Parliament
session
sessionDay
agendaItem
speech
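The four levels named here can be pictured as nested records. The following is a minimal sketch, assuming that each level contains the next (the slide gives only the names); all field names are hypothetical.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import List


@dataclass
class Speech:
    speaker: str
    text: str


@dataclass
class AgendaItem:
    title: str
    speeches: List[Speech] = field(default_factory=list)


@dataclass
class SessionDay:
    day: date
    agenda_items: List[AgendaItem] = field(default_factory=list)


@dataclass
class Session:
    session_days: List[SessionDay] = field(default_factory=list)


def count_speeches(session: Session) -> int:
    """Count all speeches across every day and agenda item of a session."""
    return sum(
        len(item.speeches)
        for session_day in session.session_days
        for item in session_day.agenda_items
    )
```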
The European Parliament
Plenary sessions are NOT
● strictly role-based
● mirror of law-making
● interactive
Datasets
1. Europarl debate registry: date, debates, speakers, speeches
Who said what in which debate on which day?
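The question of who said what, in which debate, on which day can be read as a filter over flat speech records. This is a minimal sketch under that assumption; the record layout and the sample rows are invented for illustration and do not reflect the registry's actual format.

```python
from datetime import date
from typing import NamedTuple


class SpeechRecord(NamedTuple):
    day: date
    debate: str
    speaker: str
    text: str


def find_speeches(records, speaker=None, debate=None, day=None):
    """Return the records matching every criterion that is not None."""
    return [
        r
        for r in records
        if (speaker is None or r.speaker == speaker)
        and (debate is None or r.debate == debate)
        and (day is None or r.day == day)
    ]


# Invented sample rows, for illustration only.
RECORDS = [
    SpeechRecord(date(2014, 1, 14), "Debate 1", "Speaker A", "First remark."),
    SpeechRecord(date(2014, 1, 14), "Debate 1", "Speaker B", "A reply."),
    SpeechRecord(date(2014, 1, 15), "Debate 2", "Speaker A", "Another remark."),
]
```

For example, `find_speeches(RECORDS, speaker="Speaker A", day=date(2014, 1, 15))` narrows the list to a single speech.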
Data model
Enriching the data (1)
What is a member’s political background?
Datasets
● Europarl debate registry: debates, speakers, speeches
● Europarl MEP database: parties, committee, country, delegation
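Combining the two sources amounts to looking up each speaker in the MEP data. A minimal sketch, assuming both sources share a speaker identifier; the key names and the sample values are hypothetical.

```python
# Invented sample data; keys and values are assumptions for illustration.
SPEECHES = [
    {"speaker_id": "mep1", "text": "First remark."},
    {"speaker_id": "mep2", "text": "A reply."},
    {"speaker_id": "guest", "text": "A statement by a non-member."},
]

MEPS = {
    "mep1": {"party": "Party X", "country": "NL"},
    "mep2": {"party": "Party Y", "country": "DE"},
}


def enrich(speeches, meps):
    """Attach party and country to each speech; None when the speaker is unknown."""
    enriched = []
    for speech in speeches:
        background = meps.get(speech["speaker_id"], {})
        enriched.append(
            {
                **speech,
                "party": background.get("party"),
                "country": background.get("country"),
            }
        )
    return enriched
```

Speakers missing from the MEP data keep their speech but get `None` for party and country, so no speeches are silently dropped.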
Data model
Enriching the data (2)
How to categorise debates?
Enriching the data (2)
● Foreign Affairs
● Human Rights
● Security and Defence
● Development
● International Trade
● Budgets
● Budgetary Control
● Economic and Monetary Affairs
● Employment and Social Affairs
● Environment, Public Health and Food Safety
● Industry, Research and Energy
● Internal Market and Consumer Protection
● Transport and Tourism
● Regional Development
● Agriculture and Rural Development
● Fisheries
● Culture and Education
● Legal Affairs
● Civil Liberties, Justice and Home Affairs
● Constitutional Affairs
● Women's Rights and Gender Equality
● Petitions
Datasets
● Europarl debate registry: debates, speakers, speeches, texts*
● Europarl MEP database: party, committee, country, delegation
● Europarl report registry: committee / theme
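One way to read the report registry's role is that a debate inherits its theme from the committee responsible for the report under discussion. The sketch below assumes exactly that, and assumes a debate refers to at most one report; the identifiers and mappings are invented for illustration.

```python
# Invented mappings; the real registries may be structured differently.
REPORT_COMMITTEE = {
    "report-1": "Fisheries",
    "report-2": "Budgets",
}

DEBATE_REPORT = {
    "debate-a": "report-1",
    "debate-b": "report-2",
    "debate-c": None,  # a debate that is not tied to any report
}


def debate_theme(debate_id):
    """Return the committee of the report a debate discusses, or None."""
    report_id = DEBATE_REPORT.get(debate_id)
    if report_id is None:
        return None
    return REPORT_COMMITTEE.get(report_id)
```

Debates without an associated report stay uncategorised under this assumption, so a complete categorisation would need an additional source or method.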
Data model
Enriching the data (3)
In which role is this person speaking?
Enriching the data (3)
Heuristic processing!
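The slides do not say which heuristics were used. As a purely illustrative sketch, one could match patterns in the label that introduces each speaker; the patterns, role names and sample labels below are all assumptions.

```python
import re

# Hypothetical patterns, checked in order; the first match wins.
ROLE_PATTERNS = [
    ("group spokesperson", re.compile(r"\bon behalf of the\b.*\bgroup\b", re.IGNORECASE)),
    ("rapporteur", re.compile(r"\brapporteur\b", re.IGNORECASE)),
    ("president", re.compile(r"\bpresident\b", re.IGNORECASE)),
]


def guess_role(speaker_label, default="member"):
    """Guess a speaker's role from the text introducing the speech."""
    for role, pattern in ROLE_PATTERNS:
        if pattern.search(speaker_label):
            return role
    return default
```

Because the first matching pattern wins, the order of `ROLE_PATTERNS` matters, and a rule-based guess like this would need checking against real speaker labels.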
Ideas for applications
● Data enrichment: geographical dataset, encyclopedia, voting info (EUR-Lex)
● Applications:
  topic preference of speakers by country / geography / party
  cross-lingual language use
  sentiment analysis
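The topic-preference idea can be sketched as counting speeches per theme within each country. This assumes every speech already carries a country and a theme; the field names and sample rows are invented for illustration.

```python
from collections import Counter, defaultdict

# Invented sample rows, for illustration only.
SPEECHES = [
    {"country": "NL", "theme": "Fisheries"},
    {"country": "NL", "theme": "Fisheries"},
    {"country": "NL", "theme": "Budgets"},
    {"country": "DE", "theme": "Budgets"},
]


def topic_counts_by_country(speeches):
    """Map each country to a Counter of themes its speakers addressed."""
    counts = defaultdict(Counter)
    for speech in speeches:
        counts[speech["country"]][speech["theme"]] += 1
    return counts


def top_topic(counts, country):
    """Return the most frequent theme for a country, or None if it has no speeches."""
    most_common = counts.get(country, Counter()).most_common(1)
    return most_common[0][0] if most_common else None
```

The same grouping works for party or any other speaker attribute by changing the key used in `topic_counts_by_country`.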
Creative camp
● Bring together developers and academic researchers from across Europe
● Promoting inventive use of the EP dataset, exploiting web and natural language processing techniques to add new knowledge and functionality to the dataset
● 6-10 October 2014 at NISV (Hilversum, The Netherlands)
● Submissions due: Friday 20 June
www.talkofeurope.eu/cfp
More info
General info: www.talkofeurope.eu
Creative camp: www.talkofeurope.eu/cfp/
Astrid [email protected]
Max [email protected] / @MaxJ_K