text and data mining - the opportunities and the eu conundrum - why aren’t we doing more?
TRANSCRIPT
FutureTDM Text & Data Mining- the opportunities and the EU conundrum
September 27, 2016, Brussels, Belgium Susan Reilly LIBER, the Association of European Research Libraries
Horizon 2020
Coordination and Support Action
GARRI-3-2014 Scientific Information in the Digital Age: Text and Data Mining (TDM)
Improve the Uptake of Text & Data Mining (TDM) in Europe…
Why are we here?
2
“the discovery by computer of new, previously unknown information, by automatically extracting and relating information from different (…)resources, to reveal otherwise hidden meanings” (Hearst, 1999)
Why TDM? Why now?
3
ICT Availability
Computing
Power
Data
16 trillion gigabytes of data
by 2020 (236% growth)
Doubles every 2 years (Moores
Law, 1965)
Over 80% EU citizens have
internet access (Eurostat 2014)
• Addressing grand challenges such as climate change and global epidemics
• Improving population health, wealth and development
• Creating new jobs and employment
• Exponentially increasing the speed and progress of science through new insights and greater efficiency of research
• Increasing transparency of governments and their actions
• Fostering innovation and collaboration and boosting the impact of open science
• Creating tools for education and research
• Providing new and richer cultural insights
• Speeding economic and social development in all parts of the globe
(The Hague Declaration on Knowledge Discovery)
Potential of TDM
4
5 http://www.flickr.com/photos/21042103@N03/3136430416/ Funny Kitten
TDM is not a homogeneous, self-contained, scientific domain, but rather a diverse and complex set of
methods and technologies deployed in the framework of diverse disciplines and business
activities
The Key Challenge
6
Microcosmic view: TDM and Sentiment Analysis
7
Hedonometer
8
9
http://varianceexplained.org/r/trump-tweets/
Enablers
10
Technical - Automation of analysis - Volume of data
Legal -Data protection -Regional disparities in copyright law
Skills -Need to marry discipline specific with technical -Information literacy
Content -More diverse data sources
▪ Any kind of outcomes e.g. deliverables, research highlights, KCs, events, publications,
presentations, booths, factsheets, …
11
ELABORATE a legal and policy framework for future TDM, define policy priorities, specify a research agenda to foster the spread of TDM in various research fields within the EU
BUILD a Collaborative Knowledge Base and an Open Information Hub combined on a web-based platform including intuitive tools
ANALYSE current application areas and trends in TDM including
statistics and key figures, collect relevant research and industrial
projects and best practices
ASSESS existing studies, legal regulations and policies on TDM within the European
Union
Main Objectives of FutureTDM
INVOLVE all key stakeholders to identify practices,
requirements, and specific challenges in the field of
TDM
INCREASE awareness of TDM to attract new target groups and science domains
Process
12
Join the Community!
FutureTDM Collaborative Knowledge Base and Open Information Hub
Platform: http://futuretdm.eu/
Expert Blog Best Practice Catalogue
Project Database
Tool Selector
Baseline Reports
Comparative Analysis
FutureTDM Guidelines
Leading Projects Area (promoting/knowledge exchange - e.g. OpenMinTeD)
Landscape Overview
13
meet the TEXT & DATA MINING
Community
15
Impact
Remove existing legal, technological and skill barriers that prevent TDM technology from being adopted within the EU.
Increase awareness about the social, economic and scientific benefits of TDM.
Increase the Union’s competitiveness with other high-tech economies (like Japan, South Korea, US) by enhancing TDM adoption.
Foster the adoption of TDM in science and economy.
Lead to Research & Innovation policy that is more relevant and responsive to society
Impact
16
Contents of this presentation are cc-by 4.0 unless otherwise indicated
@skreilly @futureTDM