PHEME http://www.pheme.eu

PHEME Veracity: The 4th Challenge of Big Data

Tomás Pariente [email protected]

@tpariente

Phemes & social media

• Memes are thematic motifs that spread through social media in ways analogous to genetic traits

• We coined the term phemes to add truthfulness and deception to the mix

http://en.wikipedia.org/wiki/Pheme

PHEME focuses on a fourth crucial, but hitherto largely unstudied, challenge: Veracity

Rumour analysis: The Problem

• Rumour analysis is now mostly manual

• Rumours are challenging: some can take hours, days, weeks or even months to die out

• Ill-meaning humans can currently outsmart computers (and humans) and appear genuine

Rumour analysis: The Problem

Example: Mike Brown shot by police in Ferguson

• Different rumours emerge from the same topic

• We don't know whether they are true

• Activity spikes and sometimes comes back (different temporal dynamics)

• We need to understand the overall conversation to see the different points of view and how the rumours develop
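The spikes mentioned above can be made concrete with a small sketch. It flags hours in which the message count for a rumour jumps well above its recent average. The function name `find_spikes`, the window size, the threshold and the hourly counts are all invented for illustration; this is not the method used in PHEME.

```python
from statistics import mean, stdev


def find_spikes(counts, window=6, threshold=3.0):
    """Return indices whose count exceeds the trailing-window mean
    by more than `threshold` standard deviations."""
    spikes = []
    for i in range(window, len(counts)):
        history = counts[i - window:i]
        mu = mean(history)
        # Floor sigma at 1.0 so a nearly flat history does not
        # turn every small wobble into a spike.
        sigma = max(stdev(history), 1.0)
        if counts[i] > mu + threshold * sigma:
            spikes.append(i)
    return spikes


# Invented hourly message counts for one rumour: a first burst,
# a quiet period, then the rumour comes back.
hourly_counts = [5, 6, 4, 5, 7, 5, 60, 40, 12, 6, 5, 4, 6, 5, 5, 55, 9]
print(find_spikes(hourly_counts))  # [6, 15]
```

The two flagged hours correspond to the initial burst and the later resurgence; the hours right after a burst are not flagged because the burst itself inflates the trailing statistics.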

Social Media is Rife with Phemes

From manual to automatic

We are investigating...

• Ontologies for modelling phemes

• Using a priori knowledge (LOD) and reasoning to detect contradictions

• Modelling how phemes spread across media, social networks, and time

• Conversational analysis

• Real-time rumour classification

• Pheme visualisation to support veracity checking: media maps, impact maps, geographical maps…
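As a toy illustration of checking a claim against a priori knowledge, the sketch below stores a few known facts and reports whether a new claim is supported, contradicted or unknown. It is a plain-Python simplification under stated assumptions: the facts, the `capitalOf` predicate and the `check_claim` helper are hypothetical, and every predicate is assumed to allow only one value per subject. Reasoning over real Linked Open Data would rely on an ontology and a triple store rather than a dictionary.

```python
# Hypothetical prior knowledge: (subject, predicate) -> object.
# Assumption: each predicate here is single-valued, so a different
# object for the same subject and predicate is a contradiction.
KNOWN_FACTS = {
    ("Paris", "capitalOf"): "France",
    ("Berlin", "capitalOf"): "Germany",
}


def check_claim(subject, predicate, obj, facts=KNOWN_FACTS):
    """Compare one claimed triple against the known facts."""
    known = facts.get((subject, predicate))
    if known is None:
        return "unknown"  # nothing to compare against
    return "supported" if known == obj else "contradicted"


print(check_claim("Paris", "capitalOf", "France"))   # supported
print(check_claim("Paris", "capitalOf", "Germany"))  # contradicted
print(check_claim("Madrid", "capitalOf", "Spain"))   # unknown
```

Most rumours will fall into the "unknown" case, which is why the slide lists this alongside conversational analysis and classification rather than on its own.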

PHEME Veracity Intelligence Framework

[Framework diagram. Labels recovered from the figure:]

• PatientsLikeMe

• Cross-Media Content Linking, Spatio-Temporal Grounding

• Multilingual LOD-Based IE and Opinion Mining

• Rumour Detection and Veracity Classification

• Use cases: Veracity Intelligence in Patient Care; Digital Journalism

• Linked Open Data, Rumour Ontologies & Reasoning (GraphDB)

• Historical Data Archive

• PHEME Visual Analytics Dashboard

• Social Context Models: Trust, Authority, Implicit Networks

• Technology Outcome: Open Source Computational Framework

• ...

PHEME Big Data Architecture for veracity analysis

[Architecture diagram. Labels recovered from the figure:]

• IT Big Data Layer / IT Value Chain: Physical Infrastructure and Virtualization; Storage Infrastructure; Processing; Knowledge Base; Stream Processing; Batch Processing; Messaging / Comms; Resource Management

• Veracity and Language Value Chain / Data Value Chain: Data Collection; Rumour Classification; Usage; Curation

• System Workflow Orchestration

• SW; LT Processing & Analytics: Lang Detection; Event Detection; NLP Processing; Annotation & Training; Cross-media linking; Cross-lingual analysis

• Storage: Raw data Repository; OntoText GraphDB™

• Data flows: Multilingual Data; Multilingual Data Social Media

• End Users: Pheme Dashboard, Journalist Dashboard

Application areas

• Open-source social intelligence tools for data journalism
    • Involves journalists from SwissInfo.ch, the Guardian, New York Times, and other media

• Improving healthcare
    • What health-related rumours are discussed in patient-clinician consultations
    • Preventative medical advice, e.g. warn patients not to trust certain rumours when researching their disease online

PHEME Dashboard

[Dashboard screenshot. Surviving labels: "And dynamics Over Time/Location", "vs replies"]

Journalism Dashboard Prototype

Acknowledgement

The PHEME research project has received funding from the European Union's Seventh Framework Programme for research, technological development and demonstration under grant agreement No. 611233.

This document does not represent the opinion of the European Community, and the European Community is not responsible for any use that might be made of its content.

Thanks!