semantic user profiling and personalised filtering of the twitter stream

37
User Profiling on the Social Semantic Web Fabrizio Orlandi, DERI (NUI Galway, Ireland) Kno.e.sis WSU Dayton, OH 9 Feb 2012

Upload: fabrizio-orlandi

Post on 28-Aug-2014

1.937 views

Category:

Technology


0 download

DESCRIPTION

Presentation at Kno.e.sis - Feb 2012. The presentation describe my current PhD research at DERI and the work done in 5 weeks during a collaboration in Kno.e.sis with Pavan Kapanipathi, Prof. Amit Sheth, Prof. T. K. Prasad and the rest of the group. - video: http://youtu.be/MmF5HxIVUwA

TRANSCRIPT

Page 1: Semantic user profiling and Personalised filtering of the Twitter stream

User Profiling on the Social

Semantic Web

Fabrizio Orlandi, DERI (NUI Galway, Ireland)

Kno.e.sis – WSU Dayton, OH – 9 Feb 2012

Page 2: Semantic user profiling and Personalised filtering of the Twitter stream

User Profiling

“A user profile is a representation of information about an individual user

that is essential for the (intelligent) application we are considering” [1]

[1] S. Schiaffino, A. Amandi. 2009.

Contents of user profiles:

user interests;

the user’s knowledge, background and skills;

user behavior;

the user’s interaction preferences;

the user’s individual characteristics;

and the user’s context.

Page 3: Semantic user profiling and Personalised filtering of the Twitter stream

• How to collect and interlink user information from social media

websites to build enhanced and comprehensive user profiles?

• How to manage and merge user models from different

applications and social sites in an interoperable way?

• How to leverage provenance information and trust measures on

the Web of Data to improve Web personalisation?

Research Questions

Page 4: Semantic user profiling and Personalised filtering of the Twitter stream

Challenges – 1

• Information on the Social Web is stored in isolated data silos on

heterogeneous and disconnected social media websites

http://www.w3.org

Page 6: Semantic user profiling and Personalised filtering of the Twitter stream

Challenges – 3

• Lack of provenance on the Web of Data: datasets on the Social Web

are often the result of data mashups or collaborative user activities

Page 7: Semantic user profiling and Personalised filtering of the Twitter stream

Challenges – 4

• User profiles should be represented in an interoperable way in order

to exchange information across different user adaptive systems

[U. Bojārs, A. Passant, J. Breslin]

Page 8: Semantic user profiling and Personalised filtering of the Twitter stream

Outline

The user profiling data process:

1. from user activities on heterogeneous social media websites,

2. to their provenance representation,

3. to the data aggregation and analysis

12

3

Page 9: Semantic user profiling and Personalised filtering of the Twitter stream

So far…

State of the art analysis

Modelling the structure of wikis

Enabling semantic search on heterogeneous wiki systems

Provenance of data in wikis

Representation and extraction of provenance in Wikipedia and DBpedia

Privacy Aware and Faceted User-Profile Management

Personalized Filtering of the Twitter Stream…

Page 10: Semantic user profiling and Personalised filtering of the Twitter stream

Semantic Personalization of Social

Web Streams

Page 12: Semantic user profiling and Personalised filtering of the Twitter stream
Page 13: Semantic user profiling and Personalised filtering of the Twitter stream
Page 14: Semantic user profiling and Personalised filtering of the Twitter stream
Page 15: Semantic user profiling and Personalised filtering of the Twitter stream

Motivation

• How many people should I follow?

• Am I receiving latest/complete information?

• How can I quickly tell the system what are my interests?

Page 16: Semantic user profiling and Personalised filtering of the Twitter stream

Approach -- Overview

Football

Apple

User

Profiles

Filter

BroadcastThe new

iPhone has a

3.5-inch screen,

released today

Page 17: Semantic user profiling and Personalised filtering of the Twitter stream

Semantic Filter

Semantic

Hub

Profile Generator

RDF

A

N

N

O

T

A

T

O

R

RDF

RSS

Store and

Query Topics

Notify Update

Fetch Updates

Get Interested

Subscribers

Create Profile

Store FOAF

The new

iPhone has a 3.5-

inch screen,

released today

Annotate: iPhone

?user foaf:interest

dbPedia:iPhone

Union

?user foaf:interest

Category:Apple

Get

Subscribers

based on

preference

Push Updates

to Interested

Users

Update RSS

Page 18: Semantic user profiling and Personalised filtering of the Twitter stream

User Profiling

User Profile

Interlink social websites

Merge and model user data

Personalise users’ experience

using their profile

Integration&

User Modelling

Recommendations

Search Personalisation

Adaptive Systems

Page 19: Semantic user profiling and Personalised filtering of the Twitter stream

User Profiling

Page 20: Semantic user profiling and Personalised filtering of the Twitter stream

Profile Generator

• Data Extraction

– Twitter, Facebook

– Example: Tweets, FB Likes, posts, videos, etc.

• Profile Generation

– Interests extracted from collected data

• Entity spotting (user generated data)

• Explicit interests specified by user (Facebook likes etc.)

– Weighted Interests w/ DBpedia resources/categories

– FOAF profile

Page 21: Semantic user profiling and Personalised filtering of the Twitter stream

Semantic Filter

Semantic

Hub

Profile Generator

RDF

A

N

N

O

T

A

T

O

R

RDF

RSS

Store and

Query Topics

Notify Update

Fetch Updates

Get Interested Subscribers

Create Profile

Store FOAFUpdate RSS

Semantic Filter

Page 22: Semantic user profiling and Personalised filtering of the Twitter stream

Semantic Filter

• Twitter Storm:

– Distributed realtime computation system

• Microblog Metadata

– Twitter provides metadata

• Author, date, location etc..

– Metadata Extracted

• DBpedia Entities, URLs

• Generate SPARQL Query representing interested Users

– Retrieved at Semantic Hub

Page 23: Semantic user profiling and Personalised filtering of the Twitter stream

Semantic Filter

Semantic

Hub

Profile Generator

RDF

A

N

N

O

T

A

T

O

R

RDF

RSS

Store and

Query Topics

Notify Update

Fetch Updates

Get Interested Subscribers

Create Profile

Store FOAFUpdate RSS

Semantic Hub

Page 24: Semantic user profiling and Personalised filtering of the Twitter stream

Semantic Hub

• RSS Extension

– Preference – to include the SPARQL queries

• Push content

– FOAF profiles of the subscribers are matched with the

preference

– Interested subscribers receive the content

Page 25: Semantic user profiling and Personalised filtering of the Twitter stream

DERI’s Unit for Social Software

(USS)

Unit leader: John Breslin

Page 26: Semantic user profiling and Personalised filtering of the Twitter stream

Overview of research activities

• Research team at DERI

– Two postdocs (plus one starting on Monday)

• Alex Passant (10%), Maciej Dabrowski, Bahareh Heravi

– Nine PhD students

• Six supervised by John, two by Alex, one by Michael H

• Various interdisciplinary collaborations

– Exercise, e-government, political science, journalism

Page 27: Semantic user profiling and Personalised filtering of the Twitter stream

Current students

David Crowley

• Citizen sensors

– Funded by College of

Engineering and Informatics

• Attaching data from

sensors to social web

content using semantic

technologies

Ted Vickey

• Exercise adherence via

social networks

– Funded by American Council

on Exercise and IRCSET

• Developing a classification

for fitness tweets to see if

sharing exercise regimes

can encourage others

Page 28: Semantic user profiling and Personalised filtering of the Twitter stream

Current students

Antonio Aguilar (EEE)

• Heart rate variability

analysis

– Funded by Assisted Ambient

Living eCAALYX EU project

• Developing methods to

help predict sudden

cardiac death using non-

linear algorithms

Fabrizio Orlandi

• User profiling on the Social

Semantic Web

– Funded by Cisco Foundation

and IRCSET

• Consolidating user profiles

from various platforms and

deriving interests from

amalgamation

Page 29: Semantic user profiling and Personalised filtering of the Twitter stream

Current students

Lukasz Porwol

• e-Participation via social

media

– Funded by Science

Foundation Ireland

• Leveraging popular

networks for e-government

instead of standalone

platforms

Owen Sacco

• Trust, accountability and

privacy via Linked Data

– Funded by Cisco Foundation

and IRCSET

• Developing privacy

preference managers for

the Semantic Web

• Collaboration with US

Government

Page 30: Semantic user profiling and Personalised filtering of the Twitter stream

Current students

Marie Boran

• Connecting data journalists

with linked scientific data

– Funded by Science

Foundation Ireland

• Bridging the gap between

experimental data from

scientists and the

mainstream media

Jodi Schneider

• Argumentative discussions

– Funded by Science

Foundation Ireland

• Representing, classifying

and visualizing

argumentative discussions

on the Web

Page 31: Semantic user profiling and Personalised filtering of the Twitter stream

Current students

Myriam Leggieri

• Linked sensor data

– Funded by SPITFIRE

• Connecting sensor data

with explanatory facts from

the Linked Open Data

Cloud

Page 32: Semantic user profiling and Personalised filtering of the Twitter stream

Some past postgraduate students

• Sheila Kinsella

– ECE graduate, now engineer with Datahug

• Haklae Kim

– Now senior engineer with Samsung

• Uldis Bojars

– Now with the National Library of Latvia

• Gerard Cahill

– BSc IT graduate, now developer with Starlight

Page 33: Semantic user profiling and Personalised filtering of the Twitter stream

DERI – House

eBusiness

Financial Services

Health Care

Life Sciences

eLearning

Green &

Sustainable ITeGovernment

Stream 3: Semantic

Information Mining

Information

Mining

and Retrieval

Natural Language

Processing

Stream 1:

Semantic Search

Reasoning and

Querying

Data Intensive

Infrastructure

Stream 2: Semantic

Collaboration

Semantic Colla-

borative Software

Social Software

Stream 4: Semantic

Middleware

Service Oriented

Architecture

Sensor

Middleware

Linked

Data

Research

Centre

DERI Applied

ResearchCommercialisation

Page 34: Semantic user profiling and Personalised filtering of the Twitter stream

Thanks!

• Contacts:

- [email protected]

- Twitter: BadmotorF

Page 35: Semantic user profiling and Personalised filtering of the Twitter stream
Page 36: Semantic user profiling and Personalised filtering of the Twitter stream

Some additional stats…

• On average for:

– 200 Tweets

– 200 Facebook posts, and items.

• ~106 interests - DBpedia instances

• ~720 interests - DBpedia categories (~6.8 times more)

• Estimated average Recall: 0.74

• 22 users

Page 37: Semantic user profiling and Personalised filtering of the Twitter stream