new opportunities for connected data - emil eifrem @ graphconnect boston + chicago 2013

58
Neo Technology, Inc Confidential GraphConnect 2013 graphs are everywhere Emil Eifrem @emileifrem #graphconnect

Upload: neo4j-the-open-source-graph-database

Post on 26-Jan-2015

102 views

Category:

Technology


0 download

DESCRIPTION

Today’s complex data is not only big, but also semi-structured and densely connected. In this session we’ll look at how size, structure and connectedness have converged to transform the data landscape. We’ll then go on to look at some of the new opportunities for creating end-user value that have emerged in a world of connected data, illustrated with practical examples drawn from the telecommunications, social media and logistics sectors.

TRANSCRIPT

Page 1: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

GraphConnect 2013graphs are everywhere

Emil Eifrem@emileifrem

#graphconnect

Page 2: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Page 3: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

“Five richest big data sources on the Web include social graph, intent graph,

consumption graph, interest graph and mobile graph.”

http://www.forbes.com/sites/ericsavitz/2012/10/23/gartner-top-10-strategic-technology-trends-for-2013/

- Gartner: “Top 10 Strategic Trends For 2013,” Oct 2012

Page 4: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

“[I]t is arguable that graph databases will have a bigger impact on the database landscape than

Hadoop or its competitors.”

- Bloor Research, May 2012

http://www.bloorresearch.com/blog/IM-Blog/2012/5/graph-databases-nosql.html

Page 5: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

“Graph analysis is the true killer app for Big Data.”

- Forrester Research, Dec 2011

http://blogs.forrester.com/james_kobielus/11-12-19-the_year_ahead_in_big_data_big_cool_new_stuff_looms_large

Page 6: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

http://gigaom.com/2013/05/14/were-witnessing-the-rise-of-the-graph-in-big-data/

GigaOm, May 2013

Page 7: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

FastCompany, March 2013http://www.fastcompany.com/magazine/174/exposing-yahoos-strategy-marissa-mayer

Page 8: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

FastCompany, March 2013http://www.fastcompany.com/magazine/174/exposing-yahoos-strategy-marissa-mayer

Page 9: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Ian Robinson, Jim Webber & Emil Eifrem

Graph Databases

h

Compliments

of Neo Technology

Page 10: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

?

Page 11: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Page 12: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Page 13: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Page 14: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

NASDAQ:ORCLMarket Cap Today: ~$150B

Page 15: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

More recent examples?

Page 16: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

More recent examples?

Page 17: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

More recent examples?

Page 18: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

More recent examples?

Page 19: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Social Graph

More recent examples?

Page 20: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Social Graph

Link GraphKnowledge Graph

More recent examples?

Page 21: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Social Graph

Interest Graph

Link GraphKnowledge Graph

More recent examples?

Page 22: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Page 23: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Connected Data.

Page 24: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Page 25: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Wow.But it looks like graph = social.

Right?

Page 26: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Core Industries & Use Cases:

SoftwareFinancial Services

Telecomm-unications

Network & Data Center Management

MDM

Social

Geo

Early Adopter Segments

Page 27: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Neo4j Adoption Snapshot

Core Industries & Use Cases:

SoftwareFinancial Services

Telecomm-unications

Network & Data Center Management

MDM

Social

Geo

Select Commercial Customers (Community Users Not Included)

Page 28: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Core Industries & Use Cases:

Web / ISVFinance & Insurance

Telecomm-unications

Network & Data Center Management

MDM

Social

Geo

Neo4j Adoption SnapshotSelect Commercial Customers (Community Users Not Included)

Page 29: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Neo4j Adoption SnapshotSelect Commercial Customers (Community Users Not Included)

Core Industries & Use Cases: Software

Financial Services

Telecommunications

Web Social, HR & Recruiting

Health Care & Life Sciences

Media & Publishing

Energy, Services, Automotive, Gov’t, Logistics, Education,

Gaming, Other

Network & Data Center

Management

MDM / System of Record

Social

Geo

Identity & Access Mgmt

Content Management

Recommend-ations

BI, CRM, Impact Analysis, Fraud

Detection, Resource Optimization, etc.

Accenture

Page 30: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

So what’s this product they’re using?

Page 31: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

LIVES WITH

LOVES

OWNSDRIVES

LOVESname: “James”

age: 32twitter: “@spam”

name: “Mary”age: 35

brand: “Volvo”model: “V70”

property type: “car”

Graph data model

Page 32: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Image credits: Tobias Ivarsson

“Whiteboard friendliness”

Page 33: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

thobe

Wardrobe Strength

Joe project blog

Hello Joe

Neo4j performance analysis

Modularizing Jython

Image credits: Tobias Ivarsson

“Whiteboard friendliness”

Page 34: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Database # persons query time

MySQL

Neo4j

Neo4j

๏a sample social graph

•with ~1,000 persons

๏average 50 friends per person

๏pathExists(a,b) limited to depth 4

๏caches warmed up to eliminate disk I/O

Graph db performance

Page 35: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Database # persons query time

MySQL

Neo4j

Neo4j

1,000

๏a sample social graph

•with ~1,000 persons

๏average 50 friends per person

๏pathExists(a,b) limited to depth 4

๏caches warmed up to eliminate disk I/O

Graph db performance

Page 36: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Database # persons query time

MySQL

Neo4j

Neo4j

1,000 2,000 ms

๏a sample social graph

•with ~1,000 persons

๏average 50 friends per person

๏pathExists(a,b) limited to depth 4

๏caches warmed up to eliminate disk I/O

Graph db performance

Page 37: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Database # persons query time

MySQL

Neo4j

Neo4j

1,000 2,000 ms

1,000 2 ms

๏a sample social graph

•with ~1,000 persons

๏average 50 friends per person

๏pathExists(a,b) limited to depth 4

๏caches warmed up to eliminate disk I/O

Graph db performance

Page 38: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Database # persons query time

MySQL

Neo4j

Neo4j

1,000 2,000 ms

1,000 2 ms

1,000,000

๏a sample social graph

•with ~1,000 persons

๏average 50 friends per person

๏pathExists(a,b) limited to depth 4

๏caches warmed up to eliminate disk I/O

Graph db performance

Page 39: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Database # persons query time

MySQL

Neo4j

Neo4j

1,000 2,000 ms

1,000 2 ms

1,000,000 2 ms

๏a sample social graph

•with ~1,000 persons

๏average 50 friends per person

๏pathExists(a,b) limited to depth 4

๏caches warmed up to eliminate disk I/O

Graph db performance

Page 40: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Drivers of Graph Adoption

“Why did you use a graph database for your application?”

Page 41: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Drivers of Graph Adoption

Naturally Graphy Data

Complex Graph Queries

Query Performance

0% 20% 40% 60% 80%

Page 42: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Great product.

What’s up next?

Page 43: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Top 12 Month Product Themes

Page 44: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Top 12 Month Product Themes

Ease of Use

Page 45: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Top 12 Month Product Themes

Big DataEase of Use

Page 46: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Top 12 Month Product Themes

Big Data CloudEase of Use

Page 47: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

2.0

Q2 Q3 Q4Q12013

Theme: Ease of UseNeo4j 2.0

Page 48: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

2.0

• Labels. First expansion of the Property Graph model since its inception. Nodes can have one or more labels. Significantly improve power & ease of use.

Q2 Q3 Q4Q12013

Theme: Ease of UseNeo4j 2.0

Page 49: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

2.0

• Labels. First expansion of the Property Graph model since its inception. Nodes can have one or more labels. Significantly improve power & ease of use.

• Index automation. Improve indexing ease of use, leveraging new “Label” construct Enable indexing operations through Cypher

Q2 Q3 Q4Q12013

Theme: Ease of UseNeo4j 2.0

Page 50: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

2.0

• Labels. First expansion of the Property Graph model since its inception. Nodes can have one or more labels. Significantly improve power & ease of use.

• Index automation. Improve indexing ease of use, leveraging new “Label” construct Enable indexing operations through Cypher

• REST Improvements. Improved Transactionality & Robustness

Q2 Q3 Q4Q12013

Theme: Ease of UseNeo4j 2.0

Page 51: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

2.0

• Labels. First expansion of the Property Graph model since its inception. Nodes can have one or more labels. Significantly improve power & ease of use.

• Index automation. Improve indexing ease of use, leveraging new “Label” construct Enable indexing operations through Cypher

• REST Improvements. Improved Transactionality & Robustness

• Cypher performance improvements.

Q2 Q3 Q4Q12013

Theme: Ease of UseNeo4j 2.0

Page 52: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Theme: Big Data2.1

Q2 Q3 Q4Q12013

Neo4j 2.1

2014

Page 53: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Theme: Big Data2.1

• Increase upper size limits of single-machine graph

Q2 Q3 Q4Q12013

Neo4j 2.1

2014

Page 54: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Theme: Big Data2.1

• Increase upper size limits of single-machine graph • Performance optimizations targeting densely-

connected nodes

Q2 Q3 Q4Q12013

Neo4j 2.1

2014

Page 55: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Theme: Big Data2.1

• Increase upper size limits of single-machine graph • Performance optimizations targeting densely-

connected nodes• Bulk data import improvements: easer & faster to

bring large amounts of data into Neo4j

Q2 Q3 Q4Q12013

Neo4j 2.1

2014

Page 56: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Theme: Big Data2.1

• Increase upper size limits of single-machine graph • Performance optimizations targeting densely-

connected nodes• Bulk data import improvements: easer & faster to

bring large amounts of data into Neo4j• Cypher performance improvements

Q2 Q3 Q4Q12013

Neo4j 2.1

2014

Page 57: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

GraphConnect BostonJune 10-11, 2013 | Catalyst Restaurantgraphs are everywhere

Page 58: New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Chicago 2013

Neo Technology, Inc Confidential

Your Mission:

Connect.

GraphConnect BostonJune 10-11, 2013 | Catalyst Restaurantgraphs are everywhere