intro to graphs for hr analytics

29
An Intro to Graphs for HR Analy)cs [email protected]

Upload: rik-van-bruggen

Post on 02-Dec-2014

3.292 views

Category:

Data & Analytics


2 download

DESCRIPTION

This is the talk that I did at the Belgian Data Science meetup on how graph databases could help with HR Analytics.

TRANSCRIPT

Page 1: Intro to graphs for HR analytics

An  Intro  to  Graphs  for  

HR  Analy)cs  [email protected]  

Page 2: Intro to graphs for HR analytics

Agenda  •  About  Graphs  •  About  Graph  Databases  •  Why  Graph  Databases  ma=er  for  HR  Analy@cs  –  Short  demonstra@on  

•  Case  Studies  •  Q&A  

Page 3: Intro to graphs for HR analytics

Introduc@on:  about  Graphs  

Page 4: Intro to graphs for HR analytics
Page 5: Intro to graphs for HR analytics

Meet ���Leonhard Euler •  Swiss  mathema@cian  •  Inventor  of  Graph  Theory  (1736)  

Page 6: Intro to graphs for HR analytics

Königsberg  (Prussia)  -­‐  1736  

Page 7: Intro to graphs for HR analytics

A

B

D

C

Page 8: Intro to graphs for HR analytics

A

B

D

C

1"

2"3"

4"

7"6"

5"

Page 9: Intro to graphs for HR analytics

About  Graph  Databases  

Page 10: Intro to graphs for HR analytics

So  what  is  a  graph  database?  

•  OLTP  database  – “end-­‐user”  transac@ons  

•  Model,  store,  manage  data  as  a  graph  

Page 11: Intro to graphs for HR analytics

What  is  a  graph?  Node  

Rela@onship  

Page 12: Intro to graphs for HR analytics

Contrast  with  Rela@onal  

Graphs are often referred to as “Whiteboard Friendly”. The data model reflects the way a domain expert would naturally

draw their data on a whiteboard “The schema is the data”. Schema flexibility allows the system

to change in response to a changing environment

Page 13: Intro to graphs for HR analytics

What  are  graphs  good  for?  

Complex  Querying  

Page 14: Intro to graphs for HR analytics

Examples  of  complex  queries?  1.  Semi-­‐structure  in  datasets  

14

– Normaliza@on  introduces  complexity  

– Forces  developers  to  develop  all  kinds  of  logic  to  deal  with  this  variability  in  their  applica@on  logic  

Page 15: Intro to graphs for HR analytics

Examples  of  complex  queries:  2.  Connectedness  in  data  

Lots  of  normalized  rela@onships  between  the  different  en@@es,  forces  developers  to  do  •  Deep  joins  •  Recursive  joins  •  Pathfinding  opera@ons  •  “open-­‐ended”  queries  

Page 16: Intro to graphs for HR analytics

Examples  of  Connectedness  

Page 17: Intro to graphs for HR analytics

Graph  Querying  

Page 18: Intro to graphs for HR analytics

Querying  a  Graph  

•  “Graph  local”  vs  “Graph  global”  – Contextualized  “ego-­‐centric”  queries  

•  “Parachute”  into  graph  – Start  node(s)  

•  Found  through  Index  lookups  

•  Crawl  the  surrounding  graph  – 2  million+  joins  per  second  

•  No  more  Index  lookups:    Index-­‐free  adjacency  

Page 19: Intro to graphs for HR analytics

Queries:  Pa=ern  Matching  

Pa=ern  

Page 20: Intro to graphs for HR analytics

Short  demo:  HR  Analy@cs  

Page 21: Intro to graphs for HR analytics

Domains  that  jump  out  

•  The  REAL  Enterprise  Social  Network  –  Be=er  understanding  of  the  “coffee-­‐room”  network  

•  Recruitment  – Micro-­‐targe@ng  –  Social  integra@on  

•  Competency  management  –  Smart  matching  –  Taxonomies  –  Op@miza@on  algorithms  

21

Page 22: Intro to graphs for HR analytics

It  always  starts  with  a  MODEL  

Page 23: Intro to graphs for HR analytics

Then  for  some  Queries  

•  Network  Analy@cs  – Degree  Centrality  – Betweenness  Centrality  – PageRank  

•  Recommenda@ons  – Triadic  closures  – Complex  pa=ern  matching  

23

Page 24: Intro to graphs for HR analytics

Use  Cases  (neo4j.com/use-­‐cases)  

Page 25: Intro to graphs for HR analytics

Customers  (neo4j.com/customers)  

Page 26: Intro to graphs for HR analytics

Graph  Gists  (h=p://gist.neo4j.org/)  

Page 27: Intro to graphs for HR analytics

Neo Technology, Inc Confidential

Neo4j License Overview

Developer!Seats!

($6K*/Developer/Year)

Test!Instances!

($6K/Instance/Year)

Production!Instances!

(Bundle / Core Pricing)

Instances whose purpose is to ensure that the software accessing

Neo4j is meeting specification.!!

(e.g. System Test, Integration Test, UAT, Performance Test, Staging)

Instances that store and process data in a way that benefits and

advances an organization’s goals.!!

May be accessed by applications and/or end users

Includes access by programmers to licensed test instances, and

private instances on the programmer’s personal machine for the sole purpose of writing, debugging, or testing software

designed to access Neo4j

*Or otherwise, depending on the Bundle, and negotiation

Neo4j  versions  /  licenses  

Personal  <  Startup  /  Departmental  <  Enterprise  deployment  models  Open  source  &  Commercial  license  terms  available  

Specific  OEM  models  

Page 28: Intro to graphs for HR analytics

Future  trainings  &  events!  

28

Page 29: Intro to graphs for HR analytics

Neo  Technology  www.neotechnology.com    Neo4j  www.neo4j.org      [email protected]  or  +32  478  686800  

Q&A,  Conclusion,  Next  Steps