Transcript
Page 1: Beyond ratings and followers (RecSys 2012)

Anmol BhasinSr. Manager

Analytics Engineeringwww.linkedin.com

Beyond Ratings

& Followers

Page 2: Beyond ratings and followers (RecSys 2012)

Linkedin

Page 3: Beyond ratings and followers (RecSys 2012)

In a social (professional) networking context,

its about building a..

Recommender Ecosystem

Page 4: Beyond ratings and followers (RecSys 2012)

4

50%

The answer is

Page 5: Beyond ratings and followers (RecSys 2012)
Page 6: Beyond ratings and followers (RecSys 2012)
Page 7: Beyond ratings and followers (RecSys 2012)
Page 8: Beyond ratings and followers (RecSys 2012)
Page 9: Beyond ratings and followers (RecSys 2012)
Page 10: Beyond ratings and followers (RecSys 2012)
Page 11: Beyond ratings and followers (RecSys 2012)

Similar Profiles

Events You May Be Interested In

News

The Recommender Ecosystem

11

Network updates

Connections

Page 12: Beyond ratings and followers (RecSys 2012)

Frameworks are revolutions

evolutions

Page 13: Beyond ratings and followers (RecSys 2012)

LinkedIn Recommendation Engine

BehaviorAnalysis

CollaborativeFiltering Popularity

Sim

ilar

Pro

files

Ref

erra

l C

ente

r

Tale

ntM

atch

Peo

ple

Bro

wse

Map

People

Recommen-dation Types

Shared, Dynamic,Unified

CoreService

Products

RecommendationEntities

Jobs

Bro

wse

M

ap

Sim

ilar J

obs

Jobs

Jobs

You

M

ay b

e in

tere

sted

in

… AdsCompaniesSearchesNewsEvents… and more

GY

ML

Gro

ups

Bro

wse

Map

Groups

Sim

ilar G

roup

s

User Feedback

API

(R-T) Feature Extraction, Entity Resolution & Enrichment

(R-T) matching computations

A/B

Offline data munging (hadoop)

Page 14: Beyond ratings and followers (RecSys 2012)

different strokes for different

folks

Page 15: Beyond ratings and followers (RecSys 2012)

Cloning

Page 16: Beyond ratings and followers (RecSys 2012)

Possible Approaches

Naïve K Nearest Neighbor solution Complexity is

Clustering Latent Factor Models like PLSI or LDA Hierarchical Agglomerative clustering

Self Organizing Maps

Item based Collaborative Filtering Find pairs of Users viewed in the same session

Page 17: Beyond ratings and followers (RecSys 2012)

Scale 175+ M profiles

Dimensionality ~2M companies ~200K schools ~147 industries ~200 countries ~25K titles ~40K Skills ~200 Job Functions

Similar means different things to different people Similar Behavior doesn’t mean you can replace me at my job Accuracy vs Relevance (me & my boss.. )

Realtime.. It’s a problem of accuracy.. Not recall..

Challenges

Page 18: Beyond ratings and followers (RecSys 2012)

Approach

Rank

FILTER

Cluster

Focus attention only on pairs likely to be similar

Filter out the possibly dis-similar pairs

Run Similarity Functions on filtered in pairs

Page 19: Beyond ratings and followers (RecSys 2012)

LSH function family for Cosine Distance

Locality Sensitive Hashing

Page 20: Beyond ratings and followers (RecSys 2012)

Approach

Rank

FILTER

Cluster

Focus attention only on pairs likely to be similar

Filter out the possibly dis-similar pairs

Run Similarity Functions on filtered in pairs

Page 21: Beyond ratings and followers (RecSys 2012)

Similarity Functions

Different bands of attributes Boolean, Jaccard or Cosine Similarities across attribute

pairs.

• Logisitic Regression with Elastic Penalty

Learn model params on a set of hand labeled data points

Predicted value interpreted as score

Page 22: Beyond ratings and followers (RecSys 2012)

Impedance Mismatch

Page 23: Beyond ratings and followers (RecSys 2012)

Ad Ranking Given

Objective

Goal: Increase revenue Respect daily budgets of Advertisers Good user experience

Page 24: Beyond ratings and followers (RecSys 2012)

Campaign creation

Page 25: Beyond ratings and followers (RecSys 2012)

Virtual Profiling

Targeted Segment Population

Title : Eng MgrCompany : LinkedInLocation : CA,USA Skills : ML, RecSys

Title : Vice PresidentCompany : TwitterLocation : CA,USA Skills : DM, ML, RecSys ……………….

Page 26: Beyond ratings and followers (RecSys 2012)

Virtual Profiling

Title : Eng MgrCompany : LinkedInLocation : CA,USA Skills : ML, RecSys

Title : Sr. SECompany : GoogleLocation : PA, USASkills : ML, DMTitle : Eng DirCompany : LinkedinLocation : PA, USASkills : ML, Stats, DM

Title : Sr. SE<1>, Eng Mgr<1>, Eng Dir<1>

Company : LinkedIn<2>,

Google<1>,

Location : CA,USA <2>, PA, USA<1>

Skills : ML<2>,

RecSys<1>, Stats<1>, DM<1>

Clicker Feature Distribution

Page 27: Beyond ratings and followers (RecSys 2012)

Virtual Profiling

Information Gain

Pick Top K overrepresented features from the clicker distribution vs the target

segment

A representative projection of the item in the member feature space

Page 28: Beyond ratings and followers (RecSys 2012)

CTR Prediction – CF Similarity

RankerMEMBER FEATURES

Score to pCTR correction

L2 regularized Logistic Regression (Liblinear, VW, Mahout, ADMM)

For new ad creatives back-off to the advertiser / ad category nodes till they reach critical impression/click volume (explore/exploit)

AD CREATIVE VIRTUAL PROFILE

Creative features

Page 29: Beyond ratings and followers (RecSys 2012)

the magic is in the models

features

Page 30: Beyond ratings and followers (RecSys 2012)

30

Feature Engineering – Entity Resolution

Companies

Huge impact on the business and UE Ad targeting TalentMatch Referrals

‘IBM’ has 8000+ variations- ibm – ireland- ibm research- T J Watson Labs- International Bus. Machines- Deep Blue

K-Ambiguous

Asonam’11, KDD’11

Page 31: Beyond ratings and followers (RecSys 2012)

Open to relocation ? Region similarity based on profiles or network Region transition probability

predict individuals propensity to migrate and most likely migration target

Impact on job recommendations 20% lift in

views/viewers/applications/applicants

Feature Engineering – Sticky Locations

Page 32: Beyond ratings and followers (RecSys 2012)

32

What should you transition to .. and when ?

Months since graduation

Prob

abili

ty o

f sw

itch

Page 33: Beyond ratings and followers (RecSys 2012)

rethinking delivery

Page 34: Beyond ratings and followers (RecSys 2012)

Social Referral

Page 35: Beyond ratings and followers (RecSys 2012)

Social Referral

Mohammad Amin, Baoshi Yan, Sripad Sriram, Anmol Bhasin, Christian Posse. Social Referral : Using network connections to deliver

recommendations. To appear in Proceedings of the Sixth ACM conference on Recommender systems (RecSys '12)

> 2X Conversion

Linkedin Group: Text Analytics

I found this group interesting, and I think you will too

Deepak

Linkedin Group: Text Analytics

From: Deepak Agarwal – Engineering Director, LinkedIn

2X conversion

Page 36: Beyond ratings and followers (RecSys 2012)

Big Data A/B is the

new

Page 37: Beyond ratings and followers (RecSys 2012)

Orthogonality in A/B

Page 38: Beyond ratings and followers (RecSys 2012)

383838

1. Novelty effect E.g., new job recommendation

algorithms have week-long novelty effect that shows lifts twice the stationary (real) one

2. Cannibalization Zero-sum game or real lift?

3. Random sampling destroys network effect

Beware of some A/B testing pitfalls

1 week lifts 2weeks lifts

Page 39: Beyond ratings and followers (RecSys 2012)

Tech Stack

Page 40: Beyond ratings and followers (RecSys 2012)

Open Source Technologies

ZoieBobo

KafkaVoldemort

40http://data.linkedin.com

Page 41: Beyond ratings and followers (RecSys 2012)

It takes a village

Page 42: Beyond ratings and followers (RecSys 2012)

Credits

Engineering : Abhishek Gupta, Adam Smyczek, Adil Aijaz, Alan Li, Baoshi Yan, Bee-Chung Chen, Deepak Agarwal, Ethan Zhang, Haishan Liu, Igor Perisic, Jonathan Traupman, Liang Zhang, Lokesh Bajaj, Mario Rodriguez, Mitul Tiwari, Mohammad Amin, Monica Rogati, Parul Jain, Paul Ogilvie, Sam Shah, Sanjay Dubey, Tarun Kumar, Trevor Walker, Utku Irmak

Product : Andrew Hill, Christian posse, Gyanda Sachdeva, Mike Grishaver, Parker Barrile, Sachit Kamat Alphabetically sorted

Page 43: Beyond ratings and followers (RecSys 2012)

You

Picture yourself with this New Job:

Applied Researcher /Research Engineer

A Recommendation for you..

Page 44: Beyond ratings and followers (RecSys 2012)

?Contact:

[email protected]

http://data.linkedin.com/


Top Related