recommender trends 2014
DESCRIPTION
Coming right from the Recommender Systems conference in San Francisco, I present some latest developments in the field of large scale recommendation engines and machine learning.TRANSCRIPT
![Page 1: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/1.jpg)
@torbenbrodt #recsys
Recommender TrendsACM RecSys 2014Silicon Valley USA
Torben Brodtplista GmbH
inspired by ..StammtischNov 13th 2014
![Page 2: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/2.jpg)
@torbenbrodt #recsys
Silicon Valley
Image by New Media at the University of Maine
![Page 3: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/3.jpg)
@torbenbrodt #recsys
RecSys 2014 was ..
● 1 day workshop● 3 day tech conference (see )● 1 day conference
![Page 4: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/4.jpg)
@torbenbrodt #recsys
biased with my experience
● Head of Data Engineering● > 6y plista
○ News, advertising, real-time● Open!
DevOps MathCore
![Page 5: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/5.jpg)
@torbenbrodt #recsys
Contents
1. Product2. Algorithms3. Metrics4. Openness5. Crazy Stuff6. Missing
![Page 6: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/6.jpg)
@torbenbrodt #recsys
Product, ”Data Driven Decisions”
“We take a proposal for an original production or for a piece of content we’re going to buy and we plug in all the data we can abou tit into our models. We’re able to predict reach and hours for that piece of content even before it exists with reasonable precision in a way that helps us to say, ‘this is worth funding’ or ‘that’s not worth funding,’ ”
NEIL HUNT Netflix
![Page 7: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/7.jpg)
@torbenbrodt #recsys
Product, “Search & Recommendationshould (not?) converge”
HECTOR GARCIA-MOLINAProfessor, Stanford University
DEBORA DONATOStumbleUpon
![Page 8: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/8.jpg)
@torbenbrodt #recsys
Product, “Use Human Experts”
ERIC COLSONStitch Fix
Humans send you customized outfits. Machines suggest clothes and judge stuff.
![Page 9: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/9.jpg)
@torbenbrodt #recsys
Product, “Explain your knowledge”
● Xbox explains why their recommendations are utile
● Cortana builds ML model of user and still allows to change it
Build Trust!
![Page 10: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/10.jpg)
@torbenbrodt #recsys
Product, “Care about Privacy”
once you lose your customer because of privacy, you will never get him back
solutions● store user history on client side● ..
![Page 11: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/11.jpg)
@torbenbrodt #recsys
Product, ”Allow User Interaction”
HECTOR GARCIA-MOLINAProfessor, Computer Science and Electrical Engineering Departments of Stanford University
![Page 12: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/12.jpg)
@torbenbrodt #recsys
Product, “active learning”
Why do vague passive learning when you can ask the user?
.. implicitly or explicitly
http://en.wikipedia.org/wiki/Active_learning_(machine_learning)
SMRITI BHAGATTechnicolor
![Page 13: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/13.jpg)
@torbenbrodt #recsys
Algorithms
![Page 14: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/14.jpg)
@torbenbrodt #recsys
Algorithms, ”Matrix Factorization”
[...] faster by replacing inner product with PCA trees
NOAM KOENIGSTEINMicrosoft R&D
![Page 15: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/15.jpg)
@torbenbrodt #recsys
Algorithms, “Ensembles”
● Multi Armed Bandits● Ensemble Methods● Global Optimization
https://github.com/Yelp/MOE
![Page 16: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/16.jpg)
@torbenbrodt #recsys
Algorithms, “How does MOE work”
DR. SCOTT CLARKYelp
1. Build Gaussian Process (GP) with points sampled so far
2. Optimize covariance hyperparameters of GP3. Find point(s) of highest Expected Improvement
within parameter domain4. Return optimal next best point(s) to sample
https://github.com/Yelp/MOE
![Page 17: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/17.jpg)
@torbenbrodt #recsys
Algorithms, “Topic Modelling”
● LDA is standard● datascience tasks
○ where to cut○ how many topics
● where to use?
http://en.wikipedia.org/wiki/Topic_model
![Page 18: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/18.jpg)
@torbenbrodt #recsys
Algorithms, “Content”
● Sense identifiers (int) instead of keywords● Word sense disambiguation
![Page 19: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/19.jpg)
@torbenbrodt #recsys
Metrics, “Stakeholders”
● Business Value● Consumer Value● Conflicting goals?● Diversity?
NEIL HUNT Netflix
![Page 20: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/20.jpg)
@torbenbrodt #recsys
Metrics, “Dwell Time”
● Client Side implementation
● Yahoo ensures dwell-time is comparable across different context (device, etc)
● it correlates to clicks, but is more meaningful XING YI
Yahoo Labs
![Page 21: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/21.jpg)
@torbenbrodt #recsys
Metrics, “Increasing signals”
Get the full lifetime journey● reservation● rating● billing / tipping
JEREMY SCHIFFOpenTable
![Page 22: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/22.jpg)
@torbenbrodt #recsys
Openness, “Software Side”
Companies share software● credits to Twitter, Yelp, others
Finally Paper results can be reused (github)
![Page 23: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/23.jpg)
@torbenbrodt #recsys
Openness, “Data Side”
Wikipedia, DBPedia, common crawl
Companies share Data & Challenges● credits to Netflix, Tmall, Criteo
![Page 24: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/24.jpg)
@torbenbrodt #recsys
Openness, “Connectivity”
Everything is possible!To Me and to You
● Connect to Facebook○ access open graph
● Get Fulltext without 10k servers● Use Apache Mahout, Azure ML, etc
![Page 25: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/25.jpg)
@torbenbrodt #recsys
Openness, “Connectivity”
● Give students the chance to learn
● CoLaboratory Notebook
http://venturebeat.com/2014/08/08/google-whips-up-a-chrome-app-to-let-data-scientists-work-together/
![Page 26: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/26.jpg)
@torbenbrodt #recsys
Openness, “Connectivity”
● Azure Marketplace allows to exchange machine learning models
● RapidMiner makes workflows reproducable
https://datamarket.azure.com/browse/data
![Page 27: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/27.jpg)
@torbenbrodt #recsys
Crazy Stuff
Industry Sessions…● Facebook News● Shopkick● Stumble Upon● climate institute● ...
![Page 28: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/28.jpg)
@torbenbrodt #recsys
Crazy Stuff, “music genome project”
1 song = 450 musical characteristics from trained music analyst
ERIK M. SCHMIDTPandora
![Page 29: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/29.jpg)
@torbenbrodt #recsys
Crazy Stuff, “LinkedIn A/B testing”
● XLNT Platform● Key Component !● Continuous Deployment
YA XULinkedInhttps://engineering.linkedin.com/ab-testing/xlnt-
platform-driving-ab-testing-linkedin
![Page 30: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/30.jpg)
@torbenbrodt #recsys
Crazy Stuff, “Google Deep Learning”
● Application?○ Pixels, Audio, Searches,
Translation● Embeddings● Language Models● Scalability
JEFF DEANGoogle
![Page 31: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/31.jpg)
@torbenbrodt #recsys
Missing? “Uncovered Topics”
![Page 32: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/32.jpg)
@torbenbrodt #recsys
Missing, “Probabilistic Data Structures”
probabilistic counting, hyperLogLog, etc
http://research.neustar.biz/https://streamdrill.com/
![Page 33: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/33.jpg)
@torbenbrodt #recsys
Missing
Large Scale?● Computational Costs● Real-Time Recs
![Page 34: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/34.jpg)
@torbenbrodt #recsys
Questions?
Torben Brodtplista GmbH
![Page 35: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/35.jpg)
@torbenbrodt #recsys
● hard to convince mgmt (?!)● start measuring
example● coupons 1/week might
decrease revenueJEREMY SCHIFFOpenTable
Metrics, “Long Term Satisfaction”
![Page 36: Recommender Trends 2014](https://reader034.vdocuments.site/reader034/viewer/2022052601/558de52b1a28abe7428b46a6/html5/thumbnails/36.jpg)
@torbenbrodt #recsys
Resume, ”we enhance services”
Large Size Companies cannot exist without data science● Netflix● Zalando● etc