this is your presentation (or slidedoc) title...announcements homework 7 grades have been released...

52
CSE/STAT 416 Ethics, Explainable ML Course Review Vinitra Swamy University of Washington Aug 17, 2020

Upload: others

Post on 10-Oct-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

CSE/STAT 416Ethics, Explainable MLCourse Review

Vinitra SwamyUniversity of WashingtonAug 17, 2020

Page 2: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Announcements ▪ Homework 7 grades have been released

▪ Congratulations to Team Otterhog! + creative team names- weak learner- machine teachers- indecisive tree classifiers- beff jezos- deep mind 2.0- confused matrix- ham in spam

▪ Final Exam is on Wednesday on Gradescope- Have scratch paper handy- Compile your notes / study materials- Work fast (if you’re getting stuck on a problem, skip it,

and come back to it later)

▪ Office Hours

2

Page 3: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

FairnessML Ethics

3

Page 4: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Assessing Accuracy

Always dig in and ask critical questions of your accuracy.

▪ Is there a class imbalance?

▪ How does it compare to a baseline approach?- Random guessing- Majority class- …

▪ Most important: What does my application need?- What’s good enough for user experience?- What is the impact of a mistake we make?

4

Page 5: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Which is Worse?

What’s worse, a false negative or a false positive?

▪ It entirely depends on your application!

In almost every case, how treat errors depends on your context.

5

Detecting Spam

False Negative: Annoying

False Positive: Email lost

Medical Diagnosis

False Negative: Disease not treated

False Positive: Wasteful treatment

Page 6: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

When ML goes wrong…

COMPAS, the algorithm used for recidivism prediction produces much higher false positive rate for African American people than Caucasian people. (Larson et al. ProPublica, 2016)

6

XING, a job platform, was found to rank less qualified male candidates higher than more qualified female candidates (Lahoti et al. 2018)

Page 7: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

5 sources of bias in ML

▪ Historical bias arises when the world, as it is, is biased- “CEO” Image Search

▪ Representation bias occurs when some groups of the population are underrepresented in the training data

- ImageNet: 45% US, 1% China

▪ Measurement bias arises when there are granularity issues with measuring features of interest.

- GPA -> Student Success

▪ Aggregation bias occurs when a one-size-fits-all model is used for groups that have different conditional distributions

- Hb1ac level for diabetes across different populations

▪ Evaluation bias arises when evaluation and/or benchmark datasets are not representative of the target population

- Facial Recognition datasets

7A FRAMEWORK FOR UNDERSTANDING UNINTENDED CONSEQUENCES OF MACHINE LEARNING, BY HARINI SURESH AND JOHN V. GUTTAG, 2019

Page 8: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Bias in the ML pipeline

8Solutions for mitigating bias need to be tailored to the source of the bias.

Page 9: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Fairness▪ Can we define fairness?

- Legal precedent

- Civil Rights Act of 1964, Title VII, Equal Employment Opportunities, 1964.

- Barocas and Selbst, 2016

- disparate treatment: decisions based on sensitive attributes

- disparate impact: outcomes disproportionately hurt or benefit people with certain sensitive attribute values

9

Page 10: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Fairness Frameworks

▪ No consensus on the mathematical definition of fairness

▪ Many papers have attempted to add frameworks- unawareness- demographic parity- equalized odds- predictive rate parity- individual fairness- counterfactual fairness

▪ Tradeoff between accuracy and fairness

10

Page 11: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Research directions in de-biasing ML

▪ Debiasing ML when you don’t have access to private data- [“race”, “gender”, “age”] are protected attributes- Let’s take advantage of the societal biases in names.

▪ Bias in word embeddings exist- Debiasing methods by zero-ing projection onto the

gender direction conceals the bias instead of removing it

11

WHAT’S IN A NAME? REDUCING BIAS IN BIOS WITHOUT ACCESS TO PROTECTED ATTRIBUTES, BY ALEXEY ROMANOV ET. AL

LIPSTICK ON A PIG: DEBIASING METHODS COVER UP SYSTEMATIC GENDER BIASES IN WORD EMBEDDINGS BUT DO NOT REMOVE THEM, BY HILA GONEN AND YOAV GOLDBERG

Page 12: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Other interesting ethics thoughts

▪ What effect does deep learning research have on the planet?- “training one model on GPU, with tuning and

experimentation, results in CO2 emissions that are comparable to the two-year carbon footprint of an average American”

- “cloud computing costs for developing a state-of-the-

art NLP model may account for $103–350K”

12

ENERGY AND POLICY CONSIDERATIONS FOR DEEP LEARNING IN NLP, BY EMMA STRUBELL, ANANYA GANESH, ANDREW MCCALLUM

Page 13: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Think

pollev.com/cse416

▪ Select the text sample that was generated by GPT-2.

13

Page 14: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Pair

pollev.com/cse416

14

▪ Select the text sample that was generated by GPT-2.

Page 15: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

ExplanationsInterpretable ML

15

Doge, our vacuum cleaner, got stuck. As an explanation for the accident, Doge told us that it needs to be on an even surface.

Page 16: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Interpretability ▪ What is an explanation?

▪ Interpretable Models

▪ Model-Agnostic Methods

▪ Example-Based Explanations

▪ Future of Interpretability

16

Page 17: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Explanations ▪ What Is an explanation?- An explanation is the answer to a why-question

(Miller 2017)- Why did the treatment not work on the patient?- Why was my loan rejected?- Why have we not been contacted by alien life yet?

▪ What is a “good” explanation?- Humanities research can help us out!- Good explanations are contrastive (Lipton 1990)- Good explanations are selected- Good explanations are context-driven- Good explanations focus on the abnormal- Good explanations are general and probable

17

Page 18: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Interpretable Models

▪ Linear Regression

▪ Logistic regression

▪ Decision Trees

▪ Naïve Bayes

▪ K Nearest Neighbors

(GLM, RuleFit, Decision Rules)

18

Page 19: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Model-Agnostic Methods:PDP

▪ The partial dependence plot (PDP plot) shows the marginal effect one or two features have on the predicted outcome of a machine learning model (J. H. Friedman 2001).

19

pro: easy to implementcon: avg. line hides heterogenous effects

Page 20: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Model-Agnostic Methods: Surrogate Models

▪ Global Surrogate Model- An interpretable model is trained to approximate the

prediction of a black box model.

X Y

▪ Local Surrogate Model

LIME (Riberio et. al, 2016)

20

pro: human-centric explanationscon: instability, hard to define “neighborhood”

pro: flexible, intuitivecon: draw conclusions about model, not about data -- can differ by subset of dataset

Page 21: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Model-Agnostic Methods: Shapley values

▪ Shapley value draws from game theory.

- Each feature value of the instance is a “player” in a game.

- The contribution of each player is measured by

adding/removing the player from all subsets of players.

- The Shapley Value for one player is the weighted sum of

all its contributions.

▪ If you add up the Shapley Values of all the features, plus the

base value, which is the prediction average, you will get the

exact prediction value.

21

pro: good explanations, fairly distributed con: lot of computing time, no prediction model

Page 22: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Example-based explanations

▪ Adversarial examples- small, intentional feature perturbations that cause a

machine learning model to make a false prediction

▪ Counterfactual examples- "If X had not occurred, Y would not have occurred“- describes the smallest change to the feature values that

changes the prediction to a predefined output

22

Page 23: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Future of Interpretability

ML Ethics

23

Page 24: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Brain Break

24

Page 25: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

CSE/STAT 416Course Wrap Up

Vinitra SwamyUniversity of WashingtonAug 17, 2020

Slides borrowed from Emily Fox

Page 26: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

One Slide▪ Regression▪ Overfitting▪ Training, test, generalization

error▪ Bias-Variance tradeoff▪ Ridge/LASSO regularization▪ Cross validation▪ Gradient Descent▪ Classification▪ Logistic Regression▪ Decision trees▪ Boosting▪ Precision and recall▪ Nearest-neighbor retrieval,

regression, and classification▪ Kernel regression▪ Locality Sensitive Hashing

▪ Dimensionality Reduction (PCA)

▪ K-means clustering▪ Hierarchical clustering▪ Supervised vs. unsupervised

learning▪ Recommender systems▪ Matrix Factorization (NMF)▪ Coordinate Descent▪ Neural Networks▪ Convolutional Neural Networks▪ Transfer learning

26

Page 27: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

27STAT/CSE 416: Intro to Machine Learning

TrainingData

Featureextraction

ML model

Qualitymetric

ML algorithm

ŵy

x ŷ

Page 28: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

One Slide▪ Regression▪ Overfitting▪ Training, test, and

generalization error▪ Bias-Variance tradeoff▪ Ridge, LASSO▪ Cross validation▪ Gradient descent▪ Classification▪ Logistic regression▪ Decision trees▪ Boosting▪ Precision and recall▪ Nearest-neighbor retrieval,

regression, and classification

▪ Kernel regression▪ Locality sensitive hashing▪ Dimensionality reduction,

PCA

▪ k-means clustering▪ Hierarchical clustering▪ Unsupervised v. supervised

learning▪ Recommender systems▪ Matrix factorization▪ Coordinate descent▪ Neural networks▪ Convolutional neural

networks▪ Transfer learning for deep

learning

28STAT/CSE 416: Intro to Machine Learning

Case Study 1:Predicting house prices

DataML

MethodIntelligence

$

$$

$ = ??

house size

pri

ce (

$)

+ house features

Regression

Page 29: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

29STAT/CSE 416: Intro to Machine Learning

RegressionCase study: Predicting house prices

• Linear regression

• Regularization: Ridge (L2), Lasso (L1)

Models

Including many features:

- Square feet

- # bathrooms

- # bedrooms

- Lot size

- Year built

- …

house size

pri

ce (

$)

y

x

house size

pri

ce (

$)

y

# bat

h

x[2]

x[1]

0 50000 100000 150000 200000

10

00

00

010

00

00

20

00

00

coefficient paths Ridge

co

eff

icie

nt

bedroomsbathroomssqft_livingsqft_lotfloorsyr_builtyr_renovatwaterfront

λ

coef

fici

ents

ŵj RSS(w) + λ||w||2

2

Page 30: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

30STAT/CSE 416: Intro to Machine Learning

RegressionCase study: Predicting house prices

• Gradient descentAlgorithms

ŵ

RSS(w0,w1) = ($house 1-[w0+w1sq.ft.house 1])2

+ ($house 2-[w0+w1sq.ft.house 2])2 + ($house 3-[w0+w1sq.ft.house 3])2 + … [include all houses]

Page 31: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

31STAT/CSE 416: Intro to Machine Learning

RegressionCase study: Predicting house prices

• Loss functions, bias-variance tradeoff, cross-validation, sparsity, overfitting, model selection

Concepts

Model complexity

Erro

rx

y

x

y

OVERFIT

Validation set

Training setTest set

fit ŵλ

test performance of ŵλ to select λ *

assess generaliza on error

of ŵλ*

1. Noise

2. Bias

3. Variance square feet (sq.ft.)

pri

ce (

$)

x

y

fw ̄

fw(true)

y

square feet (sq.ft.)

pri

ce (

$)

x

Page 32: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

32STAT/CSE 416: Intro to Machine Learning

Case Study 2:Sentiment analysis

DataML

MethodIntelligenceClassification

All reviews:

“awesome”

“aw

ful”

Score(x) > 0

Score(x) < 0

Sushi was awesome, the food was awesome, but the service was awful.

Page 33: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

33STAT/CSE 416: Intro to Machine Learning

ClassificationCase study: Analyzing sentiment

• Linear classifiers (logis c regression)

• Mul class classifiers

• Decision trees

• Boosted decision trees and random forests

Models

#awesome

#aw

ful

0 1 2 3 4 …

0

1

2

3

4

1.0 #awesome – 1.5 #awful = 0

Score(x) > 0

Score(x) < 0

Start

Credit?

Safe

excellent

Income?

poor

Term?

Risky Safe

fair

5 years3 years

Risky

Low

Term?

Risky Safe

high

5 years3 years

Page 34: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

34STAT/CSE 416: Intro to Machine Learning

ClassificationCase study: Analyzing sentiment

• Boosting

• Learning from weighted dataAlgorithms

weighted_error= 0.2

weighted_error= 0.35

weighted_error= 0.3

weighted_error= 0.4

Income>$100K?

Safe Risky

NoYes

Credit history?

Risky Safe

GoodBad

Savings>$100K?

Safe Risky

NoYes

Market conditions?

Risky Safe

GoodBad

Page 35: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

35STAT/CSE 416: Intro to Machine Learning

ClassificationCase study: Analyzing sentiment

• Decision boundaries, maximum likelihood estimation, ensemble methods, random forests

• Precision and recall

Concepts

ℓ(w

0,w

1,w

2)

recall

pre

cisi

on

0

1

1

Classifier ABest classifierClassifier B

Page 36: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

36STAT/CSE 416: Intro to Machine Learning

Case Study 3:Document retrieval

DataML

MethodIntelligence

Nearestneighbor

Page 37: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

37STAT/CSE 416: Intro to Machine Learning

Case Study 3+:Document structuring for retrieval

DataML

MethodIntelligenceClustering

SPORTS WORLD NEWS

ENTERTAINMENT SCIENCE

Page 38: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

38STAT/CSE 416: Intro to Machine Learning

Case Study 3++:Dimensionality reduction

DataML

MethodIntelligencePCA

[Saul & Roweis ‘03]

Images with thousands or

millions of pixels

Can we give each image a coordinate,

such that similar images are near each

other?

Page 39: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

39STAT/CSE 416: Intro to Machine Learning

Clustering & RetrievalCase study: Finding documents

• Nearest neighbors

• Clustering

• Hierarchical clusteringModels

query article

set of nearest neighbors

SPORTS WORLD NEWS

ENTERTAINMENT SCIENCE

Athletes Non-athletes

Wikipedia

Baseball Soccer/Ice hockey

Musicians, artists, actors

Scholars, politicians, government officials

Page 40: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

40STAT/CSE 416: Intro to Machine Learning

Clustering & RetrievalCase study: Finding documents

• k-means

• Locality-sensitive hashing (LSH)

• NN regression and classification

• Kernel regression

• Agglomerative and divisive clustering

• PCA

Algorithms

0 0.1 0.2 0.3 0.4 x0 0.6 0.7 0.8 0.9 1

1

0.5

0

0.5

1

1.5

f(x0)

Epanechnikov Kernel (lambda = 0.2)

$ = ???

y

house size

pri

ce (

$)

x$ = ???

Cluster distance

Data points

Page 41: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

41STAT/CSE 416: Intro to Machine Learning

Clustering & RetrievalCase study: Finding documents

• Distance metrics, kernels, approximation algorithms, dimensionality reduction

Concepts

1 0 0 0 5 3 0 0 1 0 0 0 0

3 0 0 0 2 0 0 1 0 1 0 0 0

1*3+5*2= 13

Feature 1

Feat

ure

2

θ

#awesome

#aw

ful

0 1 2 3 4 …01234… Line 1

Line 2

Line 3

Bin index:[0 0 0] Bin index:

[0 1 0]

Bin index:[1 1 0]

Bin index:[1 1 1]

Principal components:

Reconstructing:

Page 42: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

42STAT/CSE 416: Intro to Machine Learning

Case Study 4:Image classification

DataML

MethodIntelligenceDeep Learning

x1

x2

1

z1

z2

1

y

Layer 1 Layer 2

Page 43: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

43STAT/CSE 416: Intro to Machine Learning

Deep LearningCase study: Image classification

• Perceptron

• General neural network

• Convolutional neural networkModels

x[1]

x[2]

x[d]

Σ

1 w0

w1

w2

w d

v[1]

v[2]

x[1]

x[2]

1

y

1

Page 44: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

44STAT/CSE 416: Intro to Machine Learning

Deep LearningCase study: Image classification

Forward

Gradient / Backprop

• Convolutions

• Backpropagation (high level only)Algorithms

Page 45: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

45STAT/CSE 416: Intro to Machine Learning

• Activation functions, hidden layers, architecture choicesConcepts

Deep LearningCase study: Image classification

Page 46: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

46STAT/CSE 416: Intro to Machine Learning

Case Study 5:Product recommendation

DataML

MethodIntelligence

MatrixFactorization

Your past purchases:

+ purchase histories of all customers

Recommended items:Customers

features

features

features

Productsfeatures

features

features

Page 47: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

47STAT/CSE 416: Intro to Machine Learning

Recommender Systems & Matrix FactorizationCase study: Recommending Products

• Collaborative filtering

• Matrix factorizationModels

Xij known for black cells

Xij unknown for white cells

Rows index moviesColumns index users

X =Rating

Parameters of model

Page 48: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

48STAT/CSE 416: Intro to Machine Learning

Recommender Systems & Matrix FactorizationCase study: Recommending Products

• Coordinate descentAlgorithms

Xij known for black cells

Xij unknown for white cells

Rows index moviesColumns index users

X =RatingForm estimates

Lu and Rv

Page 49: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

49STAT/CSE 416: Intro to Machine Learning

Recommender Systems & Matrix FactorizationCase study: Recommending Products

• Matrix completion, cold-start problemConcepts

Xij known for black cells

Xij unknown for white cells

Rows index movies

Columns index users

X =

Cu

sto

mer

s

Products

Cu

sto

mer

s

Products

Page 50: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

50STAT/CSE 416: Intro to Machine Learning

TrainingData

Featureextraction

ML model

Qualitymetric

ML algorithm

ŵy

x ŷ

Page 51: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

Big Picture Improving the performance at some task through experience!

▪ Before you start any learning task, remember fundamental questions that will impact how you go about solving it

51

What is the learning problem? From what

experience?

What model?What loss function are you optimizing?

What consequences does your model have?

With what optimization algorithm?

Are there any guarantees?

How will you evaluate the model?

Page 52: THIS IS YOUR PRESENTATION (OR SLIDEDOC) TITLE...Announcements Homework 7 grades have been released Congratulations to Team Otterhog! + creative team names - weak learner - machine

52

YOU

Congrats on finishing CSE/STAT 416!Thanks for the hard work!