optimizing generalized rate metrics with three players · optimizing generalized rate metrics with...

34
Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Coer, Maya Gupta

Upload: others

Post on 14-Jun-2020

8 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Optimizing Generalized Rate Metrics with Three Players

Harikrishna Narasimhan, Andrew Cotter, Maya Gupta

Page 2: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Constrained Learning ProblemsGeneral evaluation metric

Complexpolicy / fairness goal

Page 3: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Constrained Learning Problems

Example: Fair Hiring

G1

G2

Page 4: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Constrained Learning ProblemsF-measure

Example: Fair Hiring

G1

G2

Page 5: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Constrained Learning ProblemsF-measure

Equal opportunity

Example: Fair Hiring

G1

G2

Page 6: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Constrained Learning ProblemsF-measure

Example: Fair Hiring

G1

G2

Equal precision

Page 7: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Constrained Learning ProblemsF-measure

Example: Fair Hiring

G1

G2

Equal precision

G-mean

Page 8: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Constrained Learning ProblemsF-measure

Example: Fair Hiring

G1

G2

Equal precision

Match distribution

G-mean

Page 9: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Constrained Learning ProblemsF-measure

Example: Fair Hiring

G1

G2

Equal precision

Match Distribution

G-mean

How does one design learning algorithms to handle general performance metrics and constraints?

Page 10: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Problem Setup

where there are K prediction rates

Page 11: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Problem Setup

where there are K prediction rates:

for some Expectations of counts E.g. false positive rate, coverage

Page 12: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Problem Setup

where there are K prediction rates:

for some Expectations of counts E.g. false positive rate, coverage

Non-continuous in 𝜃

Non-decomposable: not simple averages of pointwise errors

Page 13: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Prior Methods ● Solution 1: Use Surrogates‡

○ Relax indicators with continuous surrogates

‡Joachims, 15; Kar et al. 14; 16; N et al. 15; Goh et al. 16; Zafar et al. 17

Page 14: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Prior Methods ● Solution 1: Use Surrogates‡

○ Relax indicators with continuous surrogates

○ Relaxing constraints with surrogates may make the problem infeasible

○ Surrogates may output values outside the range of

‡Joachims, 15; Kar et al. 14; 16; N et al. 15; Goh et al. 16; Zafar et al. 17

Defined for [0, 1]

Page 15: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Prior Methods ● Solution 1: Use Surrogates‡

○ Relax indicators with continuous surrogates

○ Relaxing constraints with surrogates may make the problem infeasible

○ Surrogates may output values outside the range of

● Solution 2: Cost-weighted Minimization Oracle*

○ Sequence of weighted objectives and use an oracle to solve sub-problem. Strong oracle assumption!

Defined for [0, 1]

*Parambath et al. 14; Koyejo et al. 14; N et al. 15; Yan et al. 18; N 18; Alabi et al. 18

Page 16: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Prior Methods ● Solution 1: Use Surrogates‡

○ Relax indicators with continuous surrogates

○ Relaxing constraints with surrogates may make the problem infeasible

○ Surrogates may output values outside the range of

● Solution 2: Linear Minimization Oracle*

○ Sequence of weighted objectives and use an oracle to solve sub-problem

Defined for [0, 1]

*Parambath et al. 14; Koyejo et al. 14; N et al. 15; Yan et al. 18; N 18; Alabi et al. 18

This Paper

● General framework that recovers prior methods as special cases● Practical algorithms with minimal use of surrogates and tighter handling

of constraints

Page 17: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Min-max Game Formulation

Convex, monotonic

Unconstrained problem; same ideas apply with constraints

Page 18: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Min-max Game Formulation

Slack variables to decouple rates

Page 19: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Min-max Game Formulation

Lagrangian min-max problem

Page 20: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Three-player Viewpoint

● 𝝀-player: Linear

Page 21: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Three-player Viewpoint

● 𝝀-player: Linear● ξ-player: Convex

Page 22: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Three-player Viewpoint

● 𝝀-player: Linear● ξ-player: Convex● θ-player: Non-continuous due to indicators

Expectation of Indicator

Page 23: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Choosing Player Strategiesξ-player 𝝀-player θ-player

Best Response SGDwith Indicators

Oracle-based Alg.

(analytical solution)

Page 24: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Choosing Player Strategiesξ-player 𝝀-player θ-player

Best Response SGDwith Indicators

Best Response with Indicators

Cost-weighted minimization oracle

Oracle-based Alg.

(analytical solution)

Page 25: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Choosing Player Strategiesξ-player 𝝀-player θ-player

Best Response SGDwith Indicators

Best Response with Indicators

Best Response SGDwith Indicators

SGD with Surrogates

Idea: Use original objective for 𝝀 and surrogates for θExtends work of Cotter et al.‘19

Replace with convex surrogate

Oracle-based Alg.

Surrogate-based Alg.

Enables tighter handling of constraints

Page 26: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Players Find Equilibriumξ-player 𝝀-player θ-player

Best Response SGDwith Indicators

Best Response with Indicators

Best Response SGDwith Indicators

SGD with Surrogates

Oracle-based Alg.

Surrogate-based Alg.

● Iterative approach returns a stochastic classifier

Page 27: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Players Find Equilibriumξ-player 𝝀-player θ-player

Best Response SGDwith Indicators

Best Response with Indicators

Best Response SGDwith Indicators

SGD with Surrogates

Oracle-based Alg.

Surrogate-based Alg.

● Iterative approach returns a stochastic classifier○ Near-optimality & near-feasibility guarantees (convex )

Page 28: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Players Find Equilibriumξ-player 𝝀-player θ-player

Best Response SGDwith Indicators

Best Response with Indicators

Best Response SGDwith Indicators

SGD with Surrogates

Oracle-based Alg.

Surrogate-based Alg.

● Iterative approach returns a stochastic classifier○ Near-optimality & near-feasibility guarantees (convex )

Equilibrium

Mixed Nash

Mixed Coarse Correlated

Page 29: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Players Find Equilibriumξ-player 𝝀-player θ-player

Best Response SGDwith Indicators

Best Response with Indicators

Best Response SGDwith Indicators

SGD with Surrogates

Oracle-based Alg.

Surrogate-based Alg.

● Iterative approach returns a stochastic classifier○ Near-optimality & near-feasibility guarantees (convex )

● Surrogates: Weaker optimality guarantee with a smaller comparator class

Equilibrium

Mixed Nash

Mixed Coarse Correlated

Page 30: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Framework Generalizes Prior Methodsξ-player 𝝀-player θ-player

Best Response SGDwith Indicators

Best Response with Indicators

Best Response SGDwith Indicators

SGD with Surrogates

Oracle-based Alg.

Surrogate-based Alg.

Best Response SGDwith Surrogates

SGD with Surrogates

- FTLwith Surrogates

SGD with Surrogates

- FTL*

with IndicatorsBest Response

with Rates

SPADE [N et al. 15]

NEMSIS [Kar et al. 16]

Frank-Wolfe [N et al. 15]

*Uses result of Abernethy & Wang ‘19

Equilibrium

Mixed Nash

Mixed CoarseCorrelated

Pure Nash

Pure Nash

Mixed Nash

Do not handle constraints

Page 31: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Heuristic Algorithm for Non-convex

Heuristic Alg.

Non-convex : e.g. sums or differences of ratios

ξ-player 𝝀-player θ-player

Best Response SGDwith Indicators

Best Response with Indicators

Best Response SGDwith Indicators

SGD with Surrogates

SGD SGDwith Indicators

SGD with Surrogates

Equilibrium

Mixed Nash

Mixed CoarseCorrelated

-

Oracle-based Alg.

Surrogate-based Alg.

Page 32: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Experiment

Page 33: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Experiment

Law School(black / others)

Trades-off objective for constraints

Unconstrained Baselines

Page 34: Optimizing Generalized Rate Metrics with Three Players · Optimizing Generalized Rate Metrics with Three Players Harikrishna Narasimhan, Andrew Cotter, Maya Gupta. Constrained Learning

Poster: East Exhibition Hall B + C #51

Open-source Library: TensorFlow Constrained Optimization (TFCO)https://github.com/google-research/tensorflow_constrained_optimization